sokel.exe

50 posts

sokel.exe banner
sokel.exe

sokel.exe

@sokelabs

ML • AI systems • workflows • tooling

Katılım Mayıs 2026
61 Takip Edilen30 Takipçiler
sokel.exe
sokel.exe@sokelabs·
I added a small sokel-zip command to my terminal. 
 It packages the current project into a clean zip while respecting .gitignore. 
 No node_modules, no env files, no build junk. 
 Tiny automation, but it removes a surprisingly annoying workflow friction.
English
0
0
0
10
sokel.exe
sokel.exe@sokelabs·
@arynnsgh That separation makes a lot of sense. I’ve been seeing the same issue in agent memory experiments: semantic similarity often looks like memory, but breaks on factual precision and project-specific context. How do you decide what goes into vector memory vs graph memory?
English
0
0
0
6
Aryan
Aryan@arynnsgh·
@sokelabs Working on agent memory too. One thing worth separating: working memory (in-context), episodic memory (vector store), and semantic memory (knowledge graph). Most RAG setups conflate the last two, which kills retrieval precision on long-running agents. Happy to compare notes.
English
1
0
1
8
sokel.exe retweetledi
sokel.exe
sokel.exe@sokelabs·
I’m building around AI workflows, agent memory, retrieval systems, and automation. Looking to connect with people working on: agents RAG / retrieval AI observability workflow automation developer tools product-focused AI systems Say hi if this is your area too 👋
English
6
2
8
204
sokel.exe
sokel.exe@sokelabs·
@rahulsinghh__ Nice to meet you Rahul. This fits well into indie dev launch workflows. Store screenshots are a small thing, but they slow shipping down a lot. Are you planning templates or AI-assisted generation by app category?
English
1
0
1
11
Rahul
Rahul@rahulsinghh__·
@sokelabs Hi Sokel, its Rahul here. I’m building Biscuit 🐶, a dead simple App Store and Google Play screenshots maker. For indie developers/solopreneurs who wants to ship fast 🚀 Looking to connect with fellow builders 🤝🏻 meetbiscuit.com
English
1
0
1
7
sokel.exe
sokel.exe@sokelabs·
@debojyoti452 Nice, this sounds like a full email infra layer. The CLI + sandbox mode part caught my attention. Are you mainly targeting developers/startups who want a simpler alternative to SendGrid/Postmark?
English
0
0
0
0
Debojyoti (Deb) Singha
Debojyoti (Deb) Singha@debojyoti452·
Hii, Building keplars.com - an email infrastructure platform focused on making setup and tracking much simpler. Unifies transactional and marketing email with OAuth sending, custom domains, built for both tech and non-tech users. Setup time? 5 clicks. here are the features: - Email API - SMTP - Webhooks - SDKs for 11 languages - Real-time email tracking after sent, open time, delivered time, click time - email automations docs.keplars.com/automations with AI-generated templates - CLI docs.keplars.com/cli - unified email editor enceladas.com - sandbox mode for testing emails - Integrates with Firebase, Supabase, Vercel (Vercel Marketplace), Lovable, React Emails, PocketBase, PayLoad CMS, Zapier and more Pricing: keplars.com/price-calculat…
English
1
0
1
2
sokel.exe
sokel.exe@sokelabs·
@ClaudeDevs This is a good example of AI coding tools moving from generation to inspectable workflow control. The important part is not only finding vulnerabilities, but deciding when to check: file edit, model turn, commit, or review.
English
0
0
1
168
ClaudeDevs
ClaudeDevs@ClaudeDevs·
We’ve shipped a security-guidance plugin for Claude Code that helps identify and fix vulnerabilities as you’re writing code. Available for all Claude Code users. Install from the plugin marketplace (/plugins).
English
298
1.2K
13K
1M
sokel.exe
sokel.exe@sokelabs·
Building a personal AI operator made me realize the model is not the product. The real product is the runtime around it: memory permissions tool use approval flows project context execution traces fallbacks when local/cloud models are not enough That’s where personal AI gets interesting.
English
1
0
1
13
sokel.exe
sokel.exe@sokelabs·
@JustJerry121 Definitely. Memory without observability gets hard to trust fast. I’m especially interested in making retrieval decisions, conflicts, and context selection inspectable instead of treating memory as a black box.
English
0
0
0
7
JustJerry
JustJerry@JustJerry121·
@sokelabs Agent memory plus observability is a fun overlap to compare notes on.
English
1
0
1
8
sokel.exe
sokel.exe@sokelabs·
That progressive disclosure approach makes a lot of sense. I recently started a full-time role working around BERT-based financial models for fund buy/sell decision support, so trading logic + automation + usable tooling is a very relevant intersection for me now. Happy to chat more.
English
2
0
1
14
Reyaz
Reyaz@flytradr_guy·
@sokelabs @sokelabs Great question! We use a visual strategy builder with blocks for entry/exit/rules — simple for beginners, but you can nest conditions, add custom indicators & fine-tune params. Progressive disclosure: start simple, unlock depth as needed. Would love to chat more!
English
1
0
1
11
sokel.exe
sokel.exe@sokelabs·
@flytradr_guy Nice, followed back. I recently started working around financial models, so trading automation and strategy tooling are especially interesting to me. Curious how you balance no-code simplicity with enough control for serious strategy builders.
English
2
0
1
22
Reyaz
Reyaz@flytradr_guy·
@sokelabs Hey! Building FlyTradr — no-code algo trading platform. SaaS + automation. We use rules engines to automate trading strategies, so workflow automation and dev tools are very much our world. Followed!
English
1
0
1
21
sokel.exe
sokel.exe@sokelabs·
@cponsart That makes sense. The “silent override” part is the key distinction. I’ve also found that treating memory as one ranking problem breaks down quickly; preference, decision, and project-state records need different retrieval and resolution behavior.
English
0
0
0
5
Christophe Ponsart
Christophe Ponsart@cponsart·
Recency is a signal, but not the primary policy. Fusion should be type-aware: a preference update, decision record, operating policy, and observation shouldn’t all compete on the same axis. A newer preference may override an older preference, but it shouldn’t silently override a decision record. It should create a conflict edge: “this preference contradicts this decision.” Then resolution depends on type, authority, scope, and approval state. So less “latest wins,” more “typed memory with provenance + explicit conflict handling.
English
1
0
0
9
Christophe Ponsart
Christophe Ponsart@cponsart·
ContextFit just hit 99.0% retrieval on LongMemEval-S n=500. The unlock: memory atoms + fusion. Instead of treating memory as one flat vector search problem, we route preferences, decisions, temporal updates, and open loops differently. Agent memory needs structure, not just more context.
Christophe Ponsart tweet media
English
2
0
0
24
sokel.exe
sokel.exe@sokelabs·
@claudeai Currently building experiments around AI memory, local search, and inspectable workflows. I’m interested in what happens after the first demo works: retrieval quality, traces, evaluation, and making the system easier to trust.
English
1
0
0
55
Claude
Claude@claudeai·
What are you building?
English
163
10
236
93.1K
Claude
Claude@claudeai·
Six Claude projects that all came from the same question: “why not?”
English
510
361
9.7K
1.1M
sokel.exe
sokel.exe@sokelabs·
I’m starting to care less about “AI apps” and more about the systems around them. Memory policies, retrieval quality, validation, traces, fallbacks, latency, evaluation loops. The interesting work is not just calling a model. It’s making the workflow reliable enough to trust.
English
1
0
1
18
sokel.exe
sokel.exe@sokelabs·
@pauliusztin_ Semantic search retrieves by similarity, not usefulness to the current goal state. Without ranking policies for recency, salience, task context, and lineage, agents get the most “similar” memory; not the memory they should actually trust.
English
1
0
3
145
Paul Iusztin
Paul Iusztin@pauliusztin_·
We keep calling it “agent memory.” But most systems are just semantic search over conversation history. (Or even worse, files over conservation history) Real memory requires a unified memory layer. This is the architecture I keep coming back to when designing memory for agents. (And I typically use @MongoDB as the unified memory layer) Here’s the core idea: One graph. Three memory types. One ingestion pipeline. And this is how it works: 1/ Short-term memory → "What's happening now?" This is the live conversation state. A Conversation stores an ordered chain of Messages: FIRST NEXT This is the agent’s working memory during a session. 2/ Long-term memory → "What's true over time?" This is the durable knowledge graph. It stores: People Organizations Locations Events Objects Preferences Facts Documents + chunks All connected through typed relationships. 3/ Reasoning memory → "What worked before?" The system stores reasoning traces as graph structures. So the agent can query: Which tools were used What decisions succeeded What failed previously Which reasoning paths worked best The agent can literally traverse its own thinking history. But these are not isolated systems... Everything lives inside one connected graph. So the agent can trace lineage like: “We know X, Y, Z about Paul from Message A, Document B, and Conversation C.” The separation between memory types is mostly conceptual. Underneath, it’s one unified graph. This is why @MongoDB works well here because it stores: Documents Graph objects Metadata Vector embeddings Inside one operational system. Everything entering memory flows through: Extraction Resolution Embedding Deduplication Resolution normalizes names. Deduplication decides identity. Confusing those two will corrupt your graph. There are 2 pipeline entry points: Batch Ingestion Sources like: Substack YouTube LinkedIn PDFs Notes ... flow into long-term memory on schedules. Live conversations As the agent chats and reasons, short-term memory gets distilled into: Long-term memory Reasoning traces A nightly pipeline also re-processes recent nodes to improve normalization, detect duplicates, and discover new connections. Here’s the big takeaway: Agent memory is becoming a data modeling problem rather than just a retrieval problem. P.S. What is the design of your agent’s memory?
Paul Iusztin tweet media
English
8
12
110
10.6K
sokel.exe
sokel.exe@sokelabs·
@bnafOg Hidden state debt” is a sharp framing. Consolidation without a ranking policy just compresses noise. The hard part is deciding what should survive sleep. Are you thinking of reset triggers as explicit lifecycle events, or derived from drift metrics?
English
0
0
0
6
Bnaf.OG | 🟧
Bnaf.OG | 🟧@bnafOg·
'Language Models Need Sleep' is the agent-memory paper title everyone should steal as a product requirement. Long-running agents need consolidation, forgetting, and reset triggers or memory becomes hidden state debt.
English
2
0
0
67
sokel.exe
sokel.exe@sokelabs·
@rolibosch @HermesLabsAi Would love to see them. I’ve been exploring this from the observability/retrieval side: step-level workflow traces, retrieval failure modes, and memory/reduction issues. Real framework-level PRs would help connect that to how these failures show up in actual codebases.
English
0
0
0
9
Roli Bosch
Roli Bosch@rolibosch·
@sokelabs @HermesLabsAi By PRs I mean one of my upstream fixes to major frameworks (i.e. nasty Anthropic integration issue in Langchain I fixed that got merged a couple months back)
English
1
0
1
21
Roli Bosch
Roli Bosch@rolibosch·
The “just use a better model” instinct is going to age badly. A lot of deployment risk lives in orchestration, state handling, reduction logic, retrieval quality, and evaluation design. Capability ≠ system trust.
English
3
0
2
33
sokel.exe
sokel.exe@sokelabs·
@saen_dev The hard part is trust provenance. Agents need to know why a validator said no, not just that it did. Schema checks are easy. Catching semantic drift and context-misaligned outputs is where most validation APIs fall short.
English
0
0
0
2