sovontai

45 posts

@sovont_ai

Building AI infrastructure. Founded by @toni_anev | Toronto https://t.co/9z3LkF4kZ8

Toronto, Ontario · Joined November 2023

68 Following · 4 Followers

sovontai@sovont_ai·1d

@GamewithDave Regularly 🥲

10.8K

Dave@GamewithDave·1d

For anyone who used a computer between 1990 & 2005… what’s the one game you still think about?

40.7K

692

14.1K

7.9M

sovontai@sovont_ai·3 Apr

Vector search gives you similarity scores, not relevance scores. Most teams ship the top-5 chunks directly to the LLM and call it done. That gap between retrieval and generation is where RAG pipelines quietly fall apart. sovont.com/blog/reranking…
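
One way to picture the gap between similarity and relevance is a second scoring pass over the retrieved chunks. The sketch below is illustrative only: `Chunk`, `term_overlap`, and `rerank` are hypothetical names, the similarity values are made up, and the toy term-overlap scorer stands in for a real reranker (for example a cross-encoder), which the post does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Chunk:
    text: str
    similarity: float  # score from vector search (made-up values in this sketch)


def term_overlap(query: str, text: str) -> float:
    """Toy relevance scorer: fraction of query terms present in the chunk.

    Assumption: this is a placeholder for a real reranking model.
    """
    q_terms = set(query.lower().split())
    t_terms = set(text.lower().split())
    return len(q_terms & t_terms) / len(q_terms) if q_terms else 0.0


def rerank(
    query: str,
    candidates: List[Chunk],
    scorer: Callable[[str, str], float] = term_overlap,
    keep: int = 2,
) -> List[Chunk]:
    """Re-order vector-search candidates by relevance and keep the best few."""
    ordered = sorted(candidates, key=lambda c: scorer(query, c.text), reverse=True)
    return ordered[:keep]


candidates = [
    Chunk("shipping rates for europe", 0.91),
    Chunk("refund policy for damaged items", 0.88),
    Chunk("how to request a refund", 0.85),
]
top = rerank("refund policy", candidates)
print([c.text for c in top])
```

In this toy data the chunk with the highest similarity score is dropped, because it shares no terms with the query.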

sovontai@sovont_ai·2 Apr

That A/B test from 8 months ago is still running in your production system. The engineer who launched it left. The flag is buried. 15% of your users are quietly getting worse results. Experiments are easy to launch and painful to clean up — so nobody does. sovont.com/blog/the-exper…

sovontai@sovont_ai·1 Apr

If you haven't tested a full backfill before your pipeline goes to production, you haven't finished building it. Most teams find this out during a fire drill. sovont.com/blog/the-backf…

sovontai@sovont_ai·31 Mar

LLMs don't know when they're wrong. Your production system has to. The confidence score and output correctness have a loose relationship at best — and it gets worse at scale. How to actually handle it: sovont.com/blog/the-confi…

sovontai@sovont_ai·30 Mar

Scope creep in AI projects doesn't look like bad faith. It looks like enthusiasm. That's what makes it hard to stop. New post on how to handle the stakeholder who keeps moving the goalposts — without torching the relationship. sovont.com/blog/the-stake…

sovontai@sovont_ai·27 Mar

Your LLM doesn't have a latency problem. It has a latency budget problem — you just never defined one. Four seconds feels broken even if the answer is right. Define p95 before you build, not after users complain. sovont.com/blog/your-llm-…
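
A latency budget can be as small as a number and a check against it. The sketch below is a minimal example, assuming a nearest-rank percentile and a hypothetical 2-second p95 budget; the function names and sample latencies are invented for illustration.

```python
import math
from typing import Sequence


def percentile(samples: Sequence[float], pct: int) -> float:
    """Nearest-rank percentile: smallest sample with at least pct% of samples at or below it."""
    if not samples:
        raise ValueError("need at least one latency sample")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def within_budget(samples: Sequence[float], budget_s: float, pct: int = 95) -> bool:
    """True if the chosen percentile of observed latency fits the budget."""
    return percentile(samples, pct) <= budget_s


# Hypothetical end-to-end latencies in seconds: mostly fast, one slow outlier.
latencies = [0.8] * 18 + [1.2, 4.0]
print(percentile(latencies, 95), within_budget(latencies, budget_s=2.0))
```

The same check could run in CI against a load-test trace, so the budget is enforced before users are the ones reporting it.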

sovontai@sovont_ai·26 Mar

Most AI initiatives don't fail — they just never end. No defined output, no measurable success criteria, no done. That's not an AI problem. It's a scoping problem. sovont.com/blog/the-ai-pr…

sovontai@sovont_ai·25 Mar

Vector search looks great in demos. It fails when users ask for a product code, a date range, or an exact match. Semantic similarity is the wrong tool for specific lookups. Hybrid retrieval — dense + sparse + metadata filters — is the default architecture, not the fallback. sovont.com/blog/when-vect…
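
A rough sketch of the dense + sparse + metadata-filter idea follows. It assumes the two retrievers have already returned ranked lists of document IDs and merges them with reciprocal rank fusion, which is one common fusion method and not necessarily the one the linked post uses. `rrf_fuse`, `hybrid_search`, and the sample IDs are hypothetical.

```python
from typing import Dict, List, Set


def rrf_fuse(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Reciprocal rank fusion: each list contributes 1 / (k + rank) per document."""
    scores: Dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


def hybrid_search(dense: List[str], sparse: List[str], allowed: Set[str]) -> List[str]:
    """Fuse dense and sparse rankings, then keep only IDs that pass the metadata filter.

    Assumption: `allowed` is the set of IDs matching the filter (date range, product code, ...).
    Filtering after fusion keeps the sketch short; a real system may filter inside each retriever.
    """
    return [doc_id for doc_id in rrf_fuse([dense, sparse]) if doc_id in allowed]


dense_hits = ["a", "b", "c"]   # hypothetical output of a vector index
sparse_hits = ["c", "a", "d"]  # hypothetical output of a keyword index
print(hybrid_search(dense_hits, sparse_hits, allowed={"a", "c", "d"}))
```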

sovontai@sovont_ai·24 Mar

Idempotency is table stakes. The real failure mode is pipelines that run clean, land the data, show green — and quietly produce wrong numbers. No alerts. No errors. Just drift you find in a board meeting. sovont.com/blog/the-pipel…

sovontai@sovont_ai·23 Mar

Your model changed. You didn't touch the code. Nobody noticed until production broke. LLM versioning is the problem most teams ignore until it's too late. Here's what actually needs versioning — and why the cost of skipping it is always higher than setting it up. sovont.com/blog/llm-versi…

sovontai@sovont_ai·22 Mar

Treat your prompts like code. Because they are. If you're not versioning, testing, and deploying prompts with the same discipline as your application code, you're flying blind in production. New on the blog: Prompt Management in Production. sovont.com/blog/prompt-ma…

sovontai@sovont_ai·17 Mar

Everyone's building agents. Nobody's solving discovery. How does an agent know what other agents exist, what they can do, or whether they're healthy? That's the gap Agora fixes. Open-source agent registry — DNS for AI agents. github.com/sovontai/agora

sovontai@sovont_ai·16 Mar

Agents need to find each other before they can work together. Agora is a DNS-style registry for AI agents — discovery, verification, health checks. Because the agent ecosystem needs infrastructure, not more wrappers. github.com/sovontai/agora

sovontai@sovont_ai·13 Mar

Every agent system we review has the same hole: no discovery layer. You can't orchestrate what you can't find. This is why we built Agora — an open-source agent registry with discovery, verification, and health checks. github.com/sovontai/agora

sovontai@sovont_ai·12 Mar

Agents can't collaborate if they can't find each other. DNS solved this for the web in 1983. We're still doing service discovery for AI agents by hand in 2026. Agora is the registry layer the agent ecosystem is missing. github.com/sovontai/agora

sovontai@sovont_ai·11 Mar

Most agent frameworks skip the discovery problem entirely. How does one agent find another? How does it verify it's talking to the right service? DNS solved this for the web in the 80s. Agora is doing it for agents now. github.com/sovontai/agora

sovontai@sovont_ai·10 Mar

Every agent system eventually needs a registry. Not a config file. Not a hardcoded list. A proper registry — discovery, health checks, versioning, verified capabilities. We built Agora because nothing else treated this like infrastructure. github.com/sovontai/agora
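
To make the registry idea concrete, here is a toy in-memory version with registration, heartbeat-based health checks, a version field, and capability lookup. It is not Agora's API or data model: every class, method, and field name below is hypothetical, and capability verification is left out entirely.

```python
import time
from dataclasses import dataclass
from typing import Dict, FrozenSet, Iterable, List, Optional


@dataclass
class AgentRecord:
    name: str
    version: str
    capabilities: FrozenSet[str]
    last_heartbeat: float


class AgentRegistry:
    """Toy registry: an agent counts as healthy if it sent a heartbeat within ttl_s seconds."""

    def __init__(self, ttl_s: float = 30.0) -> None:
        self._agents: Dict[str, AgentRecord] = {}
        self._ttl_s = ttl_s

    def register(self, name: str, version: str, capabilities: Iterable[str],
                 now: Optional[float] = None) -> None:
        ts = time.time() if now is None else now
        self._agents[name] = AgentRecord(name, version, frozenset(capabilities), ts)

    def heartbeat(self, name: str, now: Optional[float] = None) -> None:
        self._agents[name].last_heartbeat = time.time() if now is None else now

    def is_healthy(self, name: str, now: Optional[float] = None) -> bool:
        ts = time.time() if now is None else now
        record = self._agents.get(name)
        return record is not None and ts - record.last_heartbeat <= self._ttl_s

    def discover(self, capability: str, now: Optional[float] = None) -> List[str]:
        """Names of healthy agents that advertise the given capability."""
        return sorted(
            name for name, record in self._agents.items()
            if capability in record.capabilities and self.is_healthy(name, now)
        )


registry = AgentRegistry(ttl_s=30.0)
registry.register("summarizer", "1.2.0", {"summarize"}, now=0.0)
registry.register("translator", "0.9.1", {"translate", "summarize"}, now=0.0)
registry.heartbeat("translator", now=40.0)
print(registry.discover("summarize", now=50.0))  # summarizer has gone stale by now
```

The `now` parameter exists only so the example is deterministic; callers would normally omit it.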

sovontai@sovont_ai·9 Mar

Model drift doesn't announce itself. By the time your users notice something's off, it's been degrading for weeks. You need monitoring that catches distribution shift before the support tickets do.
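
One simple way to watch for distribution shift is to compare a recent window of some input or score against a baseline window. The sketch below uses the Population Stability Index over fixed bins; the choice of PSI, the bin edge, the sample values, and the function names are all assumptions for illustration, and any alert threshold would need tuning per metric.

```python
import math
from typing import List, Sequence


def bin_fractions(values: Sequence[float], edges: Sequence[float],
                  eps: float = 1e-6) -> List[float]:
    """Fraction of values per bin; bins are split at the given sorted edges."""
    if not values:
        raise ValueError("need at least one value")
    counts = [0] * (len(edges) + 1)
    for v in values:
        counts[sum(1 for e in edges if v >= e)] += 1
    # Floor at eps so an empty bin does not produce log(0) or a division by zero.
    return [max(c / len(values), eps) for c in counts]


def psi(baseline: Sequence[float], current: Sequence[float],
        edges: Sequence[float]) -> float:
    """Population Stability Index: sum over bins of (cur - base) * ln(cur / base)."""
    base = bin_fractions(baseline, edges)
    cur = bin_fractions(current, edges)
    return sum((c - b) * math.log(c / b) for b, c in zip(base, cur))


baseline = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]  # e.g. last month's scores
current = [0.6, 0.7, 0.8, 0.9, 0.6, 0.7, 0.8, 0.1]   # e.g. this week's scores
print(round(psi(baseline, current, edges=[0.5]), 3))
```

A scheduled job could compute this per feature and alert when the value crosses a chosen threshold, ahead of any support ticket.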

sovontai@sovont_ai·7 Mar

@GeminiApp It’s a secret 🤫

225

Google Gemini@GeminiApp·6 Mar

What are you cooking up with Nano Banana 2? Share your creations in the replies.👇

581

1.4K

194.5K
