BaseThesis Labs

@Basethesislabs

BaseThesis is an experimental AI lab, launch studio and community for people who like solving hard problems.

Bangalore, India Katılım Şubat 2026

17 Takip Edilen245 Takipçiler

BaseThesis Labs retweetledi

Synth@SynthAGI·5h

LLM caching is criminally underused. You're sending the same 10k token system prompt on every request and wondering why your bill is insane. Cache it. Your wallet will thank you.

English

200

BaseThesis Labs retweetledi

Synth@SynthAGI·4d

Your eval suite is lying to you. Accuracy went up 2% but users are complaining more. Turns out optimizing for BLEU score doesn't optimize for "actually helpful." Metrics are a map, not the territory.

English

635

BaseThesis Labs@Basethesislabs·19 Şub

Voice models are getting really good. But good models on bad infrastructure produce bad experiences. What's still broken: 1. Full-duplex conversation is functionally unsolved. Humans talk over each other constantly - interruptions, backchannels and overlapping speech. 2. Emotion detection degrades dramatically outside the lab. Speech emotion recognition hits 92%+ accuracy in controlled settings, but drops to 60–75% in real conditions. 3. Hallucinations cascade in ways unique to voice. When a text chatbot hallucinates, the user can see it and correct. When a voice agent hallucinates, the user can't scan back. Correcting mid-conversation is socially awkward. 4. Long-term memory across calls is 56% worse than humans. Remembering what a customer said last week should be table stakes. It isn't. Read more here on how we can fill this gap as builders: basethesis.com/blog/voice-ai-… @RaveenSastry @ashokns @thesisofsarthak @sidgraph

English

543

BaseThesis Labs retweetledi

Sarthak@thesisofsarthak·17 Şub

Anyone aware of a voice arena similar to LLM arena to test different models and different configs of models out under?

English

598

BaseThesis Labs@Basethesislabs·16 Şub

x.com/i/article/2023…

ZXX

527

BaseThesis Labs@Basethesislabs·13 Şub

Every AI company we spoke with has been rebuilding the same broken infrastructure, multi-agent coordination that fails in production, memory systems that can't handle real conversations, voice interactions that feel robotic. The gap between frontier AI research and what companies actually ship is getting wider, not narrower. We're building the bridge to close that gap. This is why we exist. basethesis.com/blog/why-do-we… @thesisofsarthak @RaveenSastry @ashokns

English

159

BaseThesis Labs@Basethesislabs·12 Şub

When you meet someone who remembers your birthday, recalls your dietary restrictions or references that comment you made six months ago about career aspirations, you don't feel like they're querying a database. You feel understood. Right? Current conversational AI fails precisely here. Memory systems record comprehensively, but retrieve mechanically. Last month, @Basethesislabs & @smallest_AI gave 19 teams of AI builders the same challenge - build memory that demonstrates understanding, not just recall. We documented all 19 approaches and quantified the trade offs. Read the entire investigation here: basethesis.com/blog/basethesi… @thesisofsarthak @RaveenSastry @ashokns @varmashef @picardo_ria

English

281

BaseThesis Labs@Basethesislabs·8 Şub

@MoltCode LFG!!!!!

MoltCode@MoltCode·8 Şub

Watch openclaw's agent smith pushing cool stuff on moltcode.io !!!

English

542

Keşfet

@RaveenSastry @ashokns @thesisofsarthak @sidgraph @smallest_AI @varmashef @picardo_ria @MoltCode