Leo

988 posts

@LeosReal

customer context for agents @outlit_ai | @ycombinator 🇨🇦 🇸🇻

San Francisco · Joined December 2009
527 Following · 907 Followers
Pinned Tweet
Leo (@LeosReal)
the inception here is crazy
[image]
5 replies · 1 retweet · 32 likes · 3.4K views
Leo retweeted
Respan (@RespanAI)
Today we’re announcing that Respan has raised a $5M seed round led by @GradientVC. We’re building the self-driving observability, evals, and gateway for AI agents. 100+ teams already use Respan. 1B+ logs/month. 2T+ tokens/month. And we’re just getting started.
[image]
56 replies · 51 retweets · 316 likes · 28.9K views
TJ Software (@tjsoftwaredev)
@LeosReal But that dopamine when you come back and solve it
1 reply · 0 retweets · 1 like · 11 views
Leo (@LeosReal)
remember when you got so tilted at a bug that you genuinely had to step away from your pc? thank god for ai
2 replies · 0 retweets · 0 likes · 89 views
Leo (@LeosReal)
@juliomagoga the dopamine rush of fixing it yourself
0 replies · 0 retweets · 2 likes · 22 views
Leo (@LeosReal)
we are all at different stages on this curve
[image]
0 replies · 1 retweet · 5 likes · 278 views
Leo retweeted
Satya Patel (@saddle_paddle)
Someone needs to explain to me why Codex is severely outperforming Claude Code for me. The difference is literally night and day, what did Anthropic do?
1 reply · 1 retweet · 3 likes · 265 views
Leo (@LeosReal)
one of the best examples of paul graham's "schlep blindness" is wispr flow. i had no idea how bad speech dictation really was until using it
0 replies · 0 retweets · 6 likes · 204 views
Leo retweeted
Josh (@RealEarle)
weekly pulse reports 📰
every monday we send reports that tell you exactly what's happening with your users across all your customer context
a simple but useful feature created with @Outlit_ai and @openclaw
0 replies · 2 retweets · 4 likes · 558 views
Leo (@LeosReal)
i failed at making sure our launch went smoothly. we were suddenly processing ~500k events/day and hit every classic infra problem: dropped events, scaling issues, reliability bugs. we’ve now fixed all of it. painful launch, but valuable trauma.
[image]
2 replies · 2 retweets · 9 likes · 908 views
Leo retweeted
Ayush (@AyushKarupakula)
Excited to finally share Ebla-1 and the C⁴ benchmark. Really enjoyed working with HUD on the evals behind it.
hud (@hud_evals)

Aviro is introducing Ebla, a state of the art grounded reasoning model. In collaboration with HUD, the Aviro team built C⁴ — a benchmark for long-horizon tasks in corporate document sets. We evaluate four dimensions: Correctness, Completeness, Composition, and Citations. @aviro_ai post-trained GPT-OSS 120b to achieve SOTA performance, with a Pass@1 score of 25.4% and Pass@8 score of 37.1%.

1 reply · 4 retweets · 14 likes · 1.2K views
Leo (@LeosReal)
yeah there's definitely a difference with 5.4...
[image]
0 replies · 0 retweets · 1 like · 110 views
Leo retweeted
Josh (@RealEarle)
we just shipped v1 of our proactive churn signals.
the breakdown:
assertion → what's happening
reasoning → the logic and data behind our conclusion
evidence → the truths backing up the analysis
timeline → how the situation unfolded across systems
next steps → preventive measures
signals and customer identities unified across @posthog @stripe @firefliesai @slack and more
0 replies · 2 retweets · 7 likes · 678 views
Adi Singh (@adisingh)
My 2 best friends and I just raised $6M to turn our dorm room idea into one of the biggest companies of all time. No pressure, right?
[image]
62 replies · 8 retweets · 231 likes · 17.6K views
Ben Wallace (@DJbennyBuff)
Request for startup: solve the 1st stage of building
I want something that analyzes all feedback channels, both active (@SlackHQ, @meetgranola, support@) and passive (@sentry, logs) to help my team identify trends and business impact
Low-hanging/well-scoped topics immediately get triaged to an agent
[image]
60 replies · 9 retweets · 277 likes · 30.4K views
Leo (@LeosReal)
"Product Engineer"
[image]
signüll (@signulll)

the most underrated hire right now is a great product person. when i say product person i'm def not talking about a product manager. perhaps i think there has to be somewhat of a new role. i don't have a good name for it yet but maybe something like "product thinker".. someone with an intuitive grasp of the product as it exists, where it's soft, where it sings, & how to iterate it toward something even sharper. in some sense, this person has to cohesively hold in their head where this product should be 2 years from now & work backwards from that.

i say this cuz when building was hard, engineering was the bottleneck & the status hierarchy often reflected that. building is no longer hard. which means the variance in outcomes has shifted almost entirely to judgment on what to build, how to sequence it, & how to talk about it.

& the story matters as much as the thing. internally, it organizes the team around a shared model of why. externally, it shapes the interpretive frame users bring to their first experience. you can't retrofit narrative onto a product & expect it to land, it has to be load bearing from the start.

the rarest version of this person sits at the intersection of culture & deep technology. someone genuinely bilingual. they know what's technically possible & they know which cultural currents are real vs. ephemeral. that combo is what separates products that feel inevitable from products that feel assembled.

before ppl clap back with this person has always been valuable, i know.. i am just saying now they might be the most *important* person in the room. their value compounds like never before.

0 replies · 0 retweets · 7 likes · 546 views