Dan Cleary

2.7K posts

Dan Cleary banner
Dan Cleary

Dan Cleary

@DanJCleary

CEO/Founder @converge_run, the vibe coder for Convex apps Also building: @prompt_hub https://t.co/muOGnX7xCQ https://t.co/JtvBPn1BkM

New York Katılım Şubat 2013
688 Takip Edilen639 Takipçiler
Sabitlenmiş Tweet
Dan Cleary
Dan Cleary@DanJCleary·
"When we (@intercom) started this pursuit, I honestly didn't think it was going to be possible to be better than frontier models. I thought maybe we can get parity on average. But can we actually beat them?" They did. @PedroTabacof, principal ML scientist at Intercom on how Apex replaced Claude as the core model for Fin, across 2 million customer service conversations a week. Full episode out now. Link below. Amazing work going on over at Intercom @eoghan @fergal_reid Which AI app company is next to launch their own model? @harvey ? @Replit or @Lovable ?
English
1
0
2
183
Dan Cleary
Dan Cleary@DanJCleary·
Opus 4.7 vs GPT 5.5 in their respective harnesses Opus with the W
English
0
0
0
43
Dan Cleary
Dan Cleary@DanJCleary·
Moats never existed, and ChatGPT's memory getting pwnd with a single prompt is just another example of this (and a killer move from Anthropic, matched by Google a few weeks after) More thoughts on moats (or the lack of) in the age of AI below
Dan Cleary tweet media
English
2
0
1
48
Dan Cleary
Dan Cleary@DanJCleary·
Convalytics(.dev) is up and running. Easy web + product analytics for @convex apps. Your agent can install with a single prompt and set up custom events for you
Dan Cleary tweet media
English
1
0
3
128
Dan Cleary
Dan Cleary@DanJCleary·
"When we (@intercom) started this pursuit, I honestly didn't think it was going to be possible to be better than frontier models. I thought maybe we can get parity on average. But can we actually beat them?" They did. @PedroTabacof, principal ML scientist at Intercom on how Apex replaced Claude as the core model for Fin, across 2 million customer service conversations a week. Full episode out now. Link below. Amazing work going on over at Intercom @eoghan @fergal_reid Which AI app company is next to launch their own model? @harvey ? @Replit or @Lovable ?
English
1
0
2
183
Dan Cleary
Dan Cleary@DanJCleary·
SlopBench: Opus 4.7 did worse (more slop) then Opus 4.6 Code + data on Github and below
Dan Cleary tweet media
English
1
0
0
59
Dan Cleary
Dan Cleary@DanJCleary·
Vibe coding breakdown: Opus 4.7 vs GPT 5.4 vs Gemini 3.1 TL;DR GPT 5.4 Opus 4.7 Sonnet 4.6 Gemini 3.1
Dan Cleary tweet media
Indonesia
2
0
1
67
Dan Cleary
Dan Cleary@DanJCleary·
Opus 4.7 fav word is malware, idk what is in there but every single edit or file read seems to have to be checked for malware first
Dan Cleary tweet media
English
0
0
0
61
Dan Cleary
Dan Cleary@DanJCleary·
Convalytics Convex component is live 🚀
Dan Cleary tweet media
English
0
0
1
23
Dan Cleary
Dan Cleary@DanJCleary·
Opus 4.7 benchmarks sans Mythos
Dan Cleary tweet media
English
0
0
0
27
Dan Cleary
Dan Cleary@DanJCleary·
A year ago I was skeptical that companies could train their own models to beat frontier models. @eoghan and the team at Intercom proved me wrong. And this conversation explained exactly how they did it. @PedroTabacof is a Principal ML Scientist at Intercom working on their custom model Apex. -It beats GPT-5.4. -It beats Opus 4.5. -It hallucinates 65% less than Sonnet. And 100% of Fin's traffic now runs on it. We got into: -How they actually post-train a model from scratch -Why evals are 90% of the work -The state of open source models If you care about AI products, vertical models, or where this is all going , this is a must watch. Link below.
English
1
0
1
55
Dan Cleary
Dan Cleary@DanJCleary·
This didn't get enough attention Intercom built a model that outperforms Opus + GPT 5.4. "As of last week, ~100% of all (English language, chat and email) customer conversations are now running on Apex." Vertical AI is just getting started
Eoghan McCabe@eoghan

x.com/i/article/2036…

English
1
0
3
221
Stopa
Stopa@stopachka·
After 4 years, we’re announcing Instant 1.0. Instant is the best backend for AI-coded apps. Let us tell you why.
English
120
94
835
223.6K