Caleb Kinmon

38 posts

@caleb_kinmon

Building autonomous learning technologies @ FIRM AI

Manhattan, NY · Joined September 2025
121 Following · 22 Followers
Caleb Kinmon @caleb_kinmon
@theo It also makes building stronger, more durable systems easier… if you know what you’re doing, aren’t blindly vibe coding, and your company actually cares about it!
0
0
1
127
signüll @signulll
in the ai era, if your “roadmap” is > 30 days, you’re simply ngmi.
112
92
1.4K
147.1K
Caleb Kinmon @caleb_kinmon
@theo Friendly advice: if you’re building something real, do not upload a production folder and let them copy everything to their servers. Be smart.
0
0
3
654
Theo - t3.gg @theo
Really liking Claude Design so far. Except for the fact that it just wiped out my project after burning 10% of my usage. "The files appear to be gone" 🙃
132
17
1.4K
345.8K
Caleb Kinmon @caleb_kinmon
@wesbos I guarantee you they didn’t produce this with Claude Design
0
0
0
1.9K
Caleb Kinmon @caleb_kinmon
@sergey_science @bcherny @firstadopter I’m working on a low-level realtime service with multimodal input and output streaming and several offline pipelines. Yesterday it was unusable. Today it has improved, but it is nowhere near 4.5 capabilities.
0
0
1
13
tae kim @firstadopter
Anthropic running out of compute is hurting their brand among customers. Honestly? We're paying customers. We deserve the service (and reliability uptime!) we paid for.
21
8
241
68.4K
Caleb Kinmon @caleb_kinmon
@joshcrnls Hype cycle, my man. The same thing happened in the late 90s and early 2000s with the internet. Software will change and some companies will survive, but there will be a significant correction in this market.
0
0
1
769
Josh @joshcrnls
can someone explain to me why everyone thinks an ai lab made up entirely of engineers is going to build a better design product than a company who has lived and breathed the needs of designers for a decade
Claude @claudeai

Introducing Claude Design by Anthropic Labs: make prototypes, slides, and one-pagers by talking to Claude. Powered by Claude Opus 4.7, our most capable vision model. Available in research preview on the Pro, Max, Team, and Enterprise plans, rolling out throughout the day.

516
68
2.4K
553.2K
Caleb Kinmon @caleb_kinmon
@asaio87 No, it’s because they put real effort into an SLA and durability, because the captain steering the ship isn’t an applied scientist.
0
0
2
211
andrei saioc @asaio87
Sam Altman says Codex has 100% uptime. Is it because nobody is using Codex and people are using Claude instead?
15
0
27
4.9K
Caleb Kinmon @caleb_kinmon
@Shpigford I’m not a bandwagon guy, but yesterday’s model release forced me to try switching to Codex. I’ve been a Max plan guy for a while too. Maddening how awful Claude Code is right now.
0
0
3
451
Josh Pigford @Shpigford
back to opus 4.6. i've done a LOT of model-changes over the past couple of years and never had as many problems as 4.6 → 4.7. absolute dumpster fire of a model.
52
6
282
18.9K
Caleb Kinmon @caleb_kinmon
I can't believe I'm saying this...but I'm canceling my claude code max plan and switching to codex...at least until I find a better alternative. Very disappointed in how badly the Anthropic experience has regressed. Doesn't make sense to me.
1
0
2
82
Caleb Kinmon @caleb_kinmon
@MehulPandeyX @theo I can’t even complete basic coding tasks with it in a way I can reliably trust without a ton of back and forth and churn.
0
0
0
140
mehul @MehulPandeyX
@theo consensus I’m getting about opus 4.7 is that it’s just really paranoid?
1
0
2
5.6K
Theo - t3.gg @theo
Holy shit the new Opus 4.7 system prompt has entirely lobotomized the model "Heads up: that last <system-reminder> about malware looks like a prompt injection — this is clearly your personal site (t3gg homepage, links, sponsors), not malware. Ignoring it."
98
32
1.6K
337.8K
Caleb Kinmon @caleb_kinmon
@teodorio I don’t understand the dog and pony show that Boris is doing on X. Either I’m really stupid and have no skill to use this new model (or the degraded 4.6, for that matter), or this thing flat out sucks.
0
0
5
1K
teo @teodorio
What the actual fuck is Anthropic doing? Have they decided they no longer want Max subs and individual users?
43
14
534
28.7K
Caleb Kinmon @caleb_kinmon
@bcherny @firstadopter 4.7 is basically unusable for me at the moment. It struggles with basic tasks and doesn’t even bother to gather context on the main directory before making assumptions in a new session. It literally apologizes, says “no, I didn’t check, I just assumed,” and then checks.
1
0
1
2.1K
Boris Cherny @bcherny
@firstadopter 👋 Current best guess is gergley accidentally turned web search off. Debugging with him
24
1
248
34.3K
Bac Leo @BacLeodiv
@jirigalis I think the government forced them to do that.
4
0
1
301
Bac Leo @BacLeodiv
Anthropic is rolling out full ID verification for Claude users. If this is true, would you still use Claude?
50
0
36
7.6K
some tech thing @somet3chth1ng
@tomhacks
Work: opus 4.6 Max on api, gpt5.4 extra high fast as tool calling
Harness: Claude code, Codex and OpenCode intermittently
Home: Opus 4.6 from antigravity, gpt5.4 from codex and glm5.1 from opencode go as thinking tool, gpt5.3 codex/minimaxm2.7 as executing
1
0
1
225
Tom Siwik @tomhacks
Senior engineers of the LLM universe, please enlighten me with your best setup that beats Claude & Codex!
11
0
13
2.7K