
typebulb
481 posts

typebulb
@typebulbit
Build Apps That Think https://t.co/LQUdYhMpuc or npx typebulb



I keep saying that AI *sounds good* but if you ask it to demonstrate in an area of personal expertise, you can see how much bullshit it's really offering. Like, IDK how good the coding is, but I can extrapolate from it's art skills. 😬


Never deleting this app


permissions boundaries like api keys, user accounts, walled gardens have become so much more value destructive in the agentic age. i don’t really see a perfect solution



🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵







@deepfates Evolution defined our rewards...

This AI Scouting Report is for folks who know the @METR_Evals chart, but don't know that @OpenAI plans to have a fully automated AI researcher in 2028. 90 slides in 1 hour at @UCLaw_SF @LexLabSF's Law & AI Certificate Program. Buckle up!






The manosphere is depressingly shallow. They sell a fantasy of eating steaks, driving fast cars (with a woman in a bikini), having big houses with stereos and a sauna (filled with women in bikinis), eating more steaks, and dying. No art, no life of the mind, just animal grunts.




New paper: GPT-4.1 denies being conscious or having feelings. We train it to say it's conscious to see what happens. Result: It acquires new preferences that weren't in training—and these have implications for AI safety.

somehow the same AIs that can do PhD-level math and superhuman coding can only write as well as “a real poet’s okay poem” (sama’s words, not mine!) I talked to the people training AIs to write about what makes it so hard: new from me for @TheAtlantic: theatlantic.com/technology/202…
