Snowman Research
37 posts

Snowman Research
@ResearchSnowman
Come for the stock picks. Stay for the AI Research insights. 📍NYC | SF
New York, USA Se unió Mayıs 2026
150 Siguiendo13 Seguidores
Tweet fijado

As Dwarkesh points out, domain specific rubrics have driven a lot of recent AI progress. Let’s say a Lab spends $10M on 10,000 of these rubrics for challenges an accountant might face, and then spends millions more training the model.
If this makes accountants 10% more efficient, the ROI on that investment is fantastic. In fact, even if the data required is 3-5x more expensive than I’ve estimated it’s still likely a great investment (and why labs are only accelerating their data spend).
The challenge is the long tail. As you move into increasingly specialized professions with fewer practitioners and lower aggregate economic value, the economics of building bespoke rubrics become less attractive.
That’s where recursive self-improvement becomes important. If models can reliably generate and evaluate increasingly difficult tasks for themselves, they can become more data efficient.
Dwarkesh Patel@dwarkesh_sp
Narration: the data efficiency black hole. 00:00:00 – What is really driving AI progress? 00:03:11 – Comparing human vs AI sample efficiency 00:08:46 – Does sample efficiency matter? Also on pod and YouTube feed.
English

@_shahaf_ Very well timed. 60B is a 2.5% change in SpaceX stock at the current valuation
English

@ResearchSnowman I find the cursor acquisition well timed on musks side. It’s all stock and the stock is pumped right now. Definitely going down a lot once investors can start selling en masse.
English

$SPCX is now becoming vertically integrated in a way OpenAI and Anthropic are not: they own the data centers to the application layer
Today, Cursor announced a new 1.5T model, likely pretrained on top of the upcoming Grok v9 medium model. This means Cursor is leveraging SpaceXAI compute in two ways:
1. Larger base model - previous Cursor model was based on the 1T Open Source Kimi model
2. 10-20x more post training training compute
Next question, when will we get a Cursor model trained on Grok large?
And when will SpaceXAI buy Tesla's custom AI chips? Or maybe SpaceX just buys Tesla 😉
Max Weinbach@mweinbach
New Cursor model trained from scratch! 1.5T+ and all seemingly Blackwell chips
English

9/ Routing will likely only become more of a thing
Fable is not just a model. It is a capability-gated system. For cyber, bio, and frontier LLM development, classifiers ensure the models behaves as it should.
That is a departure from the idea of giving the model itself the ability to know when to decline answering questions in certain areas. This routing approach likely simplifies the model training because you can focus on training the most capable model possible and trust that its abilities can be selectively gated at deployment time.
English

🧵 Initial thoughts on the Anthropic Fable 5 System Card
www-cdn.anthropic.com/d00db56fa754a1…
English
Snowman Research retuiteado

@citrini We are still compute bound. If there was greater chip supply capability would be improving even faster.
English

3 months ago we posted an article that was predicated on model capability doubling every 6 months and people said it was too unrealistic…
Anthropic@AnthropicAI
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
English


Love the concept of idea-driven vs. goal-driven research from this classic post by @johnschulman2.
The best AI researchers are obsessed with a long-term objective (e.g. making robots walk), not a particular method joschu.net/blog/opinionat…
English




