
Goliath
213 posts

Goliath
@zero_goliath
@uwaterloo cs; formerly @ritserlabs, intern @runrl_com








I believe this new model in Claude Code is a glimpse of the future we're hurtling towards, maybe as soon as the first half of next year: software engineering is done. Soon, we won't bother to check generated code, for the same reasons we don't check compiler output.

There's a huge spectrum of approaches to memory/continual learning - ranging from RAG to dreams of "infinite context" generalization to baking in new knowledge w/ gradient updates. I'm personally bullish on parametric updates that allow the model itself to get smarter over time (rather than pure systems-based approaches around black box models), but there's still a lot of open questions to make this work at scale.




@karpathy can you elaborate on why images can get bidi attention easily while text cannot? also, no tokenization but dont we still get something similar and perhaps uglier when chunking the input image into patches?





