HFT Bread
561 posts


Be a kid again and dream bigger.

Our world models are action conditioned, and hence causal. The concept of world model for planning goes back to the 1950s in optimal control (before I was born). I didn't just discover it. But training action-conditioned world models from sensory inputs (like video) requires new techniques.



OK I’m actually mad about this now. Diffusion models are a perfect glimmering jewels of applied stochastic calculus and these clowns are ruining them by turning them into trillion parameter overfit transformer slopdels. Fuck off.

@SouadH9 @GfI_Himmelreich @ordinaryepsilon @macrocephalopod probably no stronger signal of larp than a strong bias towards complex methods. if you've worked at any of the big successful hedge funds you've seen how much money can be made with careful arithmetic and the occasional OLS.



When I say AMD is dogshit at software: this is what I mean. Tensor cores, used for AI, are simpler than GPU and CPU and AMD is perfectly capable of making competitive ones. But they are useless without proper software support. And not only AMD is incapable of delivering on this front, but they are even unable to take advantage of people doing the legwork for them. Perfect exemple of wasting billions to save millions. That being said, there is so much upside. If they are willing, they can 10x their market cap in a few years just by hiring a competent leader for their software division, and, core importantly, getting out of the way so he can get things done.


Huge if true














