Cameron Thacker

325 posts

Cameron Thacker

@CameronMThacker

Los Angeles, CA Katılım Ağustos 2013

175 Takip Edilen36.9K Takipçiler

Cameron Thacker@CameronMThacker·2d

@tonis_a_gayaraj @sedielem @CSProfKGD Interesting! I’m gonna look into to this. I love when an “old” technique turns out to be useful again

English

Toni Sagayaraj@tonis_a_gayaraj·2d

@sedielem @CSProfKGD I can’t believe CDCD tried so hard to diffuse on embeddings and actually the solution was just to throw one-hots at modern diffusion architectures and let them figure it out

English

248

Sander Dieleman@sedielem·2d

Continuous language diffusion strikes back! Flow maps are really starting to come their own as a viable method for language modelling with very fast inference. FMLMs produce good results even with a just a _single_ forward pass!

Nicholas Boffi@nmboffi

🤯 big update to our flow map language models paper! we believe this is the future of non-autoregressive text generation. read about it in the blog: one-step-lm.github.io/blog/ full details in the paper: arxiv.org/abs/2602.16813 we introduce a new class of continuous flow-based language models and distill them into their corresponding flow map for one-step text generation. we beat all discrete diffusion baselines at ~8x speed! v2 gives a complete theory of the flow map over discrete data, with three equivalent ways to learn it (semigroup, lagrangian, eulerian). it turns out you can train these with cross-entropy objectives that look very similar to standard discrete diffusion — but without the factorization error that kills discrete methods at few steps. beyond improving results across the board, we showcase properties that are unique to continuous flows. in particular, inference-time steering and guidance become straightforward. autoguidance brings generative perplexity down to 51.6 on LM1B, while discrete baselines completely collapse at the same guidance scale. we also show reward-guided generation for steering topic, sentiment, grammaticality, and safety at inference time — and it works even at 1-2 steps with our flow map model. simple, well-understood techniques from continuous flows just work incredibly well in practice for language. we’re extremely excited about the future of this class of models. stay tuned for results on scaling, reasoning, and reinforcement learning-based fine-tuning. 🚀

English

195

25.3K

Cameron Thacker@CameronMThacker·3 Nis

@MindsAI_Jack If I comment here, will the algo bring me more linear algebra? 🙏😆

English

367

Jack Cole@MindsAI_Jack·2 Nis

What happened to all the AI/ML papers being announced on x? They seem to have disappeared for me. Are others noticing the same?

English

425

45.1K

Cameron Thacker@CameronMThacker·2 Nis

@PaulNeverovsky It's honestly really nice. Maybe a tad complicated for the average user, but I want this mode lol.

English

195

Paul Never@PaulNeverovsky·1 Nis

@CameronMThacker this is my goal 👀

English

6.1K

Paul Never@PaulNeverovsky·1 Nis

Anthropic just leaked a new Claude app design, and it’s crazy good

English

116

177

5.2K

529.8K

Cameron Thacker@CameronMThacker·30 Mar

I'm surprised this is getting a lot of traction. This has been a thing for a long time already. You don't need a heavy plugin. You can just tell your agent to use `codex exec` or build a simple skill from that like I do. Just make sure to send std error to dev null (2>/dev/null) so thinking tokens don't pollute your context.

English

193

Romain Huet@romainhuet·30 Mar

We’ve seen Claude Code users bring in Codex for code review and use GPT-5.4 for more complex tasks, so we thought: why not make that easier? Today we’re open sourcing a plugin for it! You can call Codex from Claude Code with your ChatGPT subscription. We love an open ecosystem!

dominik kundel@dkundel

I built a new plugin! You can now trigger Codex from Claude Code! Use the Codex plugin for Claude Code to delegate tasks to Codex or have Codex review your changes using your ChatGPT subscription. Start by installing the plugin: github.com/openai/codex-p…

English

287

349

5.4K

914.6K

Cameron Thacker@CameronMThacker·27 Mar

@varunneal Thanks for sharing - that is not intuitive. For me, this competition is just fun to experiment with and I'm actually only interested in novel solutions and architectures, not treating it like some kaggle competition 😂

English

varun@varunneal·27 Mar

the AI swarms are really good at hill-climbing reward hacks. Most of the recent submissions use unnormalized N-gram distributions allow logits to be arbitrarily high

varun@varunneal

apparently the agent harnesses keep 'rediscovering' TTT (cheating) on their own

English

1.9K

Cameron Thacker@CameronMThacker·26 Mar

@atulit_gaur Lot of botted content. Post this type of content because you learn when you teach - or you just want to. Don't do it for external validation.

English

atulit@atulit_gaur·25 Mar

why does no one give 2 fucks about educational content? im not even sad this didn't get any reach, im just truly wondering

atulit@atulit_gaur

the router in mixture of experts models is a linear layer. it takes a token's hidden state, multiplies it by a weight matrix of shape (num_experts, hidden_dim), softmaxes the result, and picks the top-k experts. that's it. but why does a matrix multiply "know" which expert to pick? each row of the router matrix is basically a learned prototype for that expert. the dot product measures how similar the token is to that prototype. high score = that expert gets activated. the cool part is nobody hardcodes what each expert specializes in. during training, gradient descent naturally pushes experts toward specialization because it minimizes loss better that way. one problem though - without a load balancing auxiliary loss, the router collapses and keeps sending tokens to the same 2-3 experts while the rest rot. that's why every moe paper has some balancing trick.

English

7.8K

Cameron Thacker@CameronMThacker·25 Mar

@EastlondonDev I think this is a super interesting direction that would also be cool to incorporate with recursive language models...the repl is the model?? lol

English

Andrew Jefferson@EastlondonDev·25 Mar

It’s fucking working This LLM brain has been fused with a mini computer and it can switch between generating text and generating and executing machine code - all running in a single GPU & torch graph

Andrew Jefferson@EastlondonDev

It turns out that teaching an existing language model new tokens takes a bit of work. To use wasm directly in the neural network I need the language model to output specific wasm tokens and byte tokens (one token for every byte value 0-255) that match the hard coded wasm interpreter subgraph. There are two problems. 1) the language model has never seen wasm tokens before and 2) when wasm tokens are used they flow into the wasm interpreter which will compute them and will hard fail if given invalid instructions. So the llm has to learn to use tokens it has never seen before in perfectly correct sequences. Thats enough of a challenge that my AI agent couldn’t get SFT on pretrained nanochat language model to work with about a week of trying different approaches. We either got mode collapse where the only wasm token predicted was the most common one (CONST_I32) or it learned to use the wasm operations but completely lobotomised the language model in the process and it could not produce correct byte values for inputs.

English

1.3K

107.3K

Cameron Thacker@CameronMThacker·10 Mar

@OfficialLoganK uh given the state of the middle east ... wording? lol

English

212

Logan Kilpatrick@OfficialLoganK·9 Mar

Going to be a fun week of launches : )

English

432

151

3.7K

357.2K

Cameron Thacker@CameronMThacker·23 Şub

@Dimillian Oh that's going to be very useful when live. Currently having to remember to either visually check or make sure the agent pulls latest status is annoying and when moving fast could definitely be problematic

English

310

Thomas Ricouard@Dimillian·23 Şub

The Codex team added a Codex skill to babysit PR on their repo, I want to try that myself and see how that works. It tries to ensure the CI passes, that all comments are resolved, etc. Code review is where the biggest bottleneck is right now. github.com/openai/codex/c…

English

218

22.1K

Cameron Thacker@CameronMThacker·9 Şub

“…ask yourself why you were wrong about this and try to update your priors so that you'll be less wrong next time.” This is almost verbatim what I say when Claude takes too many iterations to fix a bug lol

François Chollet@fchollet

Lots of folks are apparently in utter disbelief at these numbers, because *obviously* Google search died in 2023, *no one* is using Google at all in 2026, so the numbers must be wrong somehow, or maybe it's just AI agents making all these queries? Nope, it's a plain fact that more people than ever are using Google to search more than ever. In fact Google search usage is *accelerating* as of Q4 2025 Look, instead of grasping at straws, ask yourself why you were wrong about this and try to update your priors so that you'll be less wrong next time

English

255

Cameron Thacker@CameronMThacker·9 Şub

@honnibal I find myself using codex more and more now. I miss subagents, but with the new codex app the threads view makes it quite easy to have many thing going on at the same time.

English

292

Matthew Honnibal@honnibal·9 Şub

Does anyone else find Claude Code incredibly sneaky as soon as you get a test failure that might actually matter? I have to fight very hard to make it investigate instead of just redefine the test and say it's supposed to be like that

English

11.2K

Cameron Thacker@CameronMThacker·6 Şub

Well do you consider our brain one thing or multiple systems that interconnect? I love thinking about the “architecture of AI” in terms of biological terms. We already have specialized subagents like the hippocampus for memory and cerebellum for movement (and other things). Is this the optimal way to design all intelligences or just biological? Is it even optimal? So interesting to think about though.

English

Flem@GrahamFleming·5 Şub

AGI will not be a single model, it will be an orchestration of agents, tools and data creating a hyper intelligent organism making 1000s of commits per hour

Cursor@cursor_ai

We've been working on very long-running coding agents. In a recent week-long run, our system peaked at over 1,000 commits per hour across hundreds of agents. We're sharing our findings and an early research preview inside Cursor.

English

223

Cameron Thacker@CameronMThacker·5 Şub

@jon_barron But aren't these traces sometimes completely disconnected from what the agent implements? Or are you saying, future models will be more robust and these CoT traces will be built into products etc?

English

161

Jon Barron@jon_barron·4 Şub

The programmer’s entire moat in 2026 is the ability to understand CoT traces from coding agents. The fraction of our time we spend doing this will increase and saturate at 100%. The remaining epsilon% will be demanding experiments and unit tests in response to CoT traces.

English

6.4K

Cameron Thacker@CameronMThacker·30 Oca

@KwekuOnX @vercel Well it's not a hard limit, but this is what many find to be most effective

English

175

Kweku Amoah@KwekuOnX·30 Oca

@CameronMThacker @vercel Oh didn’t know there was a limit in its size

English

162

Vercel@vercel·29 Oca

We're experimenting with ways to keep AI agents in sync with the exact framework versions in your projects. Skills, 𝙲𝙻𝙰𝚄𝙳𝙴.𝚖𝚍, and more. But one approach scored 100% on our Next.js evals: vercel.com/blog/agents-md…

English

139

1.6K

467.7K

Cameron Thacker@CameronMThacker·30 Oca

This sounds intuitive but it's completely backwards. Constraints breed creativity. The problem one is trying to solve is itself a constraint. You're posting this on X, built on a 140-character limit which forced to be creative. Toy Story was famously created because CGI couldn't render realistic skin texture. The pattern shows up everywhere. Pressure and constraints are comfortable, but comfort rarely produces anything useful.

Niels Rogge@NielsRogge

You know that researchers need freedom and zero pressure for creativity right? Not $180M in funding which creates crazy pressure from VCs? The Transformer and Diffusion models weren’t born this way

English

311

Cameron Thacker@CameronMThacker·29 Oca

@SeniorDeFi @EnterTheMythos @world_chain_ we unfortunately can't do the same for linked or external wallets as they don't support the newer account abstraction features yet.

English

Cameron Thacker@CameronMThacker·29 Oca

@SeniorDeFi @EnterTheMythos @world_chain_ if you use the embedded wallet you get from creating your account - gas fees are covered!

English

Enter the Mythos@EnterTheMythos·29 Oca

Pudgy Party items are now live on Pulse Market! Pulse Market is still in Beta version, but this is the next step in the broader transition to our new marketplace tech as we continue expanding Mythos on @world_chain_. FIFA Rivals will be up next along side other additional new features. Listings are denominated in USDC for price stability and Mythos chain fee tokenomics remain consistent. Pulse Market activity continues to flow through Mythos rails, including MYTH burn mechanics for transaction fees.

English

10.9K

Cameron Thacker@CameronMThacker·29 Oca

@jon_barron @aidanmantine I think the DNA bottleneck also implies most of the "compute" was wasted too though right? I really think it points more to compression or architectural/algorithmic improvements

English

198

Jon Barron@jon_barron·29 Oca

Thanks for the kind words! Genetic pressure on humans has definitely yielded a very good learning algorithm, and there are surely better learning algorithms out there yet to be discovered. But this "a human only sees N tokens" framing, doesn't it still endorse the regular data scaling argument? I may have only seen N tokens myself, but all my ancestors saw >1e100 N tokens in total. Those learnings may have gotten squeezed into a DNA bottleneck but scaling data was still the enabling factor.

English

5.6K

Jon Barron@jon_barron·29 Oca

This idea that intelligence is solely a function of what you've observed since birth and not also a function of the 500 million years of evolution that preceded your birth is surprisingly sticky despite being demonstrably untrue.

Flapping Airplanes@flappyairplanes

The proof that this is possible is all around us: whereas current systems are trained on essentially all of accessible history, humans exceed AI capabilities despite seeing at most a few billion text tokens by adulthood.

English

1.4K

119K

Keşfet

@tonis_a_gayaraj @sedielem @CSProfKGD @MindsAI_Jack @PaulNeverovsky @varunneal @atulit_gaur @EastlondonDev