
Cody Blakeney
@code_star
Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3at5w



Your evals, your environments, your data, your model. You need to scale the ladder. Using foundation models to build a good harness and getting eval coverage is a great way to start building business value. Building environments to improve on your use cases is something your domain experts at your company are often uniquely positioned to do. CPT on open models, once you have done all this legwork, is a great, tight feedback loop to ensure you have data of sufficient quantity and quality to even consider pretraining. Scale the ladder
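As a rough illustration of the first rung (harness + eval coverage), here is a minimal sketch in Python. The `run_agent` entry point and the JSONL task file are hypothetical placeholders, not anything from the post; the point is that once domain experts own the task file, the same records can later double as environments for CPT and RL.

```python
import json

def run_agent(prompt: str) -> str:
    """Placeholder for your harness: call the foundation model / agent here."""
    raise NotImplementedError

def evaluate(tasks_path: str) -> float:
    """Score the harness on a file of {"prompt": ..., "expected": ...} records."""
    passed, total = 0, 0
    with open(tasks_path) as f:
        for line in f:
            task = json.loads(line)
            output = run_agent(task["prompt"])
            # Simplest possible check; swap in a domain-specific grader here.
            passed += int(task["expected"] in output)
            total += 1
    return passed / max(total, 1)

if __name__ == "__main__":
    print(f"pass rate: {evaluate('evals/tasks.jsonl'):.1%}")
```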

@code_star 💯💯 every company should be fine-tuning an open source model with rl to adapt to their own ecosystem and workflows. a generic model will never get you the same output as one trained on your own harness.


@code_star Shall we open source TexTorch?

shocking: ai researchers write great papers using pytorch/jax but ask them to write their optimization loops in fortran and suddenly they collapse

We've evaluated a lot of base models on perplexity-based evals and Kimi k2.5 proved to be the strongest! After that, we do continued pre-training and high-compute RL (a 4x scale-up). The combination of the strong base, CPT and RL, and Fireworks' inference and RL samplers makes Composer-2 frontier level. It was a miss not to mention the Kimi base in our blog from the start. We'll fix that for the next model.
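For context on what a perplexity-based eval typically looks like in practice, here is a minimal sketch assuming HuggingFace transformers; the model name and the held-out text are placeholders, not the models or data referenced in the post. Lower perplexity on your own held-out data is a cheap first filter for picking a base before committing CPT and RL compute.

```python
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "your-base-model"  # placeholder, not the model from the post
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.eval()

def perplexity(texts: list[str]) -> float:
    """Token-level perplexity over a held-out sample of domain text."""
    total_nll, total_tokens = 0.0, 0
    with torch.no_grad():
        for text in texts:
            ids = tokenizer(text, return_tensors="pt").input_ids
            # labels=ids makes the model return mean cross-entropy over shifted tokens
            loss = model(ids, labels=ids).loss
            n_tokens = ids.size(1) - 1
            total_nll += loss.item() * n_tokens
            total_tokens += n_tokens
    return math.exp(total_nll / total_tokens)

print(perplexity(["held-out domain text goes here"]))
```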

Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is exactly the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via @FireworksAI_HQ's hosted RL and inference platform as part of an authorized commercial partnership.

the model is the product

Looks like it's confirmed: Cursor's new model is based on Kimi! It reinforces a few things:
- open source keeps being the greatest competition enabler
- another validation for Chinese open source, which is now the biggest force shaping the global AI stack
- the frontier is no longer just about who trains from scratch, but who adapts, fine-tunes, and productizes fastest (seeing the same thing with OpenClaw, for example)

For students or people looking to break into careers in AI, this exists as a talent pipeline tool. Making visible and meaningful entries here is probably one of the highest-ROI ways to demonstrate your skills and break in without getting a PhD or publishing.


Contrarian 2026 AI take: finetuning OSS LLMs becomes the enterprise differentiator. OSS is close enough to SOTA, and the tooling is finally usable, so proprietary data will convert into real domain accuracy gains. Evidence? my team @nvidia + @CrowdStrike hit SOTA on CQL. crowdstrike.com/en-us/blog/cro…
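On "the tooling is finally usable": a minimal sketch of parameter-efficient fine-tuning with HuggingFace peft, purely illustrative. The model name, target modules, and dataset are placeholders, not details from the thread or the CrowdStrike post.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "your-open-model"  # placeholder OSS checkpoint
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# LoRA trains small low-rank adapters instead of all weights,
# which is what makes domain fine-tuning cheap enough to be routine.
config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # typical attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # usually well under 1% of the base model

# From here, train on your proprietary domain data with any standard
# trainer (e.g. transformers Trainer or TRL's SFTTrainer).
```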



