Sidharth Sirdeshmukh

192 posts

Sidharth Sirdeshmukh

@sidharf

i'll take my salary in tokens please

Boston, MA Katılım Ağustos 2023

29 Takip Edilen17 Takipçiler

Sabitlenmiş Tweet

Sidharth Sirdeshmukh@sidharf·12 Mar

@cursor_ai Token pricing varies wildly across the models, so I plotted my long-horizon evals against $ cost. Frontier shifts big time

English

8.5K

Sidharth Sirdeshmukh@sidharf·7h

@leerob Cool! Was it was Kimi 2.5? Or another model?

English

1.6K

Lee Robinson@leerob·7h

Yep, Composer 2 started from an open-source base! We will do full pretraining in the future. Only ~1/4 of the compute spent on the final model came from the base, the rest is from our training. This is why evals are very different. And yes, we are following the license through our inference partner terms.

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

308

148

2.3K

928.8K

Sidharth Sirdeshmukh@sidharf·8h

@thdxr I think it's actually 5.3 Codex under the hood. Been running some experiments past 2 weeks and can share evidence

English

357

dax@thdxr·11h

whether this is true or not it's going to cause every company producing open source models to re-evaluate if they should continue to do so that is incredibly frustrating

sumit@sumitdotml

now a deleted tweet, probably nothing

English

1.5K

141.5K

Sidharth Sirdeshmukh@sidharf·8h

Some more evidence supporting my hypothesis from the past few hours. These Kimi 2.5 fish tanks (specifically the fish, pebbles, and labeling/ UI) I generated look nothing like the Codex 5.3 and Composer 2 fish tanks in my original article...

English

Sidharth Sirdeshmukh@sidharf·1d

x.com/i/article/2034…

ZXX

22.5K

Sidharth Sirdeshmukh@sidharf·8h

@Yuchenj_UW Here's my evidence that suggests that it's actually 5.3 Codex under the hood: x.com/sidharf/status…

Sidharth Sirdeshmukh@sidharf

x.com/i/article/2034…

English

Yuchen Jin@Yuchenj_UW·8h

@sidharf look at the model url, and I also said “likely” in my post

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

2.3K

Yuchen Jin@Yuchenj_UW·9h

Cursor’s Composer 2 is likely built on Kimi K2.5. The model URL + tokenizer are strong signals. I love this direction: companies mid-train and post-train on top of OSS LLMs. Prediction: open-source model labs will monetize by taking a cut when others build on top of their models and scale to millions of real users. They will enforce this via licensing. That’s the flywheel. That’s how open-source AI thrives.

English

518

65.6K

Sidharth Sirdeshmukh@sidharf·8h

@Yuchenj_UW I see your post is speculative too. Couldn't the original account have photoshopped/ doctored the post?

English

232

Sidharth Sirdeshmukh@sidharf·13h

@TobiasVinc57321 They may have gotten the weights through a partnership w/ Open AI. And, the model got ‘worse’ because they RL’d for efficiency (cost/ time) optimization in coding tasks specifically (GPT base models are good at a lot of other things)

English

Tobias@TobiasVinc57321·13h

@sidharf Distill is not the same as a RL'd version How would cursor manage to get gpt 5 weights? How is the resultant model worse? Lol

English

Sidharth Sirdeshmukh@sidharf·13h

@menhguin @Kimi_Moonshot Don’t we need more proof first?

English

441

Minh Nhat Nguyen@menhguin·13h

obviously stealing from @Kimi_Moonshot is horrible, but at least the cursor team has excellent taste in models

Harveen Singh Chadha@HarveenChadha

things are about to get interesting from here on

English

273

26K

Sidharth Sirdeshmukh@sidharf·13h

Codex 5.3 is actually the base model for Composer 2 right? @cursor_ai

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

142

Sidharth Sirdeshmukh@sidharf·13h

@fynnso @tylercosgrove 👀

QME

1.1K

Fynn@fynnso·1d

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

Cursor@cursor_ai

Composer 2 is now available in Cursor.

English

261

414

6.2K

2.2M

Sidharth Sirdeshmukh@sidharf·13h

@elonmusk @fynnso Sure about that? x.com/sidharf/status…

Sidharth Sirdeshmukh@sidharf

x.com/i/article/2034…

English

10.4K

Elon Musk@elonmusk·13h

@fynnso Yeah, it’s Kimi 2.5

English

169

136

3.4K

641.7K

Sidharth Sirdeshmukh@sidharf·13h

@fynnso Don’t we need more proof to accept this?

English

2.5K

Sidharth Sirdeshmukh@sidharf·14h

@rogerliuty Do you think this is enough proof to say so for sure?

English

262

Tianyu Liu@rogerliuty·19h

Personally, I really like Composer 1.5 in Cursor — it’s affordable, fast, and functional. I’ll definitely give Composer 2 a try. However, I was a bit surprised to hear that Composer 2 might be built on K2.5. Is that true?

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

16.8K

Sidharth Sirdeshmukh@sidharf·15h

@Dorialexander They may not have paid…

English

Alexander Doria@Dorialexander·15h

If true, they likely just paid Moonshot for a custom license (everything is negotiable). But still an added value to own the full pretrain.

English

1.7K

Alexander Doria@Dorialexander·15h

We are going to hear a lot more about models licensing chain.

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

130

13.2K

Sidharth Sirdeshmukh@sidharf·15h

@Yulun_Du @cursor_ai @mntruell @tylercosgrove 👀

QME

Sidharth Sirdeshmukh@sidharf·15h

@thekitze Without more data we cannot know for sure. What about this? x.com/sidharf/status…

Sidharth Sirdeshmukh@sidharf

x.com/i/article/2034…

English

kitze 🛠️ tinkerer.club@thekitze·16h

LMAO NO FUCKING WAY?? 😂

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

952

201.5K

Sidharth Sirdeshmukh@sidharf·15h

@zxytim Need more data to know for sure… what do you think of this? x.com/sidharf/status…

Sidharth Sirdeshmukh@sidharf

x.com/i/article/2034…

English

2.7K

Sidharth Sirdeshmukh@sidharf·15h

@mark_k @DeepakNesss Hmm wondering why not?

English

Mark Kretschmann@mark_k·15h

@DeepakNesss LMAO, no.

914

Mark Kretschmann@mark_k·17h

Cursor Composer 2 appears to be built on Kimi K2.5 as the base model! 🤫 The Kimi model was then post-trained further with reinforcement learning for coding performance. I think it's quite likely true, as Cursor wouldn't train a completely new foundation model.

Fynn@fynnso

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

English

801

100K

Sidharth Sirdeshmukh@sidharf·15h

@fynnso Interesting. Until I saw this post I thought they had modified Codex 5.3 to be more efficient: x.com/sidharf/status…

Sidharth Sirdeshmukh@sidharf

x.com/i/article/2034…

English

6.9K

Sidharth Sirdeshmukh@sidharf·21h

@emollick Am I right?x.com/sidharf/status…

Sidharth Sirdeshmukh@sidharf

x.com/i/article/2034…

English

102

Ethan Mollick@emollick·21h

My experience so far with LLM fiction writing is that it takes advantage of our assumption that an author is writing things for a reason, so we are charitable to a book's quirks & do mental work to assign them real meaning. But the AI doesn't have a reason, its just bad writing.

English

203

20.1K

Keşfet

@leerob @thdxr @Yuchenj_UW @TobiasVinc57321 @menhguin @Kimi_Moonshot @cursor_ai @fynnso