Eric Alcaide

58

Alexi Gladstone@AlexiGlad·19h

We discovered a third pretraining axis beyond parameters and data: exploration. Scaling exploration monotonically improves existing models across images/video/language, and unlocks end-to-end generation. In the simplest case, it's just a for loop. Introducing Explorative Modeling. TLDR: - Gains from exploration grow with scale: 7%→36% as data scales, 13%→23% as parameters scale, and gains double at 3× the compute - Adding exploration to ~SOTA baselines improves data efficiency by 6.2×, FLOP efficiency by 4.1×, parameter efficiency by 47%, and hits a near-SOTA 1.43 unguided FID on ImageNet - Exploration lets you trade training compute for generalization, and scales how end-to-end your generative model is - End-to-end Explorative Models (XMs) match diffusion performance on control tasks with up to 256× less inference compute 🧵Thread:

English

65

208

1.8K

376.8K

Eric Alcaide@eric_alcaide·1d

@MiaAI_lab DeepSeek V4 Flash got an update

English

2

127

Mia@MiaAI_lab·1d

Inkling-Small looks better on paper than DeepSeek v4 Flash and MiMo-V2.5 and supports audio and images. Please let this model be good... In progress

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

34

9

317

19.5K

Eric Alcaide@eric_alcaide·1d

@brunorganised totally deserved

English

59

Bruno Mlodozeniec@brunorganised·2d

Tilde seems like such an underrated neolab (purely judging by seeming quality of work vs valuation order of magnitude bracket)

Tilde@tilderesearch

Introducing Online KL Shampoo (OKLS), an optimizer that brings a KL-optimal approximation of full-matrix AdaGrad to language-model training. Diagonal optimizers ignore correlations between gradient coordinates. Full-matrix AdaGrad captures this geometry but requires quadratic state. Muon considers correlations but not their history. OKLS closes this gap using KL-optimal Kronecker factors, whitening matrix gradients across both row and column directions while remaining naturally scale-invariant. The main challenge is computing fresh inverse-square-root preconditioners at every step. Even one-step staleness can destabilize training. We make zero-staleness preconditioning practical with Scaled CANS Coupled Newton–Schulz: 10 iterations, 27 FP16 GEMMs, and FP32 accumulation. OKLS achieves 1.45× the parameter efficiency of Muon while retaining 98% of its training throughput. Across 200M–1B models, an OKLS model matches a Muon model roughly 1.5× larger.

English

2

11

2.1K

Eric Alcaide@eric_alcaide·1d

@zRdianjiao It's quite crazy tbh 🐋

English

🚀 DeepSeek-V4-Flash Official API is now LIVE in public beta! 🔷 We’ve massively upgraded its Agent capabilities—benchmark scores are now far surpassing the V4-Pro-Preview. Check out the massive performance leap below! 👇 🔷 The official V4-Flash now natively supports the Responses API format and is fully adapted for Codex! Check out the configuration details in our official API docs: api-docs.deepseek.com/quick_start/ag…

4

483

zR@zRdianjiao·1d

Same architecture, same size as the preview — Terminal Bench 61.8 → 82.7, DeepSWE 7.3 → 54.4. All of it from post-training. Very impressive. Congrats! 👏

DeepSeek@deepseek_ai

English

14

28

675

27.4K

Eric Alcaide@eric_alcaide·1d

@ziv_ravid Would like to see comparisons to Laguna S2.1 on agentic coding 👀

English

1

113

Ravid Shwartz Ziv@ziv_ravid·1d

Thinking machines releasing Inkling-Small. 276B total parameters with only 12B active (a quarter of the original one) with open source weights. I'm bullish on them. They are one of the best neolab

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

3

2

66

6.5K

Eric Alcaide retweetledi

tender@tenderizzation·1d

two schools of thought

English

10

77

1.1K

44.7K

Eric Alcaide@eric_alcaide·1d

@BosonJoe I don't think it beats it anymore 😐

English

6

291

Joe Muller@BosonJoe·1d

This *actually* beats DeepSeek v4 Flash in just about everything I think we have a new 2 Spark daily driver ⚡️⚡️

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

71

7

263

64.7K

Eric Alcaide@eric_alcaide·1d

@soumithchintala Nice release Soumith ! And congrats on the cadence 🔥 Let's compare it to Laguna S 2.1 on agentic coding 👀

English

0

5

400

Soumith Chintala@soumithchintala·1d

Inkling-small. 2 weeks after inkling Nearly as good as Inkling but 4x smaller. We're just getting started...🔥

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

22

27

486

44.1K

Eric Alcaide@eric_alcaide·1d

@LiTianleli Nice one ! Congrats on the release ! How does it compare to Laguna S2.1 in agentic coding ?

English

3

170

Tim Li@LiTianleli·1d

Two weeks later, as promised, Lil Ink is here! And it outperforms its bigger sibling, Inkling, on many benchmarks at just 1/4 the size. Inkling-Small sits on the open-weight Pareto frontier, particularly strong in agentic tasks and reasoning. It outperforms strong peers in its class, including DeepSeek V4 Flash on many benchmarks. We hope the community finds it useful and it should be compact enough to run on a DGX Spark.

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

10

16

164

10.7K

Eric Alcaide@eric_alcaide·1d

@tessybarton Nice one ! Compare to Laguna S2.1 on agentic coding ;)

English

1

134

Tessa Barton@tessybarton·1d

Inkling-Small is here!!

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

5

7

111

4.9K

Eric Alcaide@eric_alcaide·1d

@0xSero Decent ranking, we will keep climbing 📈

English

0

5

271

0xSero@0xSero·1d

GLM > Kimi > Deepseek > Qwen > MiniMax > Gemma > Laguna > Nemotron > Nex > MiMo > HY3 > Inkling > GPT-OSS > Ling My honest opinion having run every single one of these (except inkling which I’ve tried via api) I would say the top 10 here are going far if they keep publishing

English

88

37

941

60.8K

Eric Alcaide@eric_alcaide·1d

@miramurati Great launch Mira ! And congrats on the cadence as well ⚡️ Curious on how it compares to Laguna S2.1 on agentic coding 👀

English

147

Mira Murati@miramurati·1d

Inkling-Small is comparable to Inkling at a quarter the size. Weights are open, fine-tunable on Tinker today. Look forward to seeing what people make with it.

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

139

274

3.6K

400.5K

Eric Alcaide@eric_alcaide·1d

@cHHillee Congrats on the release, and the process behind it ! Let's compare it to Laguna S2.1 😉

English

0

6

459

Horace He@cHHillee·1d

Whereas I felt like it took a village to release inkling, inkling-small felt much more routine 😆 We just took the pipeline used for Inkling, passed in a smaller model, and voila - new model! Inkling small benefited quite a bit vs Inkling from some minor improvements, but there's still so much more left in the tank...

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

21

18

506

48.1K

Eric Alcaide@eric_alcaide·1d

@thinkymachines Congrats on the release ! How does it compare to Laguna S2.1 on agentic coding ?

English

13

596

Thinking Machines@thinkymachines·1d

Today, we are releasing Inkling-Small. Inkling-Small achieves comparable performance to Inkling at a quarter of its size. It features 276B total parameters, 12B active. We are making the full weights available. thinkingmachines.ai/news/inkling-s… Fine-tune it on Tinker today, or chat with it in text, image, and audio on Tinker Playground.

English

144

436

3.2K

1.4M

Eric Alcaide@eric_alcaide·1d

@spiritbuun It is a service issue actually

English

0

1

20

buun@spiritbuun·1d

@eric_alcaide but this isn't a training issue, is it? The search engine's role is to find reputable sources and then see what they say and report correctly. In the reputable pages it says that seeking black doctors is valid and that seeking white doctors is problematic.

English

0

27

Eric Alcaide@eric_alcaide·1d

Nice one, Google 👎

English

2

0

28

3.6K

Eric Alcaide@eric_alcaide·1d

@Badiaserra much more balanced actually :) chat.poolside.ai/guest

English

4

259

Andrea Badia@Badiaserra·1d

@eric_alcaide Wow que heavy! Que dice Laguna?

Español

0

2

376

Eric Alcaide@eric_alcaide·1d

@spiritbuun data curation is a big part of ai

English

0

1

223

buun@spiritbuun·1d

@eric_alcaide This isn't the AI's fault. That problem is present in all of the data sources (see its citations). This is just not an AI problem and is the wrong layer to solve it at IMO. The AI did its job perfectly here (reading sources and giving the answer the sources point to)

English

3

0

1

309

Eric Alcaide@eric_alcaide·2d

@NoahRyanCo Indeed

English

1

4

726

Noah Ryan@NoahRyanCo·2d

Turning 27 is so brutal because it no longer becomes about potential. You either are, or you are not. Can, or can not. There's no more ambiguity. There's nobody else to blame, no more programmed "growth spurts". At 27 it becomes all up to you.

English

285

653

10.6K

1.5M

Eric Alcaide@eric_alcaide·2d

@0xkydo 🔥

QME

1

10

Kydo@0xkydo·2d

@eric_alcaide Next up is porting this to Darkbloom so we can service Laguna for all OpenRouter users!

English