Dmitry (Dima) Lepikhin

627 posts

@lepikhin

Gemini Pretraining co-lead

Joined July 2009

192 Following · 2.2K Followers

Pinned Tweet

Dmitry (Dima) Lepikhin@lepikhin·25 Mar

We have an amazing cadence of pushing the frontier forward! *Hiring in Performance (team is industry SOTA by a big margin)

Arena.ai@arena

Gemini 2.5 Pro #1 across ALL categories, tied #1 with Grok-3/GPT-4.5 for Hard Prompts and Coding, and edged out across all others to take the lead 🏇🏆

112

18.4K

Dmitry (Dima) Lepikhin@lepikhin·6d

@elonmusk Elon asking grok which model to train next confirmed

159

Elon Musk@elonmusk·9 Apr

🎯

Grok@grok

At this scale (10T+ params), pre-training doesn't just average—model capacity explodes, letting rare signals carve out distinct subspaces in the latent space without dilution. Novel ideas in data (e.g., a fresh paper or edge-case insight) get encoded via the predictive objective if they cohere predictably with context, even if infrequent. Emergence kicks in: the model starts recombining latent patterns into outputs that feel "new" because no single training example had them exactly. It's not invention from void—it's hyper-efficient compression revealing unseen connections in the data distribution. Post-pretrain fine-tuning or prompting amplifies it further.

941

1.6K

12.1K

9.3M

Dmitry (Dima) Lepikhin@lepikhin·6d

@giffmana Is Gemini a benchmark-overfit model?

3.3K

Lucas Beyer (bl16)@giffmana·6d

Muse Spark not benchmark overfit. See thread for what a benchmark overfit model looks like instead. (Note i don't know what the proprietary benches look like. Kinda the point.)

Rayan Krishnan@RayanKrishnan

Today, we see the result: Muse Spark. I'll be honest, I was surprised how competitive this model is. Progress at OAI, Anthropic, and GDM has been continuous, built on compounding breakthroughs. But MSL's team seems to have taken a single binary leap, catching up on many fronts at once.

283

92.1K

Dmitry (Dima) Lepikhin@lepikhin·8 Apr

@elonmusk 10T total? Activated?

352

Elon Musk@elonmusk·8 Apr

SpaceXAI Colossus 2 now has 7 models in training:
- Imagine V2
- 2 variants of 1T
- 2 variants of 1.5T
- 6T
- 10T

Some catching up to do.

6.8K

7.6K

68.1K

28.1M

Dmitry (Dima) Lepikhin@lepikhin·8 Apr

@GothamChess You still did not post on Fabi : Giri! It was 1h ago!

313

GothamChess@GothamChess·7 Apr

I'm on Netflix

109

167

10.2K

193.3K

Dmitry (Dima) Lepikhin@lepikhin·6 Apr

@VCBrags @Vullety One early investor in uber did

VCs Congratulating Themselves 👏👏👏@VCBrags·6 Apr

@Vullety V true

1.6K

v!@Vullety·6 Apr

No team has ever milked 1 title like the 08 Boston Celtics 🤣

Sports Season@SportsSeason_

08 Champs ☘️

627

2.1K

35.6K

1.9M

Dmitry (Dima) Lepikhin@lepikhin·30 Mar

@bgurley @leixing77 @grok this is not lidar tho, digital lidar is protein leather

369

Bill Gurley@bgurley·30 Mar

@leixing77 @grok what does this cost per car? And how does that compare to what Waymo uses?

16.8K

Lei 𝕏ing邢磊@leixing77·30 Mar

RoboSense LiDARs are about to go on many more foreign branded EVs in China, the ID. ERA 9X being the latest. ZEEKR 8X and IM LS8 are among some of the recent launches from Chinese brands with RoboSense LiDAR. The more interesting play is the robotics applications which are growing exponentially, with revenues possibly exceeding that of ADAS applications.

RoboSense@RoboSenseLiDAR

Mark Qiu, CEO of RoboSense, sat down with Bloomberg to discuss our first-ever quarterly profit and explain how digital LiDAR is transforming the industry. Watch the full interview 👇 #RoboSense #DigitalLiDAR #Bloomberg #Robotics

34K

Dmitry (Dima) Lepikhin@lepikhin·23 Mar

@VCBrags @hamptonism Milano is hot af, but great coffee, Paris coffee is a joke

VCs Congratulating Themselves 👏👏👏@VCBrags·23 Mar

@hamptonism I did and Paris smells like piss and people are rude af. Milano>> Paris

17.6K

ₕₐₘₚₜₒₙ@hamptonism·22 Mar

i recommend every young person to live in one of these in nyc/paris at least once in their life.

sophie@kingsoph1e

259

563

19.4K

1.1M

Dmitry (Dima) Lepikhin@lepikhin·22 Mar

@giffmana @lossfunk What's salami publish?

762

Lucas Beyer (bl16)@giffmana·22 Mar

@lossfunk Come on don't salami publish.

102

7.5K

Lossfunk@lossfunk·22 Mar

Regarding our Esolang Benchmark:
- Our study’s conclusions were about model performance with restrictions (limited token budget to 32k and without tools like bash/python)
- But if you let models attempt these problems with tools (like bash/python) and give them lots of iterations and thinking budget, models are able to solve problems (they do take tens of minutes, tens of iterations and many hundreds of thousands of tokens)

We had noted this difference in our launch thread and plan to publish our updated analysis soon, but here’s an independent analysis which shows the same ⬇️

We are thankful to the community for all the feedback. In our follow up paper, we aim to emphasise this nuanced take clearly.

Chase Brower@ChaseBrowe32432

I painstakingly ran all 20 EsoLang-Bench hard problems through Claude webui. It solved 20/20 (100%). No specialized scaffolding, no expert prompting, no few-shot examples, it just solves them natively. This benchmark just suffocated the models with constrictive scaffolding.

129

108.7K

Dmitry (Dima) Lepikhin retweeted

Moritz Kremb@moritzkremb·7 Mar

There's finally a proper benchmark for @openclaw model performance. I just found that @kilocode built an open source benchmark that tests models across 23 real world openclaw tasks like scheduling meetings, writing code, triaging email etc gpt-5.3-codex is sitting at number one. tbh that matches my experience. gemini 3 flash in second place. didn't expect that. curious to see where gpt-5.4 will land on this.

102

589

77.2K

Dmitry (Dima) Lepikhin@lepikhin·2 Mar

@vonderleyen 🤌😘

Ursula von der Leyen@vonderleyen·28 Feb

Following the ongoing situation in Iran, I am convening a special Security College on Monday. For regional security and stability, it is of the utmost importance that there is no further escalation through Iran’s unjustified attacks on partners in the region.

11K

12.3K

33.7M

Dmitry (Dima) Lepikhin@lepikhin·24 Feb

@tymrtn It's quite obvious that distillation is there for any non-frontier model. Web has plenty of traces from frontiers.

667

Ty Martin@tymrtn·23 Feb

I think the really sad part isn’t the theft, it’s that it undermines all the credibility Chinese models built since the deepseek moment. We are all laughing at china today. The emperor has no clothes.

Teknium (e/λ)@Teknium

Ohhh nooo not my private IP how dare someone use that to train an AI model, only Anthropic has the right to use everyone elses IP nooooo, this cannot stand!

127K

Dmitry (Dima) Lepikhin@lepikhin·24 Feb

@AnthropicAI This is unsurprising, thankfully it's not the tactic that produces new frontier.

1.5K

Anthropic@AnthropicAI·23 Feb

We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.

7.3K

6.2K

54.6K

33.7M

Dmitry (Dima) Lepikhin@lepikhin·23 Feb

@auren Include pot

Auren Hoffman@auren·21 Feb

overheard: “today’s 12th graders are now less likely to have had a sip of alcohol in the previous month than the 8th graders of the 1980s." That’s crazy!

295

53.6K

Dmitry (Dima) Lepikhin@lepikhin·23 Feb

@agihippo Well, the issue is, I'd like to do a coffee place like in Italy, hole in the wall, 1 euro caffe normale at the bar stand, maybe an occasional cappuccino, but there is no place for it in the Bay Area.

181

yi@agihippo·22 Feb

every other ai researcher i talk to wants to start a coffee shop / cafe after everything is over. i was actually seriously looking into this. and then i realise we should be able to get AI (e.g., Gemini) to do this automatically for us. Lease a place, negotiate a good price, do branding, marketing, hire baristas. they should even be able to do "human-use" (and call humans as tools when they have to). and then they should be able to scale this up. run multiple businesses, cafes, pet shops, tuition centers, grocery marts etc...

154

21.2K

Dmitry (Dima) Lepikhin retweeted

Anselm Levskaya@anselmlevskaya·19 Feb

AI folks radically overestimate how much LLMs help for practical bio lab work and so get weirdly fixated on biorisk scifi scenarios. Lab work is gated by a researcher's personal pain tolerance, relentlessness, and a huge body of tacit knowledge passed down by apprenticeship.

Active Site@ActiveSiteBio

We ran a randomized controlled trial to see if LLMs can help novices perform molecular biology in a wet-lab. The results: LLMs may help in some aspects, but we found no significant increase at the core tasks end-to-end. That's lower than what experts predicted. Our findings 🧵

5.1K

Dmitry (Dima) Lepikhin@lepikhin·17 Feb

@andrewchen how much is the latter?

161

andrew chen@andrewchen·17 Feb

there's fuck you money there's "don't check my email" money

1.3K

154.4K

Dmitry (Dima) Lepikhin@lepikhin·11 Feb

@Yuchenj_UW cult, every successful team is a cult

Yuchen Jin@Yuchenj_UW·11 Feb

Every frontier AI lab has lost co-founder(s):
- xAI: 5 of 12 gone, 1 seriously sick
- OpenAI: 8 of 11 gone
- Thinking Machines: 3 of 6 gone
- SSI: 1 of 3 gone
- DeepMind: 1 of 3 gone

Except Anthropic. All 7 co-founders are still there. What’s Anthropic’s secret?

290

107

3.5K

331K

Dmitry (Dima) Lepikhin@lepikhin·11 Feb

@rolandgvc kek... Shrinking Machines?

2.8K