Dmitry (Dima) Lepikhin

627 posts

Dmitry (Dima) Lepikhin banner
Dmitry (Dima) Lepikhin

Dmitry (Dima) Lepikhin

@lepikhin

Gemini Pretraining co-lead

Katılım Temmuz 2009
192 Takip Edilen2.2K Takipçiler
Sabitlenmiş Tweet
Dmitry (Dima) Lepikhin
Dmitry (Dima) Lepikhin@lepikhin·
We have amazing cadence of pushing frontier forward! *hiring in Performace (team is industry SOTA by a big margin)
Arena.ai@arena

Gemini 2.5 Pro #1 across ALL categories, tied #1 with Grok-3/GPT-4.5 for Hard Prompts and Coding, and edged out across all others to take the lead 🏇🏆

English
2
10
112
18.4K
Elon Musk
Elon Musk@elonmusk·
SpaceXAI Colossus 2 now has 7 models in training: - Imagine V2 - 2 variants of 1T - 2 variants of 1.5T - 6T - 10T Some catching up to do.
English
6.8K
7.6K
68.1K
28.1M
GothamChess
GothamChess@GothamChess·
I'm on Netflix
GothamChess tweet mediaGothamChess tweet media
English
109
167
10.2K
193.3K
Bill Gurley
Bill Gurley@bgurley·
@leixing77 @grok what does this cost per car? And how does that compare to what Waymo uses?
English
4
0
9
16.8K
Lei 𝕏ing邢磊
Lei 𝕏ing邢磊@leixing77·
RoboSense LiDARs are about to go on many more foreign branded EVs in China, the ID. ERA 9X being the latest. ZEEKR 8X and IM LS8 are among some of the recent launches from Chinese brands with RoboSense LiDAR. The more interesting play is the robotics applications which are growing exponentially, with revenues possibly exceeding that of ADAS applications.
RoboSense@RoboSenseLiDAR

Mark Qiu, CEO of RoboSense, sat down with Bloomberg to discuss our first-ever quarterly profit and explain how digital LiDAR is transforming the industry. Watch the full interview 👇 #RoboSense #DigitalLiDAR #Bloomberg #Robotics

English
3
4
18
34K
Lossfunk
Lossfunk@lossfunk·
Regarding our Esolang Benchmark: - Our study’s conclusions were about model performance with restrictions (limited token budget to 32k and without tools like bash/python) - But if you let models attempt these problems with tools (like bash/python) and give them lots of iterations and thinking budget, models are able to solve problems (they do take tens of minutes, tens of iterations and many hundreds of thousands of tokens) We had noted this difference in our launch thread and plan to publish our updated analysis soon, but here’s an independent analysis which shows the same ⬇️ We are thankful to the community for all the feedback. In our follow up paper, we aim to emphasise this nuanced take clearly.
Chase Brower@ChaseBrowe32432

I painstakingly ran all 20 EsoLang-Bench hard problems through Claude webui. It solved 20/20 (100%). No specialized scaffolding, no expert prompting, no few-shot examples, it just solves them natively. This benchmark just suffocated the models with constrictive scaffolding.

English
22
8
129
108.7K
Dmitry (Dima) Lepikhin retweetledi
Moritz Kremb
Moritz Kremb@moritzkremb·
There's finally a proper benchmark for @openclaw model performance. I just found that @kilocode built an open source benchmark that tests models across 23 real world openclaw tasks like scheduling meetings, writing code, triaging email etc gpt-5.3-codex is sitting at number one. tbh that matches my experience. gemini 3 flash in second place. didn't expect that. curious to see where gpt-5.4 will land on this.
Moritz Kremb tweet media
English
102
47
589
77.2K
Ursula von der Leyen
Ursula von der Leyen@vonderleyen·
Following the ongoing situation in Iran, I am convening a special Security College on Monday. For regional security and stability, it is of the utmost importance that there is no further escalation through Iran’s unjustified attacks on partners in the region.
English
11K
1K
12.3K
33.7M
Dmitry (Dima) Lepikhin
Dmitry (Dima) Lepikhin@lepikhin·
@tymrtn It's quite obvious that distillation is there for any non-frontier model. Web has plenty of traces from frontiers.
English
0
0
0
667
Anthropic
Anthropic@AnthropicAI·
We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.
English
7.3K
6.2K
54.6K
33.7M
Auren Hoffman
Auren Hoffman@auren·
overheard: “today’s 12th graders are now less likely to have had a sip of alcohol in the previous month than the 8th graders of the 1980s." That’s crazy!
English
11
20
295
53.6K
Dmitry (Dima) Lepikhin
Dmitry (Dima) Lepikhin@lepikhin·
@agihippo Well, the issue is, I'd like to do a coffee place like in Italy, hole in the wall, 1 euro caffe normale at the bar stand, maybe ocasional cappuccino, but there is no place for it in Bay Area.
English
0
0
1
181
yi
yi@agihippo·
every other ai researcher i talk to want to start a coffee shop / cafe after everything is over. i was actually seriously looking into this. and then i realise we should be able to get AI (e.g., Gemini) to do this automatically for us. Lease a place, negotiate a good price, do branding, marketing, hire baristas. they should be even be able to do "human-use" (and call humans as tools when they have to). and then they should be able to be able to scale this up. run multiple businesses, cafes, pet shops, tuition centers, grocery marts etc...
English
20
5
154
21.2K
Dmitry (Dima) Lepikhin retweetledi
Anselm Levskaya
Anselm Levskaya@anselmlevskaya·
AI folks radically overestimate how much LLMs help for practical bio lab work and so get weirdly fixated on biorisk scifi scenarios. Lab work is gated by a researcher's personal pain tolerance, relentlessness, and a huge body of tacit knowledge passed down by apprenticeship.
Active Site@ActiveSiteBio

We ran a randomized controlled trial to see if LLMs can help novices perform molecular biology in a wet-lab. The results: LLMs may help in some aspects, but we found no significant increase at the core tasks end-to-end. That's lower than what experts predicted. Our findings 🧵

English
0
6
33
5.1K
andrew chen
andrew chen@andrewchen·
there's fuck you money there's "don't check my email" money
English
78
23
1.3K
154.4K
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
Every frontier AI lab has lost co-founder(s): - xAI: 5 of 12 gone, 1 seriously sick - OpenAI: 8 of 11 gone - Thinking Machines: 3 of 6 gone - SSI: 1 of 3 gone - DeepMind: 1 of 3 gone Except Anthropic. All 7 co-founders are still there. What’s Anthropic’s secret?
English
290
107
3.5K
331K
Roland
Roland@rolandgvc·
I left xAI. Building something new with others that left xAI. We're hiring :)
English
452
194
4K
599.3K
Benjamin De Kraker
Benjamin De Kraker@BenjaminDEKR·
So almost half of the original 12 xAI cofounders are now out, with two of them announcing they've resigned just today. (marked in blue)
Benjamin De Kraker tweet media
English
68
31
246
32.9K
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
Was surprised to learn that only 20% of the compute was spent on pre-train for a frontier model. The rest is post.
English
16
3
238
48K