David Zhang (@dzhang03)

54 posts

@Yale @GoogleDeepMind | prev research @david_van_dijk @calico

New York, NY · Joined February 2024
292 Following · 68 Followers
David Zhang retweeted
Demis Hassabis (@demishassabis)
Thrilled to launch Project Genie, an experimental prototype of the world's most advanced world model. Create entire playable worlds to explore in real-time just from a simple text prompt - kind of mindblowing really! Available to Ultra subs in the US for now - have fun exploring!
381 replies · 950 reposts · 7.9K likes · 964.1K views
David Zhang retweeted
Jeff Dean (@JeffDean)
We’ve pushed out the Pareto frontier of efficiency vs. intelligence again. With Gemini 3 Flash ⚡️, we are seeing reasoning capabilities previously reserved for our largest models, now running at Flash-level latency. This opens up entirely new categories of near real-time applications that require complex thought. It’s available in the API, and rolling out today as the default model in AI Mode in Search and the Gemini app globally. Read more on the blog: bit.ly/4pTo5YU
More in thread ⬇️
[image]
53 replies · 194 reposts · 1.8K likes · 159K views
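Since the post mentions API availability, here is a minimal sketch of calling a Gemini model through the google-genai Python SDK. The model id below is an assumption for illustration, not taken from the post; check the docs for the released name.

```python
# pip install google-genai
from google import genai

client = genai.Client()  # reads the API key from the GOOGLE_API_KEY env var

response = client.models.generate_content(
    model="gemini-3-flash",  # hypothetical id, not confirmed by the post
    contents="Explain the latency/quality tradeoff of small reasoning models.",
)
print(response.text)
```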
David Zhang retweeted
ARC Prize (@arcprize)
Gemini 3 models from @Google @GoogleDeepMind have made a significant 2X SOTA jump on ARC-AGI-2 (Semi-Private Eval):
Gemini 3 Pro: 31.11%, $0.81/task
Gemini 3 Deep Think (Preview): 45.14%, $77.16/task
[image]
190 replies · 605 reposts · 4.1K likes · 2.2M views
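A quick back-of-the-envelope reading of those numbers (my own arithmetic, not from the ARC Prize team): the Deep Think preview buys roughly 14 extra points for about 95x the per-task cost.

```python
# Cost per ARC-AGI-2 percentage point, from the figures quoted above.
pro = 0.81 / 31.11            # ~$0.026 per point (Gemini 3 Pro)
deep_think = 77.16 / 45.14    # ~$1.71 per point (Deep Think Preview)
print(f"Deep Think pays ~{deep_think / pro:.0f}x more per point of accuracy")  # ~66x
```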
David Zhang retweeted
Demis Hassabis (@demishassabis)
We’ve been intensely cooking Gemini 3 for a while now, and we’re so excited and proud to share the results with you all. Of course it tops the leaderboards, including @arena, HLE, GPQA etc, but beyond the benchmarks it’s been by far my favourite model to use for its style and depth, and what it can do to help with everyday tasks.
[image]
218 replies · 485 reposts · 5.7K likes · 589.6K views
David Zhang retweeted
Google DeepMind (@GoogleDeepMind)
This is Gemini 3: our most intelligent model that helps you learn, build and plan anything. It comes with state-of-the-art reasoning capabilities, world-leading multimodal understanding, and enables new agentic coding experiences. 🧵
213 replies · 1.1K reposts · 6.5K likes · 1.7M views
David Zhang retweeted
Demis Hassabis (@demishassabis)
It's nearly 3 here, my favourite part of the night shift… locked in... 💪🚀
312 replies · 319 reposts · 6.8K likes · 1M views
David Zhang retweeted
David van Dijk (@david_van_dijk)
C2S is now open for everyone. The biological LLM that learns the language of cells. Free for academic and commercial use: c2s.bio
Join the growing community building with C2S. 🌱
11 replies · 44 reposts · 192 likes · 27.6K views
David Zhang retweeted
Sundar Pichai (@sundarpichai)
An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells. With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.
543 replies · 3.2K reposts · 21.8K likes · 6.9M views
David Zhang retweeted
David van Dijk (@david_van_dijk)
🚨 Thrilled to announce our paper “Non-Markovian Discrete Diffusion with Causal Language Models” was accepted at #NeurIPS2025! 🎉 @YaleCSDept @YaleMed @yaledatascience We introduce CaDDi, a new framework that unifies discrete diffusion and causal LMs. A quick explainer 🧵👇
[image]
2 replies · 11 reposts · 65 likes · 15.6K views
David Zhang retweeted
Yiping Lu (@2prime_PKU)
Anyone knows adam?
[image]
265 replies · 441 reposts · 4.8K likes · 634.4K views
David Zhang retweeted
Jun Cheng (@s6juncheng)
Excited to share #AlphaGenome, the start of our journey to decipher the regulatory genome! The model matches or exceeds top-performing external models on 24 out of 26 variant evaluations, across a wide range of biological modalities. 1/6
[image]
14 replies · 208 reposts · 909 likes · 87.2K views
David Zhang retweeted
Richard Socher (@RichardSocher)
If you studied algorithms, I'm sure you've heard of Dijkstra’s algorithm to find the shortest paths between nodes in a weighted graph. Super useful in scenarios such as road networks, where it can determine the shortest route from a starting point to various destinations. It had been the fastest known algorithm since 1956! Until now. The O(E + V log V) complexity just went down to O(E log^(2/3) V) for sparse graphs. It would be amazing if this kind of breakthrough came from AI that can code, but I guess we're not there yet...
[image]
23 replies · 124 reposts · 1.2K likes · 140.4K views
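For reference, here is the textbook baseline being beaten: a standard binary-heap Dijkstra in Python. Note that the O(E + V log V) bound quoted above requires a Fibonacci heap; the heapq version below runs in O((E + V) log V). This is the classical algorithm, not the new O(E log^(2/3) V) result.

```python
import heapq

def dijkstra(graph, source):
    """Shortest-path distances from `source`.

    graph: dict mapping node -> list of (neighbor, weight) pairs,
    with non-negative weights. Binary-heap version: O((E + V) log V).
    """
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry; a shorter path to u was already found
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

# Example: dijkstra({"a": [("b", 2), ("c", 5)], "b": [("c", 1)], "c": []}, "a")
# -> {"a": 0, "b": 2, "c": 3}
```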
David Zhang retweeted
Theta (@trytheta)
Introducing CUB: Humanity's Last Exam for Computer and Browser Use Agents
[image]
32 replies · 39 reposts · 251 likes · 114K views
David Zhang retweeted
Google DeepMind (@GoogleDeepMind)
Introducing AlphaEvolve: a Gemini-powered coding agent for algorithm discovery. It’s able to:
🔘 Design faster matrix multiplication algorithms
🔘 Find new solutions to open math problems
🔘 Make data centers, chip design and AI training more efficient across @Google. 🧵
[GIF]
175 replies · 1.3K reposts · 6.9K likes · 2.6M views
David Zhang retweeted
Gurvir Singh (@_gurvir_)
we've been misled to believe that manual prompt hacking is the solution to teaching LLMs how to approach complex problems. why write a "magic prompt" to pattern match for every type of problem you might care about, when LLMs have already shown extraordinary ability to self-review and self-correct given the right feedback loops

@karpathy alludes to it here, but what's missing is a memory layer so that LLMs can learn from their previous mistakes. they suffer from amnesia because they lack a mechanism to record and build upon problem solving strategies. a memory layer allows for this "system prompt learning" instead of relying on explicit human feedback

there's a lot of engineering challenges in getting this to work effectively. how do you measure which insights are effective, and how do you refine them from feedback? building a "scratchpad" of notes that can be maintained over thousands of runs and indexed efficiently to get the right notes is a non-trivial problem, and it's exactly what we're tackling at @trytheta

Quoted: Andrej Karpathy (@karpathy)

We're missing (at least one) major paradigm for LLM learning. Not sure what to call it, possibly it has a name - system prompt learning?

Pretraining is for knowledge. Finetuning (SL/RL) is for habitual behavior. Both of these involve a change in parameters but a lot of human learning feels more like a change in system prompt. You encounter a problem, figure something out, then "remember" something in fairly explicit terms for the next time. E.g. "It seems when I encounter this and that kind of a problem, I should try this and that kind of an approach/solution". It feels more like taking notes for yourself, i.e. something like the "Memory" feature but not to store per-user random facts, but general/global problem solving knowledge and strategies.

LLMs are quite literally like the guy in Memento, except we haven't given them their scratchpad yet. Note that this paradigm is also significantly more powerful and data efficient because a knowledge-guided "review" stage is a significantly higher dimensional feedback channel than a reward scalar.

I was prompted to jot down this shower of thoughts after reading through Claude's system prompt, which currently seems to be around 17,000 words, specifying not just basic behavior style/preferences (e.g. refuse various requests related to song lyrics) but also a large amount of general problem solving strategies, e.g.: "If Claude is asked to count words, letters, and characters, it thinks step by step before answering the person. It explicitly counts the words, letters, or characters by assigning a number to each. It only answers the person once it has performed this explicit counting step." This is to help Claude solve 'r' in strawberry etc.

Imo this is not the kind of problem solving knowledge that should be baked into weights via Reinforcement Learning, or at least not immediately/exclusively. And it certainly shouldn't come from human engineers writing system prompts by hand. It should come from System Prompt learning, which resembles RL in the setup, with the exception of the learning algorithm (edits vs gradient descent). A large section of the LLM system prompt could be written via system prompt learning; it would look a bit like the LLM writing a book for itself on how to solve problems.

If this works it would be a new/powerful learning paradigm, with a lot of details left to figure out (how do the edits work? can/should you learn the edit system? how do you gradually move knowledge from the explicit system text to habitual weights, as humans seem to do? etc.).

4 replies · 4 reposts · 29 likes · 3.2K views
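To make the "scratchpad" idea above concrete, here is a toy sketch of such a memory layer: record a short note after each run, then retrieve the most relevant notes by keyword overlap and prepend them to the next system prompt. Everything here (the Scratchpad class, its method names, the retrieval scheme) is illustrative guesswork, not Theta's actual system or Karpathy's proposal in any concrete form.

```python
from dataclasses import dataclass, field

@dataclass
class Scratchpad:
    """Toy memory layer for 'system prompt learning' (illustrative only).

    A real system would need learned retrieval, deduplication, and a way
    to score which notes actually improved downstream task performance.
    """
    notes: list[str] = field(default_factory=list)

    def record(self, note: str) -> None:
        """Save a problem-solving lesson after a run."""
        self.notes.append(note)

    def retrieve(self, task: str, k: int = 3) -> list[str]:
        """Return the k notes sharing the most words with the task."""
        words = set(task.lower().split())
        ranked = sorted(
            self.notes,
            key=lambda n: len(words & set(n.lower().split())),
            reverse=True,
        )
        return ranked[:k]

    def as_system_prompt(self, task: str) -> str:
        """Build a system prompt that carries lessons forward (the 'edit' step)."""
        lessons = "\n".join(f"- {n}" for n in self.retrieve(task))
        return f"Lessons from previous runs:\n{lessons}\n\nTask: {task}"

pad = Scratchpad()
pad.record("when counting letters in a word, number each character explicitly")
print(pad.as_system_prompt("count the letters in strawberry"))
```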
David Zhang retweeted
Brian Roemmele (@BrianRoemmele)
Google did a very good job with this commercial.
53 replies · 331 reposts · 4.2K likes · 381.2K views
David Zhang retweeted
Physical Intelligence (@physical_int)
We got a robot to clean up homes that were never seen in its training data! Our new model, π-0.5, aims to tackle open-world generalization. We took our robot into homes that were not in the training data and asked it to clean kitchens and bedrooms. More below⤵️
53 replies · 260 reposts · 1.6K likes · 488K views