David Jin

116 posts

David Jin

David Jin

@DaviJin

Ph.D. student in CSE @MIT; B.S. in Physics & Data Science @Caltech & @GrinnellCollege

Katılım Haziran 2015
614 Takip Edilen104 Takipçiler
Ana Dodik
Ana Dodik@ana_dodik·
Apparently the MIT Slack has an emoji that looks like the normal eyes emoji (👀) but animates into an eye roll every 6 seconds or so. I have been unknowingly eye rolling at people for who knows how long. If you got one from me, I wasn't being sassy, I swear! 🪦💐
English
2
0
9
923
David Jin retweetledi
Linlu Qiu
Linlu Qiu@linluqiu·
Language is discrete. Language models don’t have to be. 🧚Introducing ELF🧚‍♀️: Embedded Language Flows—a class of diffusion models in continuous embedding space based on continuous-time Flow Matching 🧵
Linlu Qiu tweet media
English
15
127
790
128.1K
David Jin retweetledi
Minghao Guo
Minghao Guo@GuoMh14·
I was very excited to saw a really interesting independent GeoPT experiment from Hiroaki Ozasa. He tested the original GeoPT checkpoint on a completely new OOD physics setting: heat-sink solid conduction / heat transfer. This was not a benchmark from our paper. The GeoPT checkpoint was pretrained on car, airplane, and watercraft geometries. No thermal labels. No heat-transfer supervision. No task-specific thermal pretraining. And yet, it transferred. Link to the post: linkedin.com/feed/update/ur…
Minghao Guo tweet mediaMinghao Guo tweet media
English
1
4
13
1.3K
Jack Cook
Jack Cook@jackcookjack·
Now seems like a good time to share that I’ve recently joined @thinkymachines to work on pretraining! Very excited to work on the future of human-AI collaboration with this amazing team.
Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…

English
12
6
138
10.9K
Yinuo Ren
Yinuo Ren@Yinuo_Ren·
I am very happy to share that I successfully defended my PhD dissertation! My deepest thanks to my wonderful advisors, @lexing_ying and @grantrotskoff, and to my committee, Profs. Emmanuel Candès, George Papanicolaou, and Julia Palacios. Grateful for this milestone and excited for what lies ahead!
Yinuo Ren tweet media
English
10
2
69
5.5K
David Jin retweetledi
Xuxin Cheng
Xuxin Cheng@xuxin_cheng·
Excited to share that ARI (Assured Robot Intelligence) is joining @Meta! When we co-founded ARI a year ago, the mission was clear: build humanoid intelligence for the real world. Joining Meta Superintelligence Labs (MSL), we'll continue advancing frontier robotics models toward physical superintelligence in the physical world. Huge thanks to my co-founders, the incredible ARI team, and our investors led by @aixventureshq for backing this from day one. This is just the beginning.
Bloomberg@business

Meta Platforms Inc. has acquired Assured Robot Intelligence, a startup developing artificial intelligence models for robots, as part of a major initiative to build humanoid technology. bloomberg.com/news/articles/…

English
26
13
255
19.5K
David Jin retweetledi
Heng Yang
Heng Yang@hankyang94·
Just had a single-author paper accepted to #RSS2026! arxiv.org/abs/2604.21456 Motivated by growing interest in differentiable world models and physics simulators, we ask whether there is a unified principle for combining sampling-based global “exploration” with gradient-based local “exploitation” in trajectory and policy optimization with differentiable dynamics. By viewing control through the control-as-inference lens—recasting optimization as sampling from an unnormalized Boltzmann distribution defined by an energy function—Tempered Sequential Monte Carlo (TSMC) naturally integrates importance sampling with gradient-based Hamiltonian Monte Carlo. The key idea behind TSMC is to define a tempering path that gradually transforms an easy-to-sample prior into a complex, multi-modal posterior—or equivalently, deforms a convex energy landscape into a nonconvex one (graduated non-convexity)! We implement TSMC for both trajectory and policy optimization. On small- to medium-scale problems, it appears broadly applicable and compares favorably with state-of-the-art baselines. Excited to explore whether TSMC can scale to large-scale planning with complex, high-dimensional dynamics!
Heng Yang tweet media
English
4
20
202
12.3K
Garry Tan
Garry Tan@garrytan·
出污泥而不染
日本語
164
23
629
184.5K
Jaun
Jaun@jaunelia1101·
@DaviJin @pfau @ziv_ravid Its not. Julia is also pretty good. But I think JAX is optimized for TPUs and TPUs are cheaper than GPUs on cloud.
English
1
0
0
31
Ravid Shwartz Ziv
Ravid Shwartz Ziv@ziv_ravid·
I hate Jax, such a terrible framework
English
8
1
123
37K
Jaun
Jaun@jaunelia1101·
@DaviJin @pfau @ziv_ravid It has XLA. So you can do quantum mechanics with it which is a lot of linear algebra number crunching.
English
1
0
0
27
David Pfau
David Pfau@pfau·
@ziv_ravid It's pretty much the only one you can use for serious scientific computing.
English
7
0
92
8.1K
David Jin retweetledi
Hanchen Li
Hanchen Li@lihanc02·
Prompt Learning does not scale for parallel agents. More parallel agents 🤖 = worse prompts 😭 Why? Processing too many trajectories concurrently damages the prompt update process 🐝 We fix this with Combee : → preserves high-quality learnt system prompt → scales to more than 80 concurrent agents → up to 17× speedup without quality drop on top of ACE and GEPA 🥽Use Cases: 1. Prompt learning on large scale collected agent traces 2. Parallel agent learning online with fast knowledge sharing Read more below to learn how agents actually learn at scale ⬇️
Hanchen Li tweet media
English
5
26
177
57.3K
David Jin retweetledi
Han Zheng
Han Zheng@hanzheng_7·
🚀The era of autonomous multi-agent discovery is arriving! @karpathy 🪸Excited to share CORAL, our new work on autonomous multi-agent systems for open-ended scientific discovery. 🙅‍♂️A key limitation of many current “self-evolving” frameworks is that agents still operate inside tightly constrained loops — they mutate solutions, but they do not truly decide how to explore. In CORAL, we push toward genuine autonomy: Agents decide 🔍 what to explore 🧠 what knowledge to store ♻️ which ideas to reuse 🧪 when to test hypotheses 🔥One of the most interesting findings: A single autonomous agent already outperforms fixed evolutionary search, but the biggest gains emerge when multiple agents form a research community. 💪Over 50% of breakthroughs in multi-agent runs come from building on other agents’ discoveries. This suggests that knowledge reuse and collaboration are central to scalable automated discovery. 🏅Across 10+ difficult tasks in algorithmic discovery and system optimization, CORAL achieves state-of-the-art performance while improving efficiency by 3–10×. 📄 Paper: arxiv.org/abs/2604.01658… 💻 Code: github.com/Human-Agent-So… 💡AlphaXiv: alphaxiv.org/abs/2604.01658 #agentic #llms #selfevolvingagent #multiagent #autoresearch #alphaevolve
Han Zheng tweet mediaHan Zheng tweet mediaHan Zheng tweet mediaHan Zheng tweet media
English
16
114
704
52.9K
David Jin retweetledi
Ao Qu
Ao Qu@ao_qu18465·
🚀 The era of autonomous multi-agent discovery has begun. Most “self-evolving” scientific discovery frameworks are still tightly constrained: LLMs often just perform one-step mutations inside fixed evolutionary search loops. But that is not real autonomy. Agents still cannot truly decide: 🔍 what to explore 🧠 what knowledge to store ♻️ which past attempts to reuse 🧪 when to test With CORAL, we ask: ❓ What happens if we give agents much more autonomy to explore the scientific frontier? 💡 Our answer: A single autonomous agent already outperforms fixed evolutionary search. But the bigger leap comes when multiple autonomous agents form a research community: 🤝 They explore different directions 🧠 accumulate reusable knowledge and skills 💬 communicate with each other 🌍 and push the frontier together We introduce CORAL, the first framework for autonomous multi-agent evolution for open-ended discovery. 🥇 Across 10+ tasks in algorithmic discovery, system optimization, and kernel engineering from Frontier-CS, ADRS, AlphaEvolve, etc, CORAL achieves SOTA and improves search efficiency by 3–10× over prior fixed evolutionary-search frameworks. 🔬 Why does autonomy help? Our analysis shows two main reasons: 🧪 Local verification: agents run local tests before expensive evaluations, which is especially powerful for coding tasks. ♻️ Knowledge reuse: on knowledge-intensive tasks like polyominoes and kernel engineering, agents create and reuse knowledge artifacts at far higher rates than on simple tuning/search tasks like circle packing. ✨ Even more exciting: Over 50% of multi-agent breakthroughs come from building on other agents’ discoveries. Multi-agent exploration is also far more diverse than single-agent search. We believe CORAL opens up an exciting new space for automated discovery systems. 📬 If you are interested in collaborating, let’s talk. 📄 Paper: arxiv.org/abs/2604.01658… 💻 Code: github.com/Human-Agent-So… 💡AlphaXiv: alphaxiv.org/abs/2604.01658 #agentic #llms #selfevolvingagent #multiagent #autoresearch #alphaevolve
Ao Qu tweet mediaAo Qu tweet mediaAo Qu tweet mediaAo Qu tweet media
English
17
77
466
39.2K
David Jin
David Jin@DaviJin·
@Ji_Ha_Kim Amazing! I was also thinking about rational iteration and had similar result but I did not have a good cholesky implementation. This is impressive!
English
0
0
0
35
David Jin retweetledi
Ant Ling
Ant Ling@AntLingAGI·
🚀 Linear Attention is unlocking million-token context windows by dropping computational complexity from O(N^2) to O(N), but software is increasingly bottlenecking the hardware. Meet cuLA (CUDA Linear Attention): hand-written kernels using CuTe DSL & CUTLASS C++ to extract maximum performance on NVIDIA GPUs. A drop-in replacement for FLA designed to push hardware to its absolute limits.
English
6
49
386
91.2K