Jerry Zhi-Yang He

303 posts

Jerry Zhi-Yang He banner
Jerry Zhi-Yang He

Jerry Zhi-Yang He

@_herobotics_

LLM research @ Bytedance Seed. prev. PhD at @berkeley_ai with @ancadianadragan, @facebookai, @StanfordSVL and @StanfordHRI.

Stanford, CA Katılım Kasım 2014
1.6K Takip Edilen495 Takipçiler
Jerry Zhi-Yang He retweetledi
Tenobrus
Tenobrus@tenobrus·
Donald Knuth is vibemathing now. real tough day for the stochastic-parrot crew.
Tenobrus tweet media
English
79
436
3.4K
515.7K
Jerry Zhi-Yang He retweetledi
Frank Yan
Frank Yan@FrankYan2·
As promised, here's the short film Jia Zhangke produced using Seedance 2.0 for Chinese New Year and his take on AI filmmaking
Frank Yan tweet mediaFrank Yan tweet media
English
41
253
1.6K
563K
Jerry Zhi-Yang He retweetledi
Eric Jang
Eric Jang@ericjang11·
As Rocks May Think: an interactive essay on thinking models, automated research, and where I think they are headed. Enjoy! evjang.com/2026/02/04/roc…
English
41
184
1.5K
518.4K
Dorsa Sadigh
Dorsa Sadigh@DorsaSadigh·
Just realized I haven't shared life status or been on twitter for a while, so here is a status dump 🧵 1/4
English
23
3
330
51.7K
Jerry Zhi-Yang He retweetledi
Dwarkesh Patel
Dwarkesh Patel@dwarkesh_sp·
The @ilyasut episode 0:00:00 – Explaining model jaggedness 0:09:39 - Emotions and value functions 0:18:49 – What are we scaling? 0:25:13 – Why humans generalize better than models 0:35:45 – Straight-shotting superintelligence 0:46:47 – SSI’s model will learn from deployment 0:55:07 – Alignment 1:18:13 – “We are squarely an age of research company” 1:29:23 – Self-play and multi-agent 1:32:42 – Research taste Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify. Enjoy!
English
404
1.3K
8.6K
4.1M
Jerry Zhi-Yang He retweetledi
Jason Peng
Jason Peng@xbpeng4·
I have always been surprised by how few positive samples adversarial imitation learning needs to be effective. With ADD we take this to the extreme! A differential discriminator trained with a SINGLE positive sample can still be effective for a wide range of tasks.
Ziyu (Charlotte) Zhang@ziyu_zhang73354

Training RL agents often requires tedious reward engineering. ADD can help! ADD uses a differential discriminator to automatically turn raw errors into effective training rewards for a wide variety of tasks! 🚀 Excited to share our latest work: Physics-Based Motion Imitation with Adversarial Differential Discriminators ( @SIGGRAPHAsia 2025), with Sergey Bashkirov*, Dun Yang, @YiShi_333, Michael Taylor, and @xbpeng4. 🌟 Webpage: add-moo.github.io 🌟 Code: coming soon!

English
4
22
161
16.7K
Jerry Zhi-Yang He retweetledi
Jelani Nelson
Jelani Nelson@minilek·
I’ve also been integrating LLMs into my research workflow. I spent most of Tuesday working on a problem I’ve been thinking about for a while with some collaborators. I had a conjecture on a possible way forward, and with some hours of thinking, mixing in conversations with Gemini to guide certain non-trivial calculations, Gemini ultimately spit out a proof that no approach in this family can possibly work (which I found surprising, since similar approaches worked in related settings). Maybe will say more about what the problem is after it’s fully resolved, lest I lead to us getting scooped. :) tl;dr LLMs haven’t replaced me (yet?), but certainly are making me a more efficient researcher. *work still ongoing*
Sebastien Bubeck@SebastienBubeck

Well, this time it's by Terence Tao himself: @tao/115306424727150237" target="_blank" rel="nofollow noopener">mathstodon.xyz/@tao/115306424…

English
14
18
284
62.6K
Jerry Zhi-Yang He retweetledi
Julian Schrittwieser
Julian Schrittwieser@Mononofu·
As a researcher at a frontier lab I’m often surprised by how unaware of current AI progress public discussions are. I wrote a post to summarize studies of recent progress, and what we should expect in the next 1-2 years: julian.ac/blog/2025/09/2…
English
223
812
5.9K
2M
Jerry Zhi-Yang He retweetledi
Jessy Lin
Jessy Lin@realJessyLin·
What does it take to build a human-like user simulator? // To train collaborative agents, we need better user sims. In blog post pt 2, @NickATomlin and I sketch a framework for building user simulators + open questions for research: jessylin.com/2025/09/25/use…
English
3
10
59
12K
Yu Xiang
Yu Xiang@YuXiang_IRVL·
Big day in class today! With @lfcasas7, we brought 14 SO-101 arms for students to assemble and take home for projects @UT_Dallas. Excited to see what they create by semester’s end! 🤖
English
7
13
175
32.9K
Jerry Zhi-Yang He retweetledi
Thinking Machines
Thinking Machines@thinkymachines·
Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly. The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains. thinkingmachines.ai/blog/defeating…
Thinking Machines tweet media
English
230
1.3K
7.6K
3.4M
Jerry Zhi-Yang He
Jerry Zhi-Yang He@_herobotics_·
@robot_trainer Any particular failure mode of classical motion generation methods that you have in mind? Those seem to be pretty good & robust at reactive obstacle avoidance, if engineered well
English
1
0
1
48
Nathan Ratliff
Nathan Ratliff@robot_trainer·
this is really cool. i've always thought learning-based methods were the right approach to global motion generation. nice work! (and all the demos! super robust and general system)
Jason Liu@JasonJZLiu

Ever wish a robot could just move to any goal in any environment—avoiding all collisions and reacting in real time? 🚀Excited to share our #CoRL2025 paper, Deep Reactive Policy (DRP), a learning-based motion planner that navigates complex scenes with moving obstacles—directly from point cloud input. w/ @Jiahui_Yang6709 (1/N)

English
2
3
17
2.8K
Jerry Zhi-Yang He retweetledi
Lin Yang
Lin Yang@lyang36·
Our IMO gold medal-winning AI pipeline is now model-agnostic. 🥇 What worked for Gemini 2.5 Pro now gets the same 5/6 score with GPT-5 & Grok4. This confirms the power of our verification-and-refinement pipeline to improve base model capabilities. The new code & results are live on GitHub[github.com/lyang36/IMO25]! Paper update coming soon. Huge thanks to @xai for the Grok4 API credits! #AI #LLM #IMO #MathOlympiad #OpenSource
English
19
84
1.1K
129.6K
Alexander Wei
Alexander Wei@alexwei_·
1/ I competed for Team USA at IOI in 2015, so this achievement hits home for me. The biggest highlight: we *did not* train a model specifically for IOI. Our IMO gold model actually set a new state of the art in our internal competitive programming evals. Reasoning generalizes!
Sheryl Hsu@SherylHsu02

1/n I’m thrilled to share that our @OpenAI reasoning system scored high enough to achieve gold 🥇🥇 in one of the world’s top programming competitions - the 2025 International Olympiad in Informatics (IOI) - placing first among AI participants! 👨‍💻👨‍💻

English
36
64
899
128.6K
Jerry Zhi-Yang He retweetledi
Jason Weston
Jason Weston@jaseweston·
🤖Introducing: CoT-Self-Instruct 🤖 📝: arxiv.org/abs/2507.23751 - Builds high-quality synthetic data via reasoning CoT + quality filtering - Gains on reasoning tasks: MATH500, AMC23, AIME24 & GPQA-💎 - Outperforms existing train data s1k & OpenMathReasoning - Gains on non-reasoning tasks as well: AlpacaEval & ArenaHard 🧵1/3
Jason Weston tweet media
English
1
63
377
24.3K
Jerry Zhi-Yang He retweetledi
guille
guille@angeris·
looks like Lean is popular today, so here's a little post on how/why it works and implementing a mini version of it in Julia
guille tweet media
English
4
19
158
14K
Jerry Zhi-Yang He
Jerry Zhi-Yang He@_herobotics_·
@conitzer when you click on "dive deeper in AI mode" it seems to be doing fine
Jerry Zhi-Yang He tweet media
English
0
0
1
58
Vincent Conitzer
Vincent Conitzer@conitzer·
"is it possible to swim in coffee?" -- apparently it is not because coffee is a liquid, not a solid, and you want a solid, swimmable medium.
Vincent Conitzer tweet media
English
2
1
7
1K