Jerry Zhi-Yang He

303 posts

Jerry Zhi-Yang He

@_herobotics_

LLM research @ Bytedance Seed. prev. PhD at @berkeley_ai with @ancadianadragan, @facebookai, @StanfordSVL and @StanfordHRI.

Stanford, CA Katılım Kasım 2014

1.6K Takip Edilen495 Takipçiler

Sabitlenmiş Tweet

Jerry Zhi-Yang He@_herobotics_·17 Eki

Is your robot 🤖 safe around humans🚸? Or will it fail catastrophically🤕? Excited to introduce "Natural Adversarial Frontier", a framework for probing the robustness of Human-Robot Interactions. To appear at #CoRL2023 🧵(1/8) Joint work with @ancadianadragan @daniel_s_brown @ZackoryErickson arxiv.org/abs/2310.10610 ood-human.github.io

English

10.5K

Jerry Zhi-Yang He retweetledi

Tenobrus@tenobrus·3 Mar

Donald Knuth is vibemathing now. real tough day for the stochastic-parrot crew.

English

436

3.4K

515.7K

Jerry Zhi-Yang He retweetledi

Frank Yan@FrankYan2·16 Şub

As promised, here's the short film Jia Zhangke produced using Seedance 2.0 for Chinese New Year and his take on AI filmmaking

English

253

1.6K

563K

Jerry Zhi-Yang He retweetledi

Eric Jang@ericjang11·5 Şub

As Rocks May Think: an interactive essay on thinking models, automated research, and where I think they are headed. Enjoy! evjang.com/2026/02/04/roc…

English

184

1.5K

518.4K

Jerry Zhi-Yang He@_herobotics_·2 Ara

@DorsaSadigh Congrats Dorsa! Really happy for you!

English

184

Dorsa Sadigh@DorsaSadigh·1 Ara

Just realized I haven't shared life status or been on twitter for a while, so here is a status dump 🧵 1/4

English

330

51.7K

Jerry Zhi-Yang He retweetledi

Dwarkesh Patel@dwarkesh_sp·25 Kas

The @ilyasut episode 0:00:00 – Explaining model jaggedness 0:09:39 - Emotions and value functions 0:18:49 – What are we scaling? 0:25:13 – Why humans generalize better than models 0:35:45 – Straight-shotting superintelligence 0:46:47 – SSI’s model will learn from deployment 0:55:07 – Alignment 1:18:13 – “We are squarely an age of research company” 1:29:23 – Self-play and multi-agent 1:32:42 – Research taste Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify. Enjoy!

English

404

1.3K

8.6K

4.1M

Jerry Zhi-Yang He@_herobotics_·15 Eki

@WinstonGu_ very cool!

English

Gao Jiawei@WinstonGu_·14 Eki

It would be really exciting if someone could build on this work for humanoid robots! Original code at github.com/Winston-Gu/Coo…

Gao Jiawei@WinstonGu_

Imagine a future where you can ask humanoid robots to clean your room, but some items, like heavy sofas, are too challenging for just one robot to move. Introducing CooHOI, a learning-based framework designed for the cooperative transportation of objects by multiple humanoid robots. 🤖🤼🤖 Our work has been accepted as Spotlight at NeurIPS 2024. Website: gao-jiawei.com/Research/CooHO…

English

7.2K

Jerry Zhi-Yang He retweetledi

Jason Peng@xbpeng4·7 Eki

I have always been surprised by how few positive samples adversarial imitation learning needs to be effective. With ADD we take this to the extreme! A differential discriminator trained with a SINGLE positive sample can still be effective for a wide range of tasks.

Ziyu (Charlotte) Zhang@ziyu_zhang73354

Training RL agents often requires tedious reward engineering. ADD can help! ADD uses a differential discriminator to automatically turn raw errors into effective training rewards for a wide variety of tasks! 🚀 Excited to share our latest work: Physics-Based Motion Imitation with Adversarial Differential Discriminators ( @SIGGRAPHAsia 2025), with Sergey Bashkirov*, Dun Yang, @YiShi_333, Michael Taylor, and @xbpeng4. 🌟 Webpage: add-moo.github.io 🌟 Code: coming soon!

English

161

16.7K

Jerry Zhi-Yang He retweetledi

Jelani Nelson@minilek·3 Eki

I’ve also been integrating LLMs into my research workflow. I spent most of Tuesday working on a problem I’ve been thinking about for a while with some collaborators. I had a conjecture on a possible way forward, and with some hours of thinking, mixing in conversations with Gemini to guide certain non-trivial calculations, Gemini ultimately spit out a proof that no approach in this family can possibly work (which I found surprising, since similar approaches worked in related settings). Maybe will say more about what the problem is after it’s fully resolved, lest I lead to us getting scooped. :) tl;dr LLMs haven’t replaced me (yet?), but certainly are making me a more efficient researcher. *work still ongoing*

Sebastien Bubeck@SebastienBubeck

Well, this time it's by Terence Tao himself: @tao/115306424727150237" target="_blank" rel="nofollow noopener">mathstodon.xyz/@tao/115306424…

English

284

62.6K

Jerry Zhi-Yang He retweetledi

Julian Schrittwieser@Mononofu·28 Eyl

As a researcher at a frontier lab I’m often surprised by how unaware of current AI progress public discussions are. I wrote a post to summarize studies of recent progress, and what we should expect in the next 1-2 years: julian.ac/blog/2025/09/2…

English

223

812

5.9K

Jerry Zhi-Yang He retweetledi

Jessy Lin@realJessyLin·26 Eyl

What does it take to build a human-like user simulator? // To train collaborative agents, we need better user sims. In blog post pt 2, @NickATomlin and I sketch a framework for building user simulators + open questions for research: jessylin.com/2025/09/25/use…

English

12K

Jerry Zhi-Yang He@_herobotics_·14 Eyl

@michellearning @medra_ai Welcome back!

English

245

Michelle Lee@michellearning·13 Eyl

The exciting news I never announced in 2021: I’m going to join NYU Courant as assistant professor! But that never happened because I decided to start @medra_ai instead 😅 Also hello world, I’m back on X four years later 👋

Michelle Lee@michellearning

I passed my dissertation defense! Some other exciting news to be announced soon 🤩

English

197

46.8K

Jerry Zhi-Yang He@_herobotics_·12 Eyl

@YuXiang_IRVL @lfcasas7 @UT_Dallas This is super cool!

English

135

Yu Xiang@YuXiang_IRVL·10 Eyl

Big day in class today! With @lfcasas7, we brought 14 SO-101 arms for students to assemble and take home for projects @UT_Dallas. Excited to see what they create by semester’s end! 🤖

English

175

32.9K

Jerry Zhi-Yang He retweetledi

Thinking Machines@thinkymachines·10 Eyl

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly. The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains. thinkingmachines.ai/blog/defeating…

English

230

1.3K

7.6K

3.4M

Jerry Zhi-Yang He@_herobotics_·10 Eyl

@robot_trainer Any particular failure mode of classical motion generation methods that you have in mind? Those seem to be pretty good & robust at reactive obstacle avoidance, if engineered well

English

Nathan Ratliff@robot_trainer·9 Eyl

this is really cool. i've always thought learning-based methods were the right approach to global motion generation. nice work! (and all the demos! super robust and general system)

Jason Liu@JasonJZLiu

Ever wish a robot could just move to any goal in any environment—avoiding all collisions and reacting in real time? 🚀Excited to share our #CoRL2025 paper, Deep Reactive Policy (DRP), a learning-based motion planner that navigates complex scenes with moving obstacles—directly from point cloud input. w/ @Jiahui_Yang6709 (1/N)

English

2.8K

Jerry Zhi-Yang He retweetledi

Lin Yang@lyang36·27 Ağu

Our IMO gold medal-winning AI pipeline is now model-agnostic. 🥇 What worked for Gemini 2.5 Pro now gets the same 5/6 score with GPT-5 & Grok4. This confirms the power of our verification-and-refinement pipeline to improve base model capabilities. The new code & results are live on GitHub[github.com/lyang36/IMO25]! Paper update coming soon. Huge thanks to @xai for the Grok4 API credits! #AI #LLM #IMO #MathOlympiad #OpenSource”

English

1.1K

129.6K

Jerry Zhi-Yang He retweetledi

yshan@yshan783399·20 Ağu

We are thrilled to introduce the Seed-OSS family of open-source LLMs, developed by ByteDance's Seed Team. GitHub: github.com/ByteDance-Seed… HuggingFace: huggingface.co/collections/By… Feel free to try it out and share your feedback!

English

207

40.7K

Jerry Zhi-Yang He@_herobotics_·11 Ağu

@alexwei_ Will the solutions be posted on github similar to IMO results?

English

334

Alexander Wei@alexwei_·11 Ağu

1/ I competed for Team USA at IOI in 2015, so this achievement hits home for me. The biggest highlight: we *did not* train a model specifically for IOI. Our IMO gold model actually set a new state of the art in our internal competitive programming evals. Reasoning generalizes!

Sheryl Hsu@SherylHsu02

1/n I’m thrilled to share that our @OpenAI reasoning system scored high enough to achieve gold 🥇🥇 in one of the world’s top programming competitions - the 2025 International Olympiad in Informatics (IOI) - placing first among AI participants! 👨‍💻👨‍💻

English

899

128.6K

Jerry Zhi-Yang He retweetledi

Jason Weston@jaseweston·1 Ağu

🤖Introducing: CoT-Self-Instruct 🤖 📝: arxiv.org/abs/2507.23751 - Builds high-quality synthetic data via reasoning CoT + quality filtering - Gains on reasoning tasks: MATH500, AMC23, AIME24 & GPQA-💎 - Outperforms existing train data s1k & OpenMathReasoning - Gains on non-reasoning tasks as well: AlpacaEval & ArenaHard 🧵1/3