Jieneng Chen (@jieneng_chen) - Twitter Profile

Jieneng Chen retweeted

Huge congratulations to @jieneng_chen and collaborators on being selected for an oral presentation (top 6% of submissions!) -- very well deserved! This is exciting work, and I hope it gets the attention it truly deserves.

Jieneng Chen@jieneng_chen

🤯 Think better visuals mean better world models? Think again. 💥 Surprise: Agents don’t need eye candy— they need wins. Meet World-in-World, the first open benchmark that ranks world models by closed-loop task success, not pixels. We uncover 3 shocks: 1️⃣ Visuals ≠ utility 2️⃣ Action data > bigger models 3️⃣ Scaling test-time compute = more success 🤗 huggingface.co/papers/2510.18… 🌍 world-in-world.github.io 📄 arxiv.org/abs/2510.18135 github.com/World-In-World…

Jieneng Chen@jieneng_chen·6 Dec

@vishalm_patel @jhuclsp @HopkinsEngineer Congrats!!

Vishal Patel@vishalm_patel·5 Dec

Honored to be named an IEEE Fellow for contributions to image processing, computer vision & biometrics. Also grateful to be an AAAI Senior Member and a 2025 Clarivate Highly Cited Researcher. Huge thanks to my students, mentors & collaborators! @jhuclsp @HopkinsEngineer

Jieneng Chen@jieneng_chen·26 Nov

@shengyangzhuang great question. rn we’re focusing on monocular video — it’s more challenging but also much more common in the real world. That said, extending to multi-view setups is definitely feasible.

Shengyang Zhuang@shengyangzhuang·26 Nov

@jieneng_chen Looks cool! Does it work for multi view videos for an animal to reconstruct its 4D mesh?

Jieneng Chen@jieneng_chen·25 Nov

A meaningful step forward for pet science. 🐾 Pets are our companions, yet modeling them in 3D is extremely hard due to limited data compared to humans. Thrilled to share that our 4D-Animal project is now online! We reconstruct animatable 3D animals from videos without requiring sparse keypoint annotations. paper: arxiv.org/pdf/2507.10437 code: github.com/zhongshsh/4D-A… Led by Shanshan Zhong (now a PhD student at CMU LTI) — a great journey exploring 4D vision together.

Jieneng Chen@jieneng_chen·26 Nov

@mangahomanga Thanks, Homanga! Really appreciate it.

Homanga Bharadhwaj@mangahomanga·26 Nov

@jieneng_chen Cool stuff, congrats!!

Jieneng Chen@jieneng_chen·25 Oct

Also, when it comes to world models for embodied agents, visual realism doesn’t necessarily translate to functional intelligence. You can explore our recent closed-loop evaluation here: world-in-world.github.io

C Zhang@ChongZitaZhang

On world model / egocentric visual dynamics model, also on building robotic simulation, also on building robotic genAI models: Being visually realistic doesn't mean being physically accurate and semantically correct.

Jieneng Chen retweeted

Daniel Khashabi 🕊️@DanielKhashabi·24 Oct

The field has long been obsessed with judging World Models by their visuals—but that misses the point. In @jieneng_chen’s 𝐖𝐨𝐫𝐥𝐝-𝐢𝐧-𝐖𝐨𝐫𝐥𝐝, we propose the first closed-loop benchmark that compares WMs by 𝐞𝐦𝐛𝐨𝐝𝐢𝐞𝐝 𝐬𝐮𝐜𝐜𝐞𝐬𝐬, rather than 𝐯𝐢𝐬𝐮𝐚𝐥 𝐚𝐩𝐩𝐞𝐚𝐥. This is a major step toward 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛𝑎𝑙 evaluation of world models.

Jieneng Chen retweeted

Yilun Du@du_yilun·22 Oct

Are SOTA video models good world models for embodied agents? We present a benchmark evaluating this. We find that:
- Visuals ≠ utility
- Action data > bigger models
- Test-time planning improves accuracy
- Lots of room for improvement for all video models

Jieneng Chen@jieneng_chen·22 Oct

🤯 Think better visuals mean better world models? Think again. 💥 Surprise: Agents don’t need eye candy— they need wins. Meet World-in-World, the first open benchmark that ranks world models by closed-loop task success, not pixels. We uncover 3 shocks: 1️⃣ Visuals ≠ utility 2️⃣ Action data > bigger models 3️⃣ Scaling test-time compute = more success 🤗 huggingface.co/papers/2510.18… 🌍 world-in-world.github.io 📄 arxiv.org/abs/2510.18135 github.com/World-In-World…

Jieneng Chen@jieneng_chen·20 Oct

Dr. Kate Saenko @kate_saenko_ from @Meta will give a keynote talk titled “Blind Spots in Multimodal AI Models.”

Jieneng Chen@jieneng_chen·20 Oct

Dr. Fei Xia @xf1280 from @GoogleDeepMind will give a keynote titled “Gemini Robotics 1.5: Generalist Robots with Advanced Embodied Reasoning, Thinking and Motion Transfer.”

Jieneng Chen@jieneng_chen·20 Oct

🌺 Learn the next frontier of intelligence? Join us for the @ICCVConference workshop Embodied Spatial Reasoning (esr-2025.github.io) 🗓️ Oct 20 | 9:00–12:00 📍 Hawaii Convention Center, Ballroom A. Featuring leading voices: @Sifei30488L @xiaolonw @xf1280 @kate_saenko_

Jieneng Chen@jieneng_chen·23 May

@steph_milani @JHUCompSci @NYU_Courant Congrats! I really enjoyed your talk in Malone.

Stephanie Milani@steph_milani·21 May

Another life update!! 🎉 I’m joining @JHUCompSci as an Assistant Professor starting Fall 2026! Apply to work with me on reinforcement learning, foundation models, & human-centered AI. Let’s build better AI agents 🤖🙆‍♀️🦀 Before that, I’ll join @NYU_Courant as an Assistant Professor/Faculty Fellow. Excited to spend a year in NYC!