Yu-Cheng Chou
@johnson111788

PhD student at @CCVLatJHU @JHU. Research Intern at @NVIDIA. Working on Embodied AI, MLLM, Video Gen.

CCVL@Johns Hopkins University · Joined May 2019
171 Following · 111 Followers
Jack AM Austin@JackAMAustin·
@johnson111788 It's genuinely wild that flying through a consistent generated world is possible right now
Yu-Cheng Chou@johnson111788·
CVPR 2026🎥We built a model that lets you fly through a generated video world. Not just generating frames — but maintaining a consistent 3D world under complex camera motion. Code, ckpt, and even the data pipeline are all open-sourced ↓ #AI #worldmodel #videogen #cvpr #drone
merve@mervenoyann·
@johnson111788 Hello! I saw you're open sourcing your weights, if you want to build a demo we'd love to provide ZeroGPU H200 for it 🤗
puppy@carbon787777·
@johnson111788 nice view, congrats on the CVPR acceptance.
WildPinesAI@wildpinesai·
@johnson111788 @Scobleizer basically a game engine that dreams instead of renders. this is how you get embodied AI training environments without hand-building every world
Yu-Cheng Chou@johnson111788·
Importantly, we open-source the entire curation pipeline:
• trajectory reconstruction (SfM / hloc)
• geometric verification
• motion consistency checks
• automatic repair & filtering
→ You can build your own OpenSafari.
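The motion-consistency stage of a pipeline like this can be illustrated with a small sketch. This is a hypothetical, simplified check (the function names and the `max_jump_ratio` threshold are assumptions, not the released code): given per-frame camera positions recovered by SfM, it rejects trajectories whose frame-to-frame jumps are implausibly large relative to the typical step, a crude detector for SfM tracking failures.

```python
# Hedged sketch of a motion-consistency filter over SfM camera
# trajectories. All names/thresholds are illustrative assumptions.
import math


def frame_steps(positions):
    """Euclidean distance between consecutive camera positions."""
    return [math.dist(a, b) for a, b in zip(positions, positions[1:])]


def is_motion_consistent(positions, max_jump_ratio=5.0):
    """Flag a trajectory as inconsistent if any single step is far
    larger than the median step (a crude SfM-failure detector)."""
    steps = frame_steps(positions)
    if not steps:
        return True
    median = sorted(steps)[len(steps) // 2]
    if median == 0.0:
        return all(s == 0.0 for s in steps)
    return max(steps) <= max_jump_ratio * median


def filter_trajectories(trajs, max_jump_ratio=5.0):
    """Filtering stage: keep only trajectories that pass the check."""
    return [t for t in trajs if is_motion_consistent(t, max_jump_ratio)]
```

A real pipeline would also verify geometry (e.g., reprojection error) and attempt repair before dropping a clip; this sketch only shows the filtering idea.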
Yu-Cheng Chou@johnson111788·
We also built OpenSafari, a dataset designed to break existing models:
• in-the-wild FPV drone videos
• large-scale camera motion
• strong parallax & elevation changes
Every trajectory is geometrically verified.
Yu-Cheng Chou@johnson111788·
The difference is most visible under aggressive motion:
• sharp turns
• large parallax
• long trajectories
Baselines collapse; ours stays consistent.
Yu-Cheng Chou@johnson111788·
Our idea: build a world memory.
At each frame, the model retrieves 3D-consistent information conditioned on the camera pose.
→ This enables stable generation under 6-DoF motion
→ Even for long trajectories in complex outdoor scenes
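The retrieval idea can be sketched in a few lines. This is a toy illustration of pose-conditioned memory lookup, not the paper's architecture: the `WorldMemory` class and its nearest-pose retrieval are assumptions made for exposition. Each entry pairs a camera position with features observed there; generation at a new pose retrieves the k nearest stored entries so the conditioning context is spatially consistent.

```python
# Minimal, hypothetical sketch of a pose-conditioned "world memory".
# Class name, entry format, and distance metric are illustrative only.
import math


class WorldMemory:
    def __init__(self):
        self.entries = []  # list of (position, features) pairs

    def write(self, position, features):
        """Store features observed at a given camera position."""
        self.entries.append((position, features))

    def retrieve(self, position, k=3):
        """Return features of the k entries closest to the query pose,
        so generation is conditioned on 3D-consistent context."""
        ranked = sorted(self.entries, key=lambda e: math.dist(e[0], position))
        return [feats for _, feats in ranked[:k]]
```

A real system would use full 6-DoF poses (rotation as well as translation) and learned features rather than raw nearest-neighbor lookup, but the conditioning pattern is the same.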
Yu-Cheng Chou@johnson111788·
But in reality, most video generation models today:
❌ Only work on narrow domains (e.g., real estate scenes)
❌ Break as soon as the camera moves
❌ Fail to follow the camera trajectory
The core issue? → No persistent world representation
Yu-Cheng Chou@johnson111788·
@CVPR Can we put the title in the main text to save space?
Yu-Cheng Chou retweeted
Wufei Ma@wufeima·
Join us at #ICCV2025 for the 1st Embodied Spatial Reasoning Workshop! We're thrilled to host amazing speakers from industry and academia, featuring Sifei Liu, @xiaolonw, @xf1280, and @kate_saenko_, to discuss frontiers of spatial reasoning, embodied agents, and robotics! 🔗 tinyurl.com/yn7b6mu6
Yu-Cheng Chou@johnson111788·
Deep gratitude to our advisors @cihangxie and @never1andd for their guidance and support throughout this work. Thank you! 🙏