Taiming Lu

18 posts

Taiming Lu

@TaimingLu

Ph.D student at @Princeton | Formerly @JohnsHopkins ’25, @HopkinsDSAI @JHUCompSci @jhuclsp @CCVLatJHU | AI/ML/NLP/CV

Princeton, NJ Katılım Haziran 2024

601 Takip Edilen358 Takipçiler

Taiming Lu@TaimingLu·3d

@muhan_gao Nice work!

English

103

Muhan Gao@muhan_gao·4d

🤖 We often talk about “context rot”: LLMs get worse as context grows. But once distracting information enters, is it just “a bit more noise → a bit worse performance”? Our #ICML2026 paper finds: no! 🤯 Instead, we reveal a striking "First Drop of Ink" effect: the first very few hard distractors do almost all of the damage, exactly like how one drop of ink clouding clear water. Paper link: arxiv.org/abs/2605.10828

English

10K

Taiming Lu@TaimingLu·4d

@kentonmurray Congrats!!! 🎉

English

149

Kenton Murray@kentonmurray·4d

I'm excited to announce that this Fall I will be joining the Computer Science Department at George Mason University as an Asst. Prof. I'll be expanding my lab and looking for PhD students to work on Multilingual AI problems text, video, and speech. cs.gmu.edu

English

210

16.6K

Taiming Lu@TaimingLu·4d

Teacher–student compatibility matters more than raw teacher strength. This changes how you pick a teacher: both for frontier training (where the best available teacher is often a prior generation) and for efficient small models, where "bigger teacher is better" isn't the right rule. Thanks @liuzhuang1234 for the support! arxiv: arxiv.org/abs/2605.23857 code: github.com/zlab-princeton…

English

961

Taiming Lu@TaimingLu·4d

Distillation improves generalization more readily than in-domain fit. Out-of-distribution perplexity and downstream accuracy improve more consistently than in-domain perplexity, where some configurations help OOD/downstream while doing nothing for in-domain.

English

Taiming Lu@TaimingLu·4d

Knowledge doesn't always flow downhill. We find that in LLM pretraining, a weaker teacher can improve a stronger student, and pushing the teacher further can actually hurt. New paper: Strong Teacher Not Needed? On Distillation in LLM Pretraining.

English

348

46.4K

Taiming Lu@TaimingLu·6 Nis

@DanielKhashabi Congrats Daniel!!! 🎉🎉🎉

Français

108

Daniel Khashabi 🕊️@DanielKhashabi·6 Nis

Very honored and excited to receive the NSF CAREER Award! HUGE thank you to my amazing students, collaborators, mentors, and advisors, who helped make this happen. And to my family who are the real heroes in my story! ♥️

English

198

11.1K

Taiming Lu retweetledi

Zhuang Liu@liuzhuang1234·12 Ara

Stronger Normalization-Free Transformers – new paper. We introduce Derf (Dynamic erf), a simple point-wise layer that lets norm-free Transformers not only work, but actually outperform their normalized counterparts.

English

175

1.1K

166.2K

Taiming Lu retweetledi

Jieneng Chen@jieneng_chen·22 Eki

🤯 Think better visuals mean better world models? Think again. 💥 Surprise: Agents don’t need eye candy— they need wins. Meet World-in-World, the first open benchmark that ranks world models by closed-loop task success, not pixels. We uncover 3 shocks: 1️⃣ Visuals ≠ utility 2️⃣ Action data > bigger models 3️⃣ Scaling test-time compute = more success 🤗 huggingface.co/papers/2510.18… 🌍 world-in-world.github.io 📄 arxiv.org/abs/2510.18135 github.com/World-In-World…

English

153

42.5K

Taiming Lu retweetledi

Zhuang Liu@liuzhuang1234·22 Eki

Excited to share our lab’s first open-source release: LLM-Distillation-JAX supports practical knowledge distillation configurations (distillation strength, temperature, top-k/top-p), built on MaxText designed for reproducible JAX/Flax training on both TPUs and GPUs

English

222

20.7K

Taiming Lu retweetledi

JHU Computer Science@JHUCompSci·19 Ara

Meet the AI system that can envision an entire world from a single picture. @genex_world—developed by @jieneng_chen, @YuilleAlan, @TaiMingLu, @DanielKhashabi, & @tianminshu—imagines in-depth scenarios to make informed decisions. Learn more here: hub.jhu.edu/2024/12/19/a-g…

GIF

English

1.8K

Taiming Lu retweetledi

Jieneng Chen@jieneng_chen·16 Ara

Thrilled to introduce GenEx: Generating an Explorable World. ✨ ✨ GenEx takes a single image 🖼️ and create a 3D generative world 🌍 — you can dive in for interactive exploration, and so as embodied AI agent. Follow our X for more demos: x.com/genex_world Paper on huggingface: huggingface.co/papers/2412.09… Tech details: genex.world (1/n)

English

103

10.1K

Taiming Lu retweetledi

GenEx@genex_world·13 Ara

Introducing GenEx: Turn any image into a 3D world adventure! 1️⃣ Create a fully explorable 360° world in 3D from just a single image! 2️⃣ Explore interactively or with GPT assistance. 3️⃣ Advance embodied AI with this imagined world! Check out our website: genex.world

English

10.5K

Taiming Lu retweetledi

Jieneng Chen@jieneng_chen·19 Kas

Introducing Genex: Generative World Explorer. 🧠 Humans mentally explore unseen parts of the world, revising their beliefs with imagined observations. ✨ Genex replicates this human-like ability, advancing embodied AI in planning with partial observations. (1/6)

English

164

37K

Taiming Lu retweetledi

Muhan Gao@muhan_gao·28 Haz

🤖LLMs know more long-context information than they show! 🔍Probing reveals higher accuracy than generation output. #LLMs know but don't tell.🤐 The earlier relevant information is learned within the layers, the higher the final output accuracy! 📈 (arxiv.org/abs/2406.14673)

English

2.2K

Keşfet

@muhan_gao @kentonmurray @liuzhuang1234 @DanielKhashabi @genex_world @jieneng_chen @YuilleAlan @tianminshu