Wenhao Zhu

114 posts

Wenhao Zhu

@Wenhao_NLP

AI researcher@ByteDance Seed | prev. @EdinburghNLP | Multilingual LLM & machine translation

Edinburgh, Scotland Katılım Ekim 2019

794 Takip Edilen588 Takipçiler

Wenhao Zhu@Wenhao_NLP·26 Nis

@natolambert any plan to shanghai?

हिन्दी

106

Nathan Lambert@natolambert·26 Nis

In Beijing and Hangzhou this week — want to talk to more AI researchers! Reach out.

English

512

29.4K

Wenhao Zhu@Wenhao_NLP·3 Mar

@JustinLin610 Best wishes, mate

English

388

Junyang Lin@JustinLin610·3 Mar

me stepping down. bye my beloved qwen.

English

1.7K

730

13.6K

6.6M

Wenhao Zhu@Wenhao_NLP·6 Oca

@_TobiasLee @XiaomiMiMo 这是插件嘛？

中文

131

Lei Li@_TobiasLee·5 Oca

已经把沉浸式翻译里面的默认模型设置成了 @XiaomiMiMo 的 V2-Flash 比之前一众模型快太多了，翻译质量也很在线

中文

1.3K

Wenhao Zhu@Wenhao_NLP·2 Oca

@SonglinYang4 @MITEECS @thinkymachines Congrats!

English

113

Songlin Yang@SonglinYang4·1 Oca

Life update at the end of 2025: I’ve completed my PhD at @MITEECS and joined @thinkymachines to work on LLM archs

English

1.7K

86.2K

Wenhao Zhu@Wenhao_NLP·30 Ara

@zhang_benita 2025年最强播客👍

中文

225

张小珺 Xiaojun Zhang@zhang_benita·30 Ara

Manus决定出售前最后的访谈。 xiaoyuzhoufm.com/episode/695331…

中文

203

40.6K

Wenhao Zhu@Wenhao_NLP·10 Ara

Calling for papers! 📢 Join us at the Multilingual Multicultural Evaluation (MME) workshop co-located at EACL.

Vilém Zouhar @ EACL@zouharvi

Do you have work on resources, metrics & methodologies for evaluating multilingual systems? Share it at the MME workshop🕵️co-located at EACL. Direct submission deadline in 10 days! (December 19th)! …al-multicultural-evaluation.github.io

English

453

Wenhao Zhu@Wenhao_NLP·25 Kas

@xuandongzhao hey, what kind of task did you try?

English

Xuandong Zhao@xuandongzhao·24 Kas

Does anyone else feel the same? Gemini 3 Pro hasn’t been great for my daily use… feels worse than Gemini 2.5 Pro. Guess I had my expectations way too high...

English

3.3K

Wenhao Zhu@Wenhao_NLP·12 Kas

@_TobiasLee saw you, mate

English

Lei Li@_TobiasLee·12 Kas

🥳🥳

Fuli Luo@_LuoFuli

Intelligence will inevitably evolve from language to the physical world, unlocking spatial intelligence for multi-modal perception, reasoning, generation, and action—essential for true AGI. I'm working on building this at @XiaomiMiMo, spearheading a creative and talented team!

ART

5.6K

Wenhao Zhu retweetledi

Simran Khanuja@simi_97k·18 Eki

📢 Announcing the First Workshop on Multilingual and Multicultural Evaluation (MME) — co-located with #EACL2026 🇲🇦 📅 Mar 24–29, 2026 | Rabat, Morocco MME focuses on resources, metrics & methodologies for evaluating multilingual systems! …al-multicultural-evaluation.github.io 🗓️ Submit by Dec 19, 2025

English

5.9K

Wenhao Zhu@Wenhao_NLP·22 Ağu

Exciting to see the great potential of DuPO in scenarios where supervision label is scarce.

Shuaijie She ✈️ ICLR26@kevinprossj

🔥 Thrilled to introduce DuPO (Dual Learning-based Preference Optimization) - DuPO enables LLMs to get reliable and scalable self-supervision through duality-derived rewards. - General application in various tasks, eg, math reasoning and multingual translation. - Strong performance on various backbones, excelling both as a reward for training and as a reranker for inference. 🤗 Paper: huggingface.co/papers/2508.14… 📝 Blog: shesj-note.notion.site/dupo

English

599

Wenhao Zhu retweetledi

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·21 Ağu

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization "We present DuPO, a dual learning-based preference optimization framework that generates annotation-free feedback via a generalized duality" "DuPO decomposes a primal task’s input into known and unknown components, then constructs its dual task to reconstruct the unknown part using the primal output and known information (e.g., reversing math solutions to recover hidden variables"

Tanishq Mathew Abraham, Ph.D. tweet media

English

135

10.8K

Wenhao Zhu@Wenhao_NLP·12 Ağu

@sarahookr @Cohere_Labs @cohere You’ve done an incredible job leading Cohere and empowering the multilingual community. Wishing you the best in your next adventure!

English

151

Sara Hooker@sarahookr·11 Ağu

It has been an incredible honor to spend the past few years leading @Cohere_Labs @cohere . This has been the adventure of a lifetime. However, after much deliberation, I made a tough decision 2 months ago it is time to say goodbye.

English

133

1.3K

138.1K

Wenhao Zhu retweetledi

Deedy@deedydas·11 Ağu

Bytedance just dropped realtime voice translation 3x faster than before, with only a ~3s lag! Seed LiveInterp 2 is a full duplex speech-to-speech model with >70% correctness. When this makes it to video calls, it'll open up previously impossible connections.

English

139

1.1K

127.1K

Wenhao Zhu@Wenhao_NLP·26 Tem

@Leo_Xu98 lol🤣

Fangzhi Xu@Leo_Xu98·25 Tem

@Wenhao_NLP yes, your handsome face surprises me😎🥸

English

Wenhao Zhu@Wenhao_NLP·25 Tem

The video in the link will surprise you. Trust me!

Shanbo Cheng@cshanbo

Not a social media/ X person, but still glad to announce Seed LiveInterpret 2.0. In short, it is an end-to-end, full duplex speech-to-speech simultaneous interpretation model. Achieves high-quality, ultra-low latency S2S translation. Website: seed.bytedance.com/en/seed_livein…

English

606

Wenhao Zhu@Wenhao_NLP·22 Tem

@devoto_alessio @PMinervini @simeng_ssun I'd like to recommend our LongBioBench as well! It supports infinite-length evaluation and enables controllable examination. arxiv.org/pdf/2506.02921

English

Alessio Devoto@devoto_alessio·22 Tem

@PMinervini @simeng_ssun Good point! We support LongBench v1 and v2 in KVPress! As for MMLongBench, we are not planning to include it as we don't have KV Cache compression methods for VLMs (for now)

English

Alessio Devoto@devoto_alessio·21 Tem

🏆 Our @nvidia KV Cache Compression Leaderboard is now live! Compare state-of-the-art compression methods side-by-side with KVPress. See which techniques are leading in efficiency and performance. 🥇 huggingface.co/spaces/nvidia/…

English

256

18.5K

Wenhao Zhu@Wenhao_NLP·22 Tem

Could multi-turn interaction the next promising direction for scaling?

Multi-Turn Interaction LLM Workshop @ NeurIPS 2025@mti_neurips

🚀 Call for Papers — @NeurIPSConf 2025 Workshop Multi-Turn Interactions in LLMs 📅 December 6/7 · 📍 San Diego Convention Center Join us to shape the future of interactive AI. Topics include but are not limited to: 🧠 Multi-Turn RL for Agentic Tasks (e.g., web & GUI agents, tool use) 🤝 Human-AI Interaction over time 🛡️ Alignment across extended interactions 📏 Evaluation of long-horizon tasks 🧩 Social learning, Open-Endedness, trust, and more 🌟 Featuring an all-star speaker lineup: Dawn Song @dawnsongtweets (UC Berkeley) Jason Weston @jaseweston (Meta FAIR) Natasha Jaques @natashajaques (University of Washington & Google DeepMind) Tim Rocktäschel @_rockt (UCL & DeepMind, tentative) Diyi Yang @Diyi_Yang (Stanford) Peter Henderson @PeterHndrsn (Princeton) Yu Su @ysu_nlp (OSU) Hannah Rose Kirk @hannahrosekirk (Oxford) 📣 Updates: Follow us here & spread the word! #NeurIPS2025 #LLMs #AIAlignment #MultiAgent #ReinforcementLearning #LanguageAgents #InteractiveAI

English

773

Wenhao Zhu retweetledi

Zeyu Huang@ZeroyuHuang·18 Tem

🚀 Introducing Prefix-RFT to blend SFT and RFT! SFT can learn more complex problems by mimicking, but can have poor generalization. RFT has better overall performance but is limited by the initial policy. Our method, Prefix-RFT, makes the best of both worlds!

GIF

English

184

21.4K

Wenhao Zhu@Wenhao_NLP·18 Tem

@teortaxesTex Stay tuned!

English

929

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex·18 Tem

Bytedance Seed is insane they have so much stuff to publish it's going out of order

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) tweet media

English

109

6.7K

Wenhao Zhu@Wenhao_NLP·18 Tem

@Elaina43114880 @teortaxesTex 晚些时候可以进行在线测试，敬请期待！ The service will be available soon. Stay tuned! Currently, we recommend deploying Seed-X on your own device with our released weights.

中文

Elaina@Elaina43114880·18 Tem

@Wenhao_NLP @teortaxesTex hi，可以搞个在线demo测试下吗？

中文

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex·18 Tem

ByteDance Seed released Seed-X, a Mistral-7B shaped LLM specialized for translation, apparently pretrained on ≈6.4B tokens, equaling the likes or R1 and 2.5-Pro in human evaluation. «We deliberately exclude STEM, coding, and reasoning-focused data» lol unexpected data paper

English

306

29K

Keşfet

@natolambert @JustinLin610 @_TobiasLee @XiaomiMiMo @SonglinYang4 @MITEECS @thinkymachines @zhang_benita