Wenhao Zhu

114 posts

Wenhao Zhu banner
Wenhao Zhu

Wenhao Zhu

@Wenhao_NLP

AI researcher@ByteDance Seed | prev. @EdinburghNLP | Multilingual LLM & machine translation

Edinburgh, Scotland Katılım Ekim 2019
794 Takip Edilen588 Takipçiler
Nathan Lambert
Nathan Lambert@natolambert·
In Beijing and Hangzhou this week — want to talk to more AI researchers! Reach out.
Nathan Lambert tweet media
English
39
12
512
29.4K
Junyang Lin
Junyang Lin@JustinLin610·
me stepping down. bye my beloved qwen.
English
1.7K
730
13.6K
6.6M
Lei Li
Lei Li@_TobiasLee·
已经把沉浸式翻译里面的默认模型设置成了 @XiaomiMiMo 的 V2-Flash 比之前一众模型快太多了,翻译质量也很在线
Lei Li tweet media
中文
3
0
11
1.3K
Xuandong Zhao
Xuandong Zhao@xuandongzhao·
Does anyone else feel the same? Gemini 3 Pro hasn’t been great for my daily use… feels worse than Gemini 2.5 Pro. Guess I had my expectations way too high...
English
1
1
12
3.3K
Wenhao Zhu retweetledi
Simran Khanuja
Simran Khanuja@simi_97k·
📢 Announcing the First Workshop on Multilingual and Multicultural Evaluation (MME) — co-located with #EACL2026 🇲🇦 📅 Mar 24–29, 2026 | Rabat, Morocco MME focuses on resources, metrics & methodologies for evaluating multilingual systems! …al-multicultural-evaluation.github.io 🗓️ Submit by Dec 19, 2025
Simran Khanuja tweet media
English
1
18
77
5.9K
Wenhao Zhu retweetledi
Tanishq Mathew Abraham, Ph.D.
Tanishq Mathew Abraham, Ph.D.@iScienceLuvr·
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization "We present DuPO, a dual learning-based preference optimization framework that generates annotation-free feedback via a generalized duality" "DuPO decomposes a primal task’s input into known and unknown components, then constructs its dual task to reconstruct the unknown part using the primal output and known information (e.g., reversing math solutions to recover hidden variables"
Tanishq Mathew Abraham, Ph.D. tweet media
English
3
33
135
10.8K
Wenhao Zhu
Wenhao Zhu@Wenhao_NLP·
@sarahookr @Cohere_Labs @cohere You’ve done an incredible job leading Cohere and empowering the multilingual community. Wishing you the best in your next adventure!
English
0
0
2
151
Sara Hooker
Sara Hooker@sarahookr·
It has been an incredible honor to spend the past few years leading @Cohere_Labs @cohere . This has been the adventure of a lifetime. However, after much deliberation, I made a tough decision 2 months ago it is time to say goodbye.
English
133
20
1.3K
138.1K
Wenhao Zhu retweetledi
Deedy
Deedy@deedydas·
Bytedance just dropped realtime voice translation 3x faster than before, with only a ~3s lag! Seed LiveInterp 2 is a full duplex speech-to-speech model with >70% correctness. When this makes it to video calls, it'll open up previously impossible connections.
English
38
139
1.1K
127.1K
Alessio Devoto
Alessio Devoto@devoto_alessio·
@PMinervini @simeng_ssun Good point! We support LongBench v1 and v2 in KVPress! As for MMLongBench, we are not planning to include it as we don't have KV Cache compression methods for VLMs (for now)
English
1
0
2
75
Alessio Devoto
Alessio Devoto@devoto_alessio·
🏆 Our @nvidia KV Cache Compression Leaderboard is now live! Compare state-of-the-art compression methods side-by-side with KVPress. See which techniques are leading in efficiency and performance. 🥇 huggingface.co/spaces/nvidia/…
Alessio Devoto tweet media
English
8
45
256
18.5K
Wenhao Zhu
Wenhao Zhu@Wenhao_NLP·
Could multi-turn interaction the next promising direction for scaling?
Multi-Turn Interaction LLM Workshop @ NeurIPS 2025@mti_neurips

🚀 Call for Papers — @NeurIPSConf 2025 Workshop Multi-Turn Interactions in LLMs 📅 December 6/7 · 📍 San Diego Convention Center Join us to shape the future of interactive AI. Topics include but are not limited to: 🧠 Multi-Turn RL for Agentic Tasks (e.g., web & GUI agents, tool use) 🤝 Human-AI Interaction over time 🛡️ Alignment across extended interactions 📏 Evaluation of long-horizon tasks 🧩 Social learning, Open-Endedness, trust, and more 🌟 Featuring an all-star speaker lineup: Dawn Song @dawnsongtweets (UC Berkeley) Jason Weston @jaseweston (Meta FAIR) Natasha Jaques @natashajaques (University of Washington & Google DeepMind) Tim Rocktäschel @_rockt (UCL & DeepMind, tentative) Diyi Yang @Diyi_Yang (Stanford) Peter Henderson @PeterHndrsn (Princeton) Yu Su @ysu_nlp (OSU) Hannah Rose Kirk @hannahrosekirk (Oxford) 📣 Updates: Follow us here & spread the word! #NeurIPS2025 #LLMs #AIAlignment #MultiAgent #ReinforcementLearning #LanguageAgents #InteractiveAI

English
0
1
7
773
Wenhao Zhu retweetledi
Zeyu Huang
Zeyu Huang@ZeroyuHuang·
🚀 Introducing Prefix-RFT to blend SFT and RFT! SFT can learn more complex problems by mimicking, but can have poor generalization. RFT has better overall performance but is limited by the initial policy. Our method, Prefix-RFT, makes the best of both worlds!
GIF
English
6
44
184
21.4K
Wenhao Zhu
Wenhao Zhu@Wenhao_NLP·
@Elaina43114880 @teortaxesTex 晚些时候可以进行在线测试,敬请期待! The service will be available soon. Stay tuned! Currently, we recommend deploying Seed-X on your own device with our released weights.
中文
1
0
1
50
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
ByteDance Seed released Seed-X, a Mistral-7B shaped LLM specialized for translation, apparently pretrained on ≈6.4B tokens, equaling the likes or R1 and 2.5-Pro in human evaluation. «We deliberately exclude STEM, coding, and reasoning-focused data» lol unexpected data paper
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) tweet media
English
7
46
306
29K