Ruohao Guo

47 posts

Ruohao Guo

@GuoOctavia

CS PhD Student @ICatGT | Undergrad @UofIllinois

Atlanta, GA Katılım Ekim 2018

365 Takip Edilen134 Takipçiler

Sabitlenmiş Tweet

Ruohao Guo@GuoOctavia·21 Haz

Ever wondered if style lexicons still play a role in the era of LLMs? 🤔 We tested 13 established and 63 novel language styles across different LLMs. 🧠✨ It turns out lexicons are still crucial for style understanding! But how can we better leverage this lexical knowledge? Our approach: meta-tuning LLMs to leverage lexical knowledge for generalizable language style understanding. Check out our latest work at Main of #ACL2024NLP! 🚀 arxiv.org/abs/2305.14592 @mlatgt @ICatGT

English

6.3K

Ruohao Guo@GuoOctavia·14 Mar

@hbXNov @kaiwei_chang @adityagrover_ @VioletNPeng @AnthropicAI Congrats! 🎉

English

Hritik Bansal@hbXNov·14 Mar

Finally defended my Ph.D. thesis! 🥳 A very warm thank you to my family, friends, and advisors — @kaiwei_chang, @adityagrover_, @VioletNPeng, and Hongjing Lu. Next, I will be joining @AnthropicAI as a Member of Technical Staff. My defense slides ⬇️

English

291

23.7K

Ruohao Guo@GuoOctavia·18 Şub

@WeiLin__Chen @Google Very interesting idea! Nice work!

English

Wei-Lin Chen@WeiLin__Chen·17 Şub

🚀 New paper from my internship at @Google! LLMs can “think” for a long time only to get the answer wrong — more tokens do not always help and may be overthinking 😵‍💫 We introduce Deep-Thinking Ratio (DTR), a new way to measure LLM reasoning effort. The idea: Count the tokens models had to think deeply to produce. 🧵

English

624

45.5K

Ruohao Guo retweetledi

Cong Wei@CongWei1230·8 Oca

Thrilled to open-source UniVideo🎬! UniVideo brings unified multimodal understanding, generation, and editing to the video domain One framework for • video/image understanding • text/image → image/video generation • free-form image/video editing • reference-driven image/video generation/editing Code: github.com/KlingTeam/UniV… Model: huggingface.co/KlingTeam/UniV… Project Page: congwei1230.github.io/UniVideo Huggingface: huggingface.co/papers/2510.08…

English

443

67.8K

Ruohao Guo retweetledi

Yang Chen@ychenNLP·16 Ara

🥈 Silver Medal at IOI 2025 & Outperforms DeepSeek-R1-0528 on LiveCodeBench. Instead of mixing different tasks together, we scale *Cascade RL* to develop general LLMs in curriculum (RLFH -> Instruct -> Math -> Code -> SWE). So many learnings, check out our report!👇

Wei Ping@_weiping

🚀 Introducing Nemotron-Cascade! 🚀 We’re thrilled to release Nemotron-Cascade, a family of general-purpose reasoning models trained with cascaded, domain-wise reinforcement learning (Cascade RL), delivering best-in-class performance across a wide range of benchmarks. 💻 Coding powerhouse After RL, our 14B model: • Surpasses DeepSeek-R1-0528 (671B) on LiveCodeBench v5/v6/Pro. • Achieves silver-medal performance at IOI 2025 🥈. • Reaches a 43.1% pass @1 on SWE-Bench Verified, and 53.8% with test-time scaling. 🧠 What is Cascade RL? Instead of mixing heterogeneous prompts across domains, Cascade RL trains sequentially, domain by domain, which reduces engineering complexity, mitigates heterogeneous verification latencies, and enables domain-specific curricula and tailored hyperparameter tuning. ✨ Key insight Using RLHF for alignment as a pre-step dramatically boosts complex reasoning—far beyond preference optimization. Subsequent domain-wise RLVR stages rarely hurt the benchmark performance attained in earlier domains and may even improve it, as illustrated in the following figure. 🤗 Models & training data 🔥 👉 huggingface.co/collections/nv… 📄 Technical report with detailed training and data recipes 👉 arxiv.org/pdf/2512.13607

English

226

23.9K

Ruohao Guo retweetledi

Alan Ritter@alan_ritter·3 Ara

At #NeurIPS2025 through Sunday. Come say hi and check out our posters on: 🔒Probabilistic reasoning for text anonymity estimation: Wednesday @ 11am 🤖 Efficient, self-improving agents: Friday @ 11am

San Diego, CA 🇺🇸 English

1.1K

Ruohao Guo@GuoOctavia·2 Ara

@abeirami Would love to chat :)

English

Ahmad Beirami@abeirami·23 Kas

Will be at NeurIPS Thu Dec 4 to Sun Dec 7, excited to reconnect with old friends and make new ones. If you are excited about AI engineering (orchestration, evals, and optimizing scaffolds), we are hiring! On Saturday I’ll be on panels at the Reliable ML & UniReps workshops.

English

205

29.7K

Ruohao Guo retweetledi

Kai Zhang@KaiZhang_CS·17 Eki

Introducing early experience: using future states resulting from agent’s own action as scalable supervision to train itself - without reward🧠! 1️⃣Reward-free: can train directly in real-world environments. 2️⃣Better RL warm-start: when continued with RL, leads to higher final performance than imitation-only warm-ups. 3️⃣Data-efficient & scalable: outperforms imitation with even 1/8 data.👇

Jason Weston@jaseweston

🌀Agent Learning via Early Experience🌀 📝: arxiv.org/abs/2510.08558 - SFT for agents is sparse; RL on long-horizons is hard We provide new mid-training signals that work: 1) Implicit next state world modeling task 2) Self-reflection on alternate states - Strong improvements over 8 environments and multiple model families - Works well for subsequent RL! 🧵1/5

English

112

15.8K

Ruohao Guo retweetledi

Yao Dou@Yaooo01·15 Eki

Can LLM-simulated users replace expensive human evaluation for multi-turn conversations? Short answer: yes, if you model the user right. With our SimulatorArena, we find that detailed user profiles (knowledge + message style) improve alignment with real human evaluation by 26% at <3% the cost. #EMNLP2025 [1/6] 🧵

English

134

9.9K

Ruohao Guo retweetledi

Jungsoo Park@jungsoo___park·26 Eyl

What if LLMs can forecast their own scores on unseen benchmarks from just a task description? We are the first to study text description→performance prediction, giving practitioners an early read on outcomes so they can plan what to build—before paying full price 💸

English

9.8K

Ruohao Guo retweetledi

Yi Wu@jxwuyi·13 Ağu

🔍We introduce ASearcher, a search agent trained by end2end RL Large-scale (up to 128 turns) RL with AReaL unlocks Long-Horizon Agentic Search (+20.8/+46.7% on GAIA/xBench) 💻Data, Code&Model: github.com/inclusionAI/AS… 📄Paper: arxiv.org/abs/2508.07976v #Agent #OpenSource #LLM #AGI

English

289

27.5K

Ruohao Guo@GuoOctavia·30 May

@XingyuFu2 @PrincetonPLI @penn_nlp Congrats, Xingyu!

Català

143

Xingyu Fu@XingyuFu2·29 May

😌Been wanting to post since March but waited for the graduation photo….Thrilled to finally share that I’ll be joining Princeton University as a postdoc @PrincetonPLI this August! Endless thanks to my incredible advisors and mentors from Penn, UW, Cornell, NYU, UCSB, USC, Columbia, MSFT, and beyond—you’ve shaped my thinking and helped me grow so much. If you’re into multimodal research, especially VLMs / fusion models / image video gen, feel free to DM me, would love to connect!

English

456

39.1K

Ruohao Guo@GuoOctavia·20 May

@jmin__cho @unccs @JHUCompSci Congrats Jaemin!

English

169

Jaemin Cho@jmin__cho·20 May

Sharing some personal updates 🥳: - I've completed my PhD at @unccs! 🎓 - Starting Fall 2026, I'll be joining the Computer Science dept. at Johns Hopkins University (@JHUCompSci) as an Assistant Professor 💙 - Currently exploring options + finalizing the plan for my gap year (Aug 2025 - Jul 2026), so feel free to reach out! 🔎 Endless thanks to my amazing advisor @mohitban47, the @uncnlp group, my partner @HeesooJang2, and my family. I couldn’t have done this without your constant support 🙏 Also, a heartfelt shoutout to all the collaborators I’ve worked with over the years—your ideas, encouragement, and hustle have meant the world. Excited for what’s ahead. Let’s keep building together! ❤️

English

449

90.2K

Ruohao Guo retweetledi

Wei Xu@cocoweixu·4 May

I am giving a keynote at PrivateNLP Workshop (sites.google.com/view/privatenl…) at #NAACL2025 (Sunday 9am CT). * GPT4-v is a performant geolocator, predicting exact GPS coordinates of image > any SOTA * LLMs can estimate privacy risk based on probabilistic reasoning > chain-of-thoughts

English

6.3K

Ruohao Guo retweetledi

alphaXiv@askalphaxiv·8 Nis

Introducing Deep Research for arXiv Ask questions like 'What are the latest breakthroughs in RL fine-tuning?' and get comprehensive lit reviews with trending papers automatically included Turn hours of literature searches into seconds with AI-powered research context ⚡

English

547

372.3K

Ruohao Guo@GuoOctavia·7 Nis

@yugu_nlp Congrats!

English

169

Ruohao Guo retweetledi

Hamish Ivison@hamishivi·4 Mar

How well do data-selection methods work for instruction-tuning at scale? Turns out, when you look at large, varied data pools, lots of recent methods lag behind simple baselines, and a simple embedding-based method (RDS) does best! More below ⬇️ (1/8)

English

325

86.2K

Ruohao Guo retweetledi

Ethan Mendes@EthanMendes3·5 Mar

🚨New Paper: Better search for reasoning (e.g., web tasks) usually requires costly💰demos/rewards What if we only self-improve LLMs on state transitions—capturing a classic RL method in natural language? Spoiler: It works (⬆️39% over base model) & enables efficient search!🚀 🧵

English

4.7K

Ruohao Guo retweetledi

Tarek Naous@tareknaous·9 Oca

What causes entity-related cultural biases in LMs? Is it just pre-training data? Our latest paper shows how varying linguistic phenomena exhibited by entities (such as word sense in Arabic) impact the cross-cultural performance of LMs. arxiv.org/abs/2501.04662

English

4.1K

Ruohao Guo@GuoOctavia·13 Tem

@ychenNLP @alan_ritter @cocoweixu @mchang21 @kartik_goyal_ @Hexiang_Hu @nvidia Congrats, Dr. Chen!

English

150

Yang Chen@ychenNLP·13 Tem

I've successfully defended my PhD! 🎓Really appreciate my advisor @alan_ritter @cocoweixu for everything throughout this journey🥺. Huge thanks to my amazing committee @mchang21 @kartik_goyal_ @Hexiang_Hu 🚙I'll move to CA and join @NVIDIA as a research scientist next month.

Alan Ritter@alan_ritter

Congratulations to @ychenNLP for successfully defending his PhD! Yang has done exciting work advancing both the multilingual and multimodal capabilities of LLMs. Many thanks to his committee: @cocoweixu (co-advisor), @mchang21, @Hexiang_Hu, @kartik_goyal_

English

141

19K

Ruohao Guo@GuoOctavia·21 Haz

Huge thanks to my brilliant collaborators @cocoweixu and @alan_ritter! The code and data will be released soon: github.com/octaviaguo/Sty…

English

179

Ruohao Guo@GuoOctavia·21 Haz

Our experiments show that class randomization significantly boosts lexicon-based meta-tuning in LLMs, enabling effective leverage of lexical knowledge for zero-shot inference.

English

247

Ruohao Guo@GuoOctavia·21 Haz

English

6.3K

Keşfet

@hbXNov @kaiwei_chang @adityagrover_ @VioletNPeng @AnthropicAI @WeiLin__Chen @Google @abeirami