Ruochen Zhang

570 posts

Ruochen Zhang banner
Ruochen Zhang

Ruochen Zhang

@ruochenz_

PhDing @Brown_NLP & @health_nlp // working on LLM interp and capabilities 🔁 human mechanisms // Prev: @cohere, @sutdsg // she/they

Katılım Ekim 2014
1.6K Takip Edilen874 Takipçiler
Alham Fikri Aji
Alham Fikri Aji@AlhamFikri·
VLMs can easily get distracted by unrelated cultural cues. Happy to present our work on this soon at #CVPR2026🥳 Working on multilingual VLMs? Consider using our benchmark: 📜arxiv.org/pdf/2511.17004 🤗huggingface.co/datasets/patri… Amazing work by @patrickamadeus_ and colleagues!
Alham Fikri Aji tweet media
pat@patrickamadeus_

Excited to share that we have committed our paper “Vision-Language Models are Confused Tourists” to #CVPR2026 (Findings)! 🇺🇸🏔 Arxiv: arxiv.org/abs/2511.17004 We question whether current SOTA VLMs remain robust in simple cultural grounding QA when distracting contextual objects are present For example, if you eat chicken schnitzel with Mt. Fuji in the background, will the model fail to recognize it as Japanese katsu? ConfusedTourists introduces: 👉 5k+ evaluation samples across 3 cultural item categories, comprising 243 unique cultural items from 57 countries and 11 sub-regions 🌍 👉 Evaluation of 14 VLMs across 12 data features 🤖 👉 Findings showing that simple concept mixing can cause up to a -40% drop in perform 📉 Special thanks to my co-authors @IkhlasulHanif0 , @emthehunt, @gentaiscool, @FajriKoto, and my advisor @AlhamFikri for the valuable contributions along the way! #multimodal #vlm #multicultural #robustness #evaluation #NLProc #ComputerVision

English
2
18
72
7.3K
Ruochen Zhang retweetledi
Kanishka Misra 🌊
Kanishka Misra 🌊@kanishkamisra·
What is the interplay between representations learned from (language) surface forms alone, and those learned from more grounded evidence (e.g.,vision)? Excited to share new work understanding “Cross-modal taxonomic generalization” in (V)LMs 1/
Kanishka Misra 🌊 tweet media
English
1
13
50
4.6K
Ruochen Zhang retweetledi
Michael Lepori
Michael Lepori@Michael_Lepori·
I'm excited to share that this paper was accepted at ICLR 2026! We show that language models encode one of the most basic ingredients of a world model: the ability to distinguish plausible from implausible states. Check out the paper/thread for more details! See you in Rio!
Michael Lepori tweet media
Michael Lepori@Michael_Lepori

What does your favorite language model know about the real world? 🌎 Can it distinguish between possible and impossible events? We find that LM representations not only encode these distinctions, but that they predict human judgments of event plausibility!

English
3
11
92
9.3K
Ruochen Zhang retweetledi
Brown NLP
Brown NLP@Brown_NLP·
LUNAR Lab is looking for a postdoc to work on understanding and interpreting reasoning in LLMs and humans, broadly construed. The position is funded by Schmidt Sciences and is for at least 18 months, with the likely option to extend. Apply here! forms.gle/CrmrCzun79G9Ca…
English
0
7
36
4.8K
Liu Yang
Liu Yang@Yang_Liuu·
Thank you, Dimitris!! Proud to be your PhD student #5 :D I feel incredibly lucky to have you as my advisor. Thank you for everything!!
Dimitris Papailiopoulos@DimitrisPapail

Liu @Yang_Liuu defended her thesis today and absolutely crushed it. One of the best presentations I've seen at UW-Madison. Three proud advisors: @rdnowak, @Kangwook_Lee (zooming in from Korea 🇰🇷), and me. PhD student #5 from my group🥲

English
4
0
12
2.1K
Alham Fikri Aji
Alham Fikri Aji@AlhamFikri·
Honored to receive the Rising Star Award at @MBZUAI’s 5th anniversary last week! This recognition truly belongs to my incredible team of students, RAs, postdocs, and collaborators! Thank you MBZUAI for the recognition!
Alham Fikri Aji tweet media
English
5
6
65
3.2K
Ruochen Zhang retweetledi
Augmented Mind Podcast
Augmented Mind Podcast@augmind_fm·
AI used to be a distant promise; now it permeates our lives. AI is getting better, but is it making us better? We are promised that AI will augment our minds, but how? We--@EchoShao8899, @shannonzshen, and @michaelryan207--are excited to launch the Augmented Mind Podcast (The AM Podcast), a podcast about technical human-centered AI work. We'll share compelling research, infrastructure, and systems through monthly episodes, featuring interviews with the pioneering minds behind them. We release EP0 today to share who we are, why we started this podcast, and what we're looking forward to. 0:00 - Prelude: the problems we care about 1:48 - Host introduction 2:03 - Why we started the AM Podcast 2:31 - Hot takes on human-centered AI 10:45 - Format of our podcast 11:28 - Unique technical challenges in human-centered AI 16:45 - Let the journey begin!
English
10
32
79
60.9K
Yung-Sung Chuang
Yung-Sung Chuang@YungSungChuang·
🎓 Life updates: I defended my PhD last week at MIT. Now I am joining OpenAI to continue my research on building trustworthy LLMs! Thanks my advisor James Glass and my committee @yoonrkim, @jacobandreas and all my friends for continual supports throughout this journey!
Yung-Sung Chuang tweet mediaYung-Sung Chuang tweet mediaYung-Sung Chuang tweet media
English
50
46
1.1K
62K
Hang Jiang
Hang Jiang@hjian42·
A bit late to share, but excited to be starting a new chapter. I defended my PhD from @MIT in May 2025. In January 2026, I will be joining Northeastern University (@Northeastern) as a tenure-track Assistant Professor, with a joint appointment in the D'Amore-McKim School of Business (@NU_Business) and the Khoury College of Computer Sciences (@KhouryCollege). Go Huskies! My research focuses on AI agents, LLMs/NLP, human-AI interaction, and AI for society. I will be recruiting PhD students for Fall 2026 and RAs starting Spring 2026. Grateful to my mentors and collaborators, and excited to connect and collaborate. Please check out my new homepage: hjian42.github.io #Northeastern #Khoury #DAmoreMcKim #MIT #AI #NLP #HumanAI #AIAgents #PhDAdmissions
Hang Jiang tweet mediaHang Jiang tweet media
English
38
33
639
40.8K
Ruochen Zhang retweetledi
SEACrowd
SEACrowd@seacrowd_ai·
One last call for interested applicants! Applications for the SEACrowd Apprentice Program 2026 will end on December 17, 23:59 (UTC-12). See the post below for more details.
SEACrowd@seacrowd_ai

🌏 Applications are now open for the SEACrowd Apprentice Program 2026! Join a 3–4 month guided research journey where you’ll collaborate with mentors. 🗓️ Apply between Nov 17 – Dec 17, 2025 (UTC-12) 📅 Program runs Feb – Jun 2026 seacrowd.org/apprenticeship

English
0
3
11
1.4K
Ruochen Zhang retweetledi
Catherine Arnett
Catherine Arnett@linguist_cat·
Really nice article illustrating why linguistic equity in AI is a safety issue that impacts many areas of social and political life!
Catherine Arnett tweet media
English
2
5
18
1K
Ruochen Zhang retweetledi
Leonie Weissweiler
Leonie Weissweiler@LAWeissweiler·
🧑‍🔬I’m recruiting PhD students in Natural Language Processing @UniLeipzig Computer Science, together with @Sca_DS! Topics include, but aren’t limited to: 🔎Linguistic Interpretability 🌍Multilingual Evaluation 📖Computational Typology Please share! #NLProc #NLP
Leonie Weissweiler tweet media
English
4
55
209
12.8K
Ruochen Zhang retweetledi
David Bau
David Bau@davidbau·
At the #Neurips2025 mechanistic interpretability workshop I gave a brief talk about Venetian glassmaking, since I think we face a similar moment in AI research today. Here is a blog post summarizing the talk: davidbau.com/archives/2025/…
David Bau tweet media
English
24
98
551
106.6K
Greta Tuckute
Greta Tuckute@GretaTuckute·
@ruochenz_ Thank you very much! And awesome, thanks a lot for the pointers--the cross-lingual circuit overlap in English and Chinese is really neat, and we will add this reference to the updated paper version! :)
English
1
0
4
150
Greta Tuckute
Greta Tuckute@GretaTuckute·
How do LLMs process syntax? Do different syntactic phenomena recruit the same model units, or do they recruit distinct model components? And do different languages rely on similar units to process the same syntactic phenomenon? Check out our new preprint (to appear at ACL 2026)!
Greta Tuckute tweet media
English
3
13
68
7.8K
Michael Elabd
Michael Elabd@MichaelElabd·
@ruochenz_ consider moving to SF, thought-provoking convos everyday! The weather is not as nice though
English
1
0
1
101
Ruochen Zhang
Ruochen Zhang@ruochenz_·
#NeurIPS2025 in San Diego has spoiled me so much with thought-provoking convos, good vibes and amazing weather ☀️ Getting back to east coast and embracing 30F is gonna be real fun 🥶
English
3
1
52
4.8K