steven

185 posts

steven banner
steven

steven

@stevenlu0

cs & ai policy @berkeley_ai, phd’ing soon at @SCSatCMU

Berkeley, CA Katılım Haziran 2025
131 Takip Edilen155 Takipçiler
Sabitlenmiş Tweet
steven
steven@stevenlu0·
great turnout at the citris tech policy launch last week, we had over 200 RSVPs!! really looking forward to seeing how this develops over the rest of the semester and next year, i have lots of plans :)
steven tweet mediasteven tweet mediasteven tweet mediasteven tweet media
English
1
0
7
2K
Abby O'Neill
Abby O'Neill@abby_k_oneill·
Would you trust an AI agent to negotiate on your country's behalf at the G20? Real coordination is long-horizon, asymmetric, and non-binding; current multi-agent evaluations miss this. We build Cooperate to Compete (C2C): a testbed for LM agents coordinating with rivals. 🤝🔪🎭
Abby O'Neill tweet media
English
4
24
91
24.2K
steven
steven@stevenlu0·
i'm co-organizing a workshop on AI governance! we'll have student presentations in the morning, then various presentations in the afternoon ft. CA State Sen. Jerry McNerney, Prof. Suresh Venkatasubramanian, speakers from DeepMind, CCST, Mila & more! register for free food 😋
steven tweet media
English
1
0
9
130
steven
steven@stevenlu0·
call me a nerd but pbs newshour is literally my favorite show on tv. very well deserved!
Lisa Desjardins@LisaDNews

Peabody! Incredible honor for @NewsHour and our team coverage of immigration. Could not be prouder of the work we - and mostly those below - have done. Among the congratulations to: @WmBrangham, @lbarronlopez, @ElizLanders, @TheStephSy, @IAmAmnaNawaz, @GeoffRBennett, @mattloff, Elizabeth Summers, @ecarpeaux, @KyleMidura, @shraipopat, @mikewfritz, Jonah Anderson, @DougAAdams, @newshourfred, @sarajust among many. pbs.org/newshour/press…

English
1
0
1
94
steven retweetledi
Serina Chang
Serina Chang@serinachang5·
🎉 Thrilled to have two papers accepted to ACL 2026 main! 1. Graph-based models match LLMs on close-ended human simulation tasks with far less compute & greater transparency 2. (oral) How to allocate human samples towards fine-tuning vs post-hoc rectification in simulation
Serina Chang tweet mediaSerina Chang tweet media
English
4
19
135
14K
steven
steven@stevenlu0·
me making my biggest decision of the year and it’s what to choose as my new email handle…
English
0
0
6
362
steven
steven@stevenlu0·
@chowtato i love the mediterranean, almost european vibes of these photos in san francisco!
English
1
0
1
653
cato 😾
cato 😾@chowtato·
Feel free to slave away at your 9-5 living in South Bay with the copium that a quicker commute is worth the sacrifice I’ll be spending my prime making the most out of the beautiful city of San Francisco
cato 😾 tweet mediacato 😾 tweet mediacato 😾 tweet mediacato 😾 tweet media
English
74
16
1.2K
126.6K
steven
steven@stevenlu0·
@chrisalbon wouldn’t this be because of the I-80 closures this weekend?
English
0
0
5
960
Chris Albon
Chris Albon@chrisalbon·
Waymo love of the 280 should be studied
Chris Albon tweet media
English
24
1
184
36.3K
steven retweetledi
Joseph Jeesung Suh
Joseph Jeesung Suh@JosephJSSuh·
🎉 Excited to share that GEMS is accepted to ACL 2026 main! We show that a lightweight GNN can match or outperform LLMs at simulating human behavior in discrete-choice settings — with multiple advantages, including efficiency and transparency. Paper: arxiv.org/abs/2511.02135
Joseph Jeesung Suh@JosephJSSuh

LLMs have dominated recent work on simulating human behaviors. But do you really need them? In discrete‑choice settings, our answer is: not necessarily. A lightweight graph neural network (GNN) can match or beat strong LLM-based methods. Paper: arxiv.org/abs/2511.02135 🧵👇

English
2
7
35
4.2K
Allison Chen
Allison Chen@allisonchen_227·
How should we talk about LLMs? Does it matter if we frame them as a machines 📠, tools ⚒️, or companions 👥? In our #CHI2026 paper, that these framings can alter what people believe about LLMs and how they use them. See 🧵for more!
Allison Chen tweet media
English
4
15
46
5K
steven
steven@stevenlu0·
finally made it official while waiting in the airport in boarding group 6! I’ll be starting a PhD at @SCSatCMU in the fall, excited for the journey to come 🥳🎉
steven tweet media
English
24
11
432
19.3K
Shanli Xing
Shanli Xing@shanli_xing·
Super excited to share that I'll be joining @CarnegieMellon as a PhD student, working with @tqchenml and @ericxing! It has been a wonderful journey at @uwcse @UWSyFi learning and building systems that power frontier AI in production. I want to express my sincerest gratitude to @ye_combinator @tqchenml @luisceze for all the opportunities and guidance along the way, and to many others at UW and CMU who have been hugely encouraging, supporting, and intellectually inspiring me. I wouldn't have made it this far without all of you. Looking ahead, I'm eager to explore how AI-system co-design can advance the capabilities of both sides. On the system side, I believe better abstractions and verification signal design can enable the AI-driven cycle for system improvements. On the model side, I'm interested in how to enable models to perform well in long-horizon, sparse-goal tasks that require periodic knowledge consolidation, like doing system research itself. Always happy to chat and collab! Keep building 🤟
English
10
6
115
7.8K
steven
steven@stevenlu0·
very excited to share that I was awarded a 2026 @NSF graduate research fellowship :))
steven tweet media
English
6
4
178
4.2K
Myra Cheng
Myra Cheng@chengmyra1·
In Barcelona for #chi2026! Presenting our work on eliciting LLMs' assumptions about users, and how this mismatches with user expectations, in the Tues poster session! (Spoiler: users assume that LLMs give objective info much more than they actually do --> sycophancy 😢)
English
2
4
39
4.7K
Manoel
Manoel@manoelribeiro·
I'm flying to Barcelona to attend @acm_chi. Let's hang out! :-)
English
1
0
12
675
Yixiong Hao
Yixiong Hao@Yixiong_Hao·
We're launching an international, cross-sector Delphi study to establish consensus on conducting and reporting AI evaluations. All critical infrastructure—from bridges and aircraft to pharmaceuticals—has agreed-upon, rigorous evaluation standards. AI systems will be at least as consequential, yet current practices are uneven, siloed, and hard to compare across organizations and contexts. We need voices from frontier labs, auditors, academia, policymakers, civil society, and industry practitioners to create a shared reference.
Yixiong Hao tweet media
English
2
7
21
1.3K