Mitchell Gordon

117 posts

@mitchellgordon

Assistant prof @MIT_CSAIL, research @OpenAI. PhD in computer science @Stanford

Joined September 2007

449 Following · 1.7K Followers

Mitchell Gordon @mitchellgordon · Jul 7

So excited to be back in Seoul for ICML! Would love to chat with people working on human-ai interaction and alignment, please reach out. And looking forward to giving a keynote at the pluralistic alignment workshop on the 11th.

2.9K

Mitchell Gordon retweeted

Michelle Lam @michelle123lam · Jun 17

After over 10 years at Stanford, it's time to leave :) I will be joining UT Austin's CS department as an assistant professor in fall 2027! If you're excited to envision the future of interaction with AI, I'm recruiting PhD students this cycle. Come join me!

105

1.3K

99.8K

Mitchell Gordon retweeted

Jasmine Wang @j_asminewang · Jun 17

OpenAI safety + alignment will be hosting a mixer at ICML & there are several teams attending who are actively hiring. If you're interested in attending our safety & alignment event/meeting teams at ICML, please fill out the below form!

347

34.2K

Mitchell Gordon retweeted

MIT HCI @mithci · Jun 9

We're back! The MIT HCI group has grown, and we couldn't be more excited. A huge welcome to our newest faculty (@mitchellgordon, @huangcza, @ZanaBucinca & @jas_x_flowers) & students joining @arvindsatya1, @karger, Stefanie Mueller, Rob Miller & Daniel Jackson. Give us a follow!

17.6K

Mitchell Gordon retweeted

Elinor @elinorpd_ · Jun 6

excited to share that i'll be pursuing my phd in computer science at @MIT_CSAIL starting this fall 🥳🎓 i'm so grateful to be coadvised by the literal dream team: @jacobandreas, @bakkermichiel and @mitchellgordon 🙌

357

18.5K

Mitchell Gordon retweeted

Andre Ye @andreiskiii · May 8

Sycophancy, disempowerment, homogenization of thought: lots to be grim about for what AI is doing to us, the collapse of our subjectivity into a machine "objectivity". But a lot of AI's value seems to come precisely from scaling this objectivity. How do we make sense of this?

1.3K

Mitchell Gordon retweeted

Andre Ye @andreiskiii · May 8

“Should I fear death?” Ask an LLM and you get one answer or a big bag, but little visibility into the decisions and assumptions that produced them. We built the "conceptual multiverse": a system that makes those decisions transparent and intervenable. multiverse.csail.mit.edu

6.7K

Mitchell Gordon retweeted

jenny huang @JennyHuang99 · May 5

recently, i’ve been thinking about ways to design ai systems to be more compatible with slow thinking 🐌. you can check out the full blogpost here 🤗: jennyhuang19.github.io/slow-ai-ai-tha…

168

12.1K

Mitchell Gordon retweeted

Lama Ahmad لمى احمد @_lamaahmad · May 3

Last day to apply to the OpenAI safety fellowship! It’s a chance to work with some of my favorite people on some of the most important, interesting, and consequential questions in AI

OpenAI @OpenAI

Introducing the OpenAI Safety Fellowship, a new program supporting independent research on AI safety and alignment—and the next generation of talent. openai.com/index/introduc…

15K

Mitchell Gordon @mitchellgordon · Mar 27

MIT postdoc opportunity! We're hiring a human-AI interaction postdoc (HCI+ML/RL) to train agents that deepen how people think and collaborate - rewarded by how humans actually build skill together. With @arvindsatya1 @ZanaBucinca, me & more! Apply by May 1 tinyurl.com/4jsr8ee9

122

13.4K

Mitchell Gordon retweeted

OpenAI @OpenAI · Apr 6

Introducing the OpenAI Safety Fellowship, a new program supporting independent research on AI safety and alignment—and the next generation of talent. openai.com/index/introduc…

381

289

2.7K

951.3K

Mitchell Gordon retweeted

Arvind Satyanarayan @arvindsatya1 · Mar 27

🚨MIT Postdoc Opportunity! We're looking for someone with an HCI+ML/RL background to work with us on agents that promote metacognition and sociality—trained with ethnographic rewards! w/@mitchellgordon,@zanabucinca & colleagues in sociology+anthropology tinyurl.com/4jsr8ee9

127

15.7K

Mitchell Gordon retweeted

Lama Ahmad لمى احمد @_lamaahmad · Feb 24

I’m looking for someone who’s excited to be on the operational end of AI safety research problems. This role sits at the intersection of research and execution: working with academic researchers, 3p evaluators, and internal partners to help shape AI safety in practice.

250

62K

Mitchell Gordon retweeted

OpenAI Newsroom @OpenAINewsroom · Feb 19

We’re committing $7.5M to @AISecurityInst’s Alignment Project to fund independent research on mitigations for safety and security risks from misaligned AI. openai.com/index/advancin…

213

755

125K

Mitchell Gordon retweeted

Michael Bernstein @msbernst · Feb 12

Simile on Bloomberg: bloomberg.com/news/videos/20…

2.7K

Mitchell Gordon @mitchellgordon · Feb 17

Congrats to the Simile team! Some of the best people I know, working on one of the most interesting problems.

Simile @simile_ai

x.com/i/article/2021…

Mitchell Gordon retweeted

Zoë Hitzig @zhitzig · Jan 15

New on the OpenAI alignment blog! We prototype a method for eliciting the values that drive preferences over model responses, and release CoVal, an experimental dataset we built with it. Details in thread 👇

18.2K

Mitchell Gordon retweeted

Yu Ying Chiu (Kelly Chiu) @kellychiuyy · Dec 22

New paper out with @Scale_AI! Introducing MoReBench - the first-ever benchmark to evaluate procedural moral reasoning in LLMs. MoReBench focuses on how LLMs reason, not just what they decide. We reveal surprising gaps in frontier models' moral reasoning that scaling laws & existing benchmarks miss entirely, and encourage more research around CoT monitoring and robust capability building. This collaboration spanned @UW @nyuniversity @harvard @stanford @mit @cais & more 🧠⚖️

126

17K

Mitchell Gordon retweeted

Hua Shen✨✈️ ICML @Seoul @huashen218 · Dec 7

✨Tutorial Materials Now Available! We’re truly grateful for the hundreds (maybe thousands!) of wonderful attendees who joined our #NeurIPS Human–AI Alignment Tutorial 💗 -- Thank you all for your enthusiasm, thoughtful questions, and all the inspiring follow-up conversations 🤗! As many of you requested during #NeurIPS, we would love to share with you the full tutorial video and all slides below provided by our amazing speakers @mitchellgordon @adamfungi @Yoshua_Bengio: 📺 Tutorial Recording: neurips.cc/virtual/2025/l… 📕All Slides: hai-alignment-course.github.io/tutorial/ We’d also love to hear more of your questions and feedback — and hope these resources spark new ideas and collaborations in Human–AI Alignment research🔥!

Hua Shen✨✈️ ICML @Seoul @huashen218

🚀 Thrilled to announce our upcoming #NeurIPS2025 Tutorial on Human–AI Alignment: Foundations, Methods, Practice, and Challenges!

🗓️ Dec 2, 09:30–12:00 PST
📍 Exhibit Hall F, San Diego Convention Center
🔗 NeurIPS program: neurips.cc/virtual/2025/l…
👉 Tutorial Website: hai-alignment-course.github.io/tutorial/

With an incredible lineup of speakers — @mitchellgordon, @adamfungi, @Yoshua_Bengio — we’ll dive into:

* Human-in-the-loop AI & Value Alignment
* Collective Alignment
* Sociotechnical Evaluation and Oversight
* A Safety Argument for the Scientist AI

🌟 An exceptional interdisciplinary expert panel -- featuring insights from @dawnsongtweets, @eegilbert, @monojitchou, and @hannahrosekirk!

👫 Welcome to join us for an exciting and engaging session — let’s shape the future of Human–AI Alignment together!

#NeurIPS2025 #HAIAlignment #ValueAlignment #CollectiveAlignment #AISafety #ResponsibleAI

12.5K
