Mitchell Gordon

112 posts

Mitchell Gordon

Mitchell Gordon

@mitchellgordon

Assistant prof @MIT_CSAIL, research @OpenAI. PhD in computer science @Stanford

Katılım Eylül 2007
440 Takip Edilen1.6K Takipçiler
Mitchell Gordon retweetledi
Andre Ye
Andre Ye@andreiskiii·
Sycophancy, disempowerment, homogenization of thought: lots to be grim about for what AI is doing to us, the collapse of our subjectivity into a machine "objectivity". But a lot of AI's value seems to come precisely from scaling this objectivity. How do we make sense of this?
Andre Ye tweet media
English
1
3
24
1.1K
Mitchell Gordon retweetledi
Andre Ye
Andre Ye@andreiskiii·
“Should I fear death?” Ask an LLM and you get one answer or a big bag, but little visibility into the decisions and assumptions that produced them. We built the "conceptual multiverse": a system that makes those decisions transparent and intervenable. multiverse.csail.mit.edu
Andre Ye tweet media
English
1
9
39
6.2K
Mitchell Gordon retweetledi
jenny huang
jenny huang@JennyHuang99·
recently, i’ve been thinking about ways to design ai systems to be more compatible with slow thinking 🐌. you can check out the full blogpost here 🤗: jennyhuang19.github.io/slow-ai-ai-tha…
jenny huang tweet media
English
4
21
167
11.6K
Mitchell Gordon
Mitchell Gordon@mitchellgordon·
MIT postdoc opportunity! We're hiring a human-AI interaction postdoc (HCI+ML/RL) to train agents that deepen how people think and collaborate - rewarded by how humans actually build skill together. With @arvindsatya1 @ZanaBucinca, me & more! Apply by May 1 tinyurl.com/4jsr8ee9
English
3
18
121
13K
Mitchell Gordon retweetledi
OpenAI
OpenAI@OpenAI·
Introducing the OpenAI Safety Fellowship, a new program supporting independent research on AI safety and alignment—and the next generation of talent. openai.com/index/introduc…
English
385
300
2.7K
946.1K
Mitchell Gordon retweetledi
Arvind Satyanarayan
Arvind Satyanarayan@arvindsatya1·
🚨MIT Postdoc Opportunity! We're looking for someone with an HCI+ML/RL background to work with us on agents that promote metacognition and sociality—trained with ethnographic rewards! w/@mitchellgordon,@zanabucinca & colleagues in sociology+anthropology tinyurl.com/4jsr8ee9
English
1
29
124
14K
Mitchell Gordon retweetledi
Lama Ahmad لمى احمد
Lama Ahmad لمى احمد@_lamaahmad·
I’m looking for someone who’s excited to be on the operational end of AI safety research problems. This role sits at the intersection of research and execution: working with academic researchers, 3p evaluators, and internal partners to help shape AI safety in practice.
English
20
17
250
61.9K
Mitchell Gordon retweetledi
Zoë Hitzig
Zoë Hitzig@zhitzig·
New on the OpenAI alignment blog! We prototype a method for eliciting the values that drive preferences over model responses, and release CoVal, an experimental dataset we built with it. Details in thread 👇
Zoë Hitzig tweet media
English
1
8
50
17.9K
Mitchell Gordon retweetledi
Yu Ying Chiu (Kelly Chiu)
Yu Ying Chiu (Kelly Chiu)@kellychiuyy·
New paper out with @Scale_AI! Introducing MoReBench - the first-ever benchmark to evaluate procedural moral reasoning in LLMs. MoReBench focuses on how LLMs reason, not just what they decide. We reveal surprising gaps in frontier models' moral reasoning that scaling laws & existing benchmarks miss entirely, and encourage more research around CoT monitoring and robust capability building. This collaboration spanned @UW @nyuniversity @harvard @stanford @mit @cais & more 🧠⚖️
Yu Ying Chiu (Kelly Chiu) tweet mediaYu Ying Chiu (Kelly Chiu) tweet mediaYu Ying Chiu (Kelly Chiu) tweet media
English
5
22
125
16.8K
Mitchell Gordon retweetledi
Hua Shen✨
Hua Shen✨@huashen218·
✨Tutorial Materials Now Available! We’re truly grateful for the hundreds (maybe thousands!) of wonderful attendees who joined our #NeurIPS Human–AI Alignment Tutorial 💗 -- Thank you all for your enthusiasm, thoughtful questions, and all the inspiring follow-up conversations 🤗! As many of you requested during #NeurIPS, we would love to share with you the full tutorial video and all slides below provided by our amazing speakers @mitchellgordon @adamfungi @Yoshua_Bengio: 📺 Tutorial Recording: neurips.cc/virtual/2025/l… 📕All Slides: hai-alignment-course.github.io/tutorial/ We’d also love to hear more of your questions and feedback — and hope these resources spark new ideas and collaborations in Human–AI Alignment research🔥!
Hua Shen✨ tweet media
Hua Shen✨@huashen218

🚀 Thrilled to announce our upcoming #NeurIPS2025 Tutorial on Human–AI Alignment: Foundations, Methods, Practice, and Challenges! 🗓️ Dec 2, 09:30–12:00 PST 📍 Exhibit Hall F, San Diego Convention Center 🔗 NeurIPS program: neurips.cc/virtual/2025/l… 👉 Tutorial Website: hai-alignment-course.github.io/tutorial/ With an incredible lineup of speakers — @mitchellgordon, @adamfungi, @Yoshua_Bengio — we’ll dive into: * Human-in-the-loop AI & Value Alignment * Collective Alignment * Sociotechnical Evaluation and Oversight * A Safety Argument for the Scientist AI 🌟 An exceptional interdisciplinary expert panel -- featuring insights from @dawnsongtweets, @eegilbert, @monojitchou, and @hannahrosekirk! 👫 Welcome to join us for an exciting and engaging session — let’s shape the future of Human–AI Alignment together! #NeurIPS2025 #HAIAlignment #ValueAlignment #CollectiveAlignment #AISafety #ResponsibleAI

English
4
14
78
11.8K
Mitchell Gordon retweetledi
Hua Shen✨
Hua Shen✨@huashen218·
🚀 Thrilled to announce our upcoming #NeurIPS2025 Tutorial on Human–AI Alignment: Foundations, Methods, Practice, and Challenges! 🗓️ Dec 2, 09:30–12:00 PST 📍 Exhibit Hall F, San Diego Convention Center 🔗 NeurIPS program: neurips.cc/virtual/2025/l… 👉 Tutorial Website: hai-alignment-course.github.io/tutorial/ With an incredible lineup of speakers — @mitchellgordon, @adamfungi, @Yoshua_Bengio — we’ll dive into: * Human-in-the-loop AI & Value Alignment * Collective Alignment * Sociotechnical Evaluation and Oversight * A Safety Argument for the Scientist AI 🌟 An exceptional interdisciplinary expert panel -- featuring insights from @dawnsongtweets, @eegilbert, @monojitchou, and @hannahrosekirk! 👫 Welcome to join us for an exciting and engaging session — let’s shape the future of Human–AI Alignment together! #NeurIPS2025 #HAIAlignment #ValueAlignment #CollectiveAlignment #AISafety #ResponsibleAI
Hua Shen✨ tweet media
Hua Shen✨@huashen218

Thrilled to share that our paper “Towards Bidirectional Human-AI Alignment” has been accepted to #NeurIPS2025 (Position Track)! 🎉 👫<>🤖We argue for an explicit reflection on what we mean by “alignment”, and to take into account the bidirectional, dynamic interactions between humans and AI to achieve truly responsible and safe AI systems. 🧠+ if you’re generally interested in “alignment”, don’t miss our #NeurIPS2025 Tutorial on “Human-AI Alignment: Foundations, Methods, Practice, and Challenges” , with amazing @mitchellgordon & @adamfungi — more details coming soon! - 💎 NeurIPS 2025 Position Paper: arxiv.org/pdf/2406.09264 - 📚 NeurIPS 2025 Tutorial: neurips.cc/virtual/2025/t… 💗 Huge thanks to our incredible co-authors — this was our 3rd resubmission — your persistent support and encouragement made it happen! Big thanks to everyone in our ICLR & CHI 2025 BiAlign workshops — your enthusiasm keeps us believing we’re doing something right for our community.🙏 ☕️👯‍♀️I’m attending #COLM2025 at Montreal this week, happy to chat more if you’re around! Also, we (w/ multiple co-authors) will present our #BiAlign paper in-person @SanDiego -- catch us at #NeurIPS2025, we’d love to hear your thoughts and join discussions!

English
2
12
108
40.1K
Mitchell Gordon retweetledi
Tyna Eloundou
Tyna Eloundou@ThankYourNiceAI·
No single person or institution should define ideal AI behavior for everyone.  Today, we’re sharing early results from collective alignment, a research effort where we asked the public about how models should behave by default.  Blog here: openai.com/index/collecti…
English
72
91
544
181.9K
Mitchell Gordon retweetledi
OpenAI
OpenAI@OpenAI·
We’ve spent the last few days doing a deep dive on what went wrong with last week’s GPT-4o update in ChatGPT. Expanding on what we missed with sycophancy and the changes we’re going to make in the future: openai.com/index/expandin…
English
483
528
4.7K
2.2M
Mitchell Gordon retweetledi
Jane E
Jane E@janee424·
a bit of a late announcement... I will be starting next year as an assistant professor at @NUSComputing (and am recruiting students!) and just started a postdoc with @landay @StanfordHAI :) thank you to all of my mentors and friends for the support throughout this journey 💜
English
20
7
83
6.2K