
Max Lamparth

@MLamparth
Research Fellow @SISLaboratory @HooverInst at @Stanford | Focusing on interpretable, safe, and ethical AI/LLM decision-making. Ph.D. from TUM.

🚨 New SISL preprint: State-of-the-art language reward models are still badly biased. Past fixes overcorrect; some biases can be removed with simple latent interventions, while others point to the need for larger efforts.


Another SISL #ICLR 2026 paper: Current chain-of-thought models often produce persuasive-sounding rationales that don't actually reflect how the model reached its answer, so this work aims to make the reasoning itself carry the information the model needs to be right.

New #ICLR2026 paper with SISL contributions: A new dynamic benchmarking platform for evaluating the trustworthiness of generative AI models, called TrustGen!