Stair AI
48 posts

Stair AI
@Stair_AI
The infrastructure that makes machine intelligence transparent, accountable, and bankable in the age of autonomous finance.

Excited to give this talk at the Stanford Digital Economy Lab on May 18! I will do three things: discuss my group's recent research, identify the most pressing gaps in the community's current understanding, and provide a long-term perspective. Hope to see you there in person or virtually. digitaleconomy.stanford.edu/event/arvind-n… @DigEconLab







Failed Agent experiments can be publishable too🤯 Introducing ICML 2026 Workshop Failure Modes in Agentic AI! We welcome negative results, failed rollouts, debugging traces, reproducible failure cases, and analysis of why agents break. 📍FAGEN @ ICML 2026 🗓 Submission deadline: May 8 11:59 PM AOE 🗓 Notification: May 15 🔗fmai-workshop.github.io Find it. Reproduce it. Trace it. Fix it. We also welcome relevant ICML submissions, especially papers with strong insights that may not have found the right home in the main track!

1/ 🧠 Long reasoning ≠ reliable reasoning. Large reasoning models can write long, convincing chains of thought… and still end with a wrong answer. Our new @icmlconf paper asks: Can we use the reasoning trace itself to detect when the final answer is hallucinated? 🧵👇





