Sabitlenmiş Tweet

Announcing our fully open source code agent to support development in @leanprover. This has been a labor of love by our team at @MistralAI and we look forward to seeing what the #LeanProver community does with it!

English
Jason Rute
426 posts

@JasonRute
AI Researcher @ Mistral AI | Formally IBM Research | Former Mathematician/Logician/Data scientist | Building AI for math and reasoning










@royvanrijn Congratulations to the Mistral team! Great to see a dedicated open-source Lean 4 coding agent. The Lean community is growing fast, and tools like Leanstral will help more people get productive with Lean quickly. Looking forward to seeing how it evolves.






How often do LLMs claim to prove false mathematical statements? In our latest benchmark, BrokenArXiv, we find they do so very often. The best model, GPT-5.4, only rejects 40% of incorrect statements obtained by perturbing recent ArXiv papers, and other models do much worse.









