
@alanamarzoev and I had a great time presenting OpenEstimate at our #ICLR2026 poster session today! Thanks to everyone who came out to chat about evaluating LLM reasoning under uncertainty.

English
Jillian Ross @ICLR26
4 posts







🚨 New paper up on how LLMs reason under uncertainty! 🎲 Many real world uses of LLMs are characterized by the unknown—not only are the models prompted with partial information, but often even humans don't know the "right answer" to the questions asked. Yet most LLM evals focus on problems with clearly defined success criteria. There’s a gap in our understanding of how models perform in this setting. We investigate.... 🔎