Jillian Ross @ICLR26

@JillianRossA_

PhD @MIT researching LLMs for finance

加入时间 Ekim 2025

26 关注9 粉丝

Jillian Ross @ICLR26@JillianRossA_·7h

@alanamarzoev and I had a great time presenting OpenEstimate at our #ICLR2026 poster session today! Thanks to everyone who came out to chat about evaluating LLM reasoning under uncertainty.

English

Jillian Ross @ICLR26@JillianRossA_·3d

On my way to #ICLR2026 to present OpenEstimate with @alanamarzoev and give a spotlight talk at the FINAI Workshop. Over the past few years, @AndrewWLo and I have been studying whether LLMs can be trusted to give sound investment advice. In my talk, I'll show that LLMs demonstrate heuristic collapse: rather than weighing all relevant factors, they latch onto a few salient features and ignore the rest. Heuristic collapse has direct consequences for whether LLMs can meet the legal standard of a fiduciary — and for AI advisors more broadly. This is one of many reasons I think investing is one of the best domains for studying LLMs. Through this domain, I've been able to study LLM reasoning, human-LLM interaction, and emergent systemic effects. If you're working on any of these topics, I'd love to meet. Come find me before or after the talk on Monday at 1:35PM!

English

241

Jillian Ross @ICLR26 已转推

Alana Renda @ICLR26 🇧🇷@alanamarzoev·3d

Heading to #ICLR2026 (@iclr_conf) 🇧🇷 to present OpenEstimate! As LLMs get deployed in decision-making domains, they're increasingly expected to do subjective probability estimation, drawing on everything they know to form beliefs about unknown quantities. Our paper studies this capability with a leakage-resistant benchmark. This sits at the intersection of a few things I care about: RL in hard-to-verify domains, forecasting, and making LLMs honest about what they don't know. Come find me Saturday 10:30–1 at poster #1716 in Pavilion 3! And if you'd like to grab coffee and chat about any of these, DMs are open!

English

6.5K

Jillian Ross @ICLR26 已转推

Jacob Andreas@jacobandreas·23 Eki

👉 New preprint! We have lots of great benchmarks for tasks where it's possible, in principle, for models to get all the answers exactly correct. But what about tasks that *intrinsically* require reasoning about uncertain facts and quantities?

Alana Renda @ICLR26 🇧🇷@alanamarzoev

🚨 New paper up on how LLMs reason under uncertainty! 🎲 Many real world uses of LLMs are characterized by the unknown—not only are the models prompted with partial information, but often even humans don't know the "right answer" to the questions asked. Yet most LLM evals focus on problems with clearly defined success criteria. There’s a gap in our understanding of how models perform in this setting. We investigate.... 🔎

English

12.8K

发现

@alanamarzoev @AndrewWLo @iclr_conf @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates