Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท

126 posts

Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท banner
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท

Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท

@alanamarzoev

Currently: AI research @ MIT. Previously: founder/CEO at @readysetio, research @ Microsoft, UC Berkeley, Cornell.

NYC + BOS ๊ฐ€์ž…์ผ Ocak 2018
1.1K ํŒ”๋กœ์ž‰1.1K ํŒ”๋กœ์›Œ
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท
Starting in 30 min! If youโ€™re interested in deploying LLMs in decision making domains + reasoning under uncertainty come chat with me and @JillianRossA_
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท tweet media
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท@alanamarzoev

Heading to #ICLR2026 (@iclr_conf) ๐Ÿ‡ง๐Ÿ‡ท to present OpenEstimate! As LLMs get deployed in decision-making domains, they're increasingly expected to do subjective probability estimation, drawing on everything they know to form beliefs about unknown quantities. Our paper studies this capability with a leakage-resistant benchmark. This sits at the intersection of a few things I care about: RL in hard-to-verify domains, forecasting, and making LLMs honest about what they don't know. Come find me Saturday 10:30โ€“1 at poster #1716 in Pavilion 3! And if you'd like to grab coffee and chat about any of these, DMs are open!

English
1
2
10
1.2K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
Jillian Ross @ICLR26
Jillian Ross @ICLR26@JillianRossA_ยท
On my way to #ICLR2026 to present OpenEstimate with @alanamarzoev and give a spotlight talk at the FINAI Workshop. Over the past few years, @AndrewWLo and I have been studying whether LLMs can be trusted to give sound investment advice. In my talk, I'll show that LLMs demonstrate heuristic collapse: rather than weighing all relevant factors, they latch onto a few salient features and ignore the rest. Heuristic collapse has direct consequences for whether LLMs can meet the legal standard of a fiduciary โ€” and for AI advisors more broadly. This is one of many reasons I think investing is one of the best domains for studying LLMs. Through this domain, I've been able to study LLM reasoning, human-LLM interaction, and emergent systemic effects. If you're working on any of these topics, I'd love to meet. Come find me before or after the talk on Monday at 1:35PM!
English
0
1
5
211
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท
Heading to #ICLR2026 (@iclr_conf) ๐Ÿ‡ง๐Ÿ‡ท to present OpenEstimate! As LLMs get deployed in decision-making domains, they're increasingly expected to do subjective probability estimation, drawing on everything they know to form beliefs about unknown quantities. Our paper studies this capability with a leakage-resistant benchmark. This sits at the intersection of a few things I care about: RL in hard-to-verify domains, forecasting, and making LLMs honest about what they don't know. Come find me Saturday 10:30โ€“1 at poster #1716 in Pavilion 3! And if you'd like to grab coffee and chat about any of these, DMs are open!
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท tweet media
English
2
8
42
5.7K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
Gabe Grand @ ICLR 2026 ๐Ÿ‡ง๐Ÿ‡ท
Do AI agents ask good questions? We built โ€œCollaborative Battleshipโ€ to find outโ€”and discovered that weaker LMs + Bayesian inference can beat GPT-5 at 1% of the cost. Paper, code & demos: gabegrand.github.io/battleship Here's what we learned about building rational information-seeking agents... ๐Ÿงต๐Ÿ”ฝ
English
4
35
174
44.4K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท
๐Ÿšจ New paper up on how LLMs reason under uncertainty! ๐ŸŽฒ Many real world uses of LLMs are characterized by the unknownโ€”not only are the models prompted with partial information, but often even humans don't know the "right answer" to the questions asked. Yet most LLM evals focus on problems with clearly defined success criteria. Thereโ€™s a gap in our understanding of how models perform in this setting. We investigate.... ๐Ÿ”Ž
English
6
23
130
26.2K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท
Bonjour from Montreal ๐Ÿ‡จ๐Ÿ‡ฆ spending the next few days here @ COLM! DM me if youโ€™re around and want to chat about research or non-research topics, including but not limited to: reasoning under uncertainty, forecasting, summarization/RAG, and startups
English
0
2
10
4.2K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
Alex Renda
Alex Renda@alex_renda_ยท
โœˆ๏ธ ๐Ÿฆ™ Heading to COLM through Thursday! Weโ€™re hiring ML researchers at Jane Street for intern and full time roles, as well as supporting grad students through our fellowship program โ€” DM me or stop by the JS booth if you want to chat about what weโ€™re doing with ML @ JS!
Alex Renda tweet media
English
1
2
13
1.7K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท@alanamarzoevยท
after a week of deliberation finally took the leap and upgraded to the ChatGPT pro plan... feels like waking up on Christmas morning ๐Ÿฅฒ
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท tweet media
English
1
0
5
668
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
Readyset
Readyset@readysetioยท
Streaming dataflow provides a unique solution to scaling OLTP applications. Want to learn how? Founder and CEO of Readyset, @alanamarzoev, will be giving a talk on this subject at @qconlondon on Tuesday, April 9th at 10:35AM BST! Learn more: qconlondon.com/presentation/aโ€ฆ
English
1
3
10
1.4K
Alana Renda @ICLR26 ๐Ÿ‡ง๐Ÿ‡ท ๋ฆฌํŠธ์œ—ํ•จ
apuchitnis
apuchitnis@apuchitnisยท
caching can be really helpful to reduce backend load, but cache invalidation is famously one of the hard problems in CS enter readyset.io - a cache that is **always in sync** with postgres, so you don't need to invalidate stale data ๐Ÿ˜ฎ
English
1
2
2
567