Guy Tennenholtz

129 posts

Guy Tennenholtz

Guy Tennenholtz

@guytenn

Research Scientist @GoogleAI

Se unió Mayıs 2016
171 Siguiendo320 Seguidores
Guy Tennenholtz retuiteado
Ofir Nabati
Ofir Nabati@ofirnabati·
Our paper, "Spectral Bellman Method: Unifying Representation and Exploration in RL," has been accepted to ICLR 2026! 🚀 Paper: arxiv.org/abs/2507.13181 with @guytenn , Bo Dai and Shie Mannor 🧵 1/n
English
1
2
9
607
Guy Tennenholtz retuiteado
Google Research
Google Research@GoogleResearch·
Ever had a perfect image in mind that a text-to-image model just couldn't capture? Our new reinforcement learning agent, PASTA, turns image generation into a collaborative conversation, learning your style to bring your vision to life. Learn how it works: goo.gle/4gRVEqA
English
7
57
330
25.8K
Guy Tennenholtz
Guy Tennenholtz@guytenn·
We train an LLM to be "aware" of how it will be used during inference. We show we can do this efficiently in SFT and RL under a Best-of-N inference strategy. Our model explores more efficiently and accounts for errors in our scoring model. Check it out: arxiv.org/pdf/2412.15287
English
0
0
2
215
Guy Tennenholtz
Guy Tennenholtz@guytenn·
We're releasing a new dataset with sequential interactions for text to image generation with human feedback. More data will follow very soon. Paper: arxiv.org/pdf/2412.10419 Dataset: kaggle.com/datasets/googl…
Ofir Nabati@ofirnabati

We're excited to share our new paper: "Personalized and Sequential Text-to-Image Generation"! Check out the paper and our new sequential human rater dataset! 👇 Paper: arxiv.org/pdf/2412.10419 Dataset: kaggle.com/datasets/googl… Details below.. 1/N 🧵

English
0
0
4
267
Guy Tennenholtz
Guy Tennenholtz@guytenn·
We construct a practical approach to estimate these three types of uncertainties, contributing to a more effective offline RL algorithm that performs well on synthetic and real-world medical data, improving sota offline RL performance. [4/5]
English
1
0
1
245
Guy Tennenholtz
Guy Tennenholtz@guytenn·
Very happy to share our new paper: "Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding" led by the brilliant @AlizeePace. Dive into a short research thread below! 🧵👇 [1/5] arxiv.org/pdf/2306.01157…
English
2
2
18
3.3K
Guy Tennenholtz
Guy Tennenholtz@guytenn·
Our recent exploration into representation-driven #ReinforcementLearning provides some interesting findings. Definitely worth a read! arxiv.org/abs/2305.19922. Accepted to #ICML2023
Ofir Nabati@ofirnabati

🎉 Excited to share our latest work accepted at #ICML2023: "Representation-Driven Reinforcement Learning" 🚀. In collaboration with @guytenn and @MannorShie, we've developed a representation-driven framework for reinforcement learning. arxiv.org/abs/2305.19922. 🧵[Thread] [1/n]

English
0
0
1
239
Guy Tennenholtz
Guy Tennenholtz@guytenn·
Our approach emphasizes the vital role of exploration in mitigating the adverse effects of popularity bias. Rather than aiming to eradicate popularity bias entirely, we focus on alleviating its negative repercussions, paving the way for improved user welfare. 🧵[Thread] [4/5]
English
1
0
0
110