Sayak Ray Chowdhury retweetledi

📢 Excited to share our work "Active Preference Optimization for Sample Efficient RLHF" accepted at #ICML2024 Theoretical Foundations of Foundation Models (@tf2m_workshop) Workshop! Joint work with @Sayakrayc @SOURADIPCHAKR18 @aldopacchiano
arxiv.org/pdf/2402.10500
🧵(1/6)

English






