
Anush Kini
17 posts

Anush Kini
@Ability_Guy
PhD student at Boston University | Prev @MSFTResearch








Excited to share that our paper “Provably Robust DPO: Aligning Language Models with Noisy Feedback” has been accepted at #ICML2024! We introduce Robust DPO, an unbiased estimate of the DPO loss that is robust to preference noise in the data. arxiv.org/abs/2403.00409 🧵[1/n]

Releasing MASAI: Modular Architecture for Software-engineering AI agents Modularity helps achieve highest resolution rate (28.33%) at <$2 avg. cost/issue on SWE-bench Lite @amuseddaman @twm_as @nalin_wadhwa @AbhavM @SaitejaUtpala @rkbairi @naga86 arxiv.org/abs/2406.11638



Let's try to make an open research in a @ml_collective style @ We opened a Discord channel to communicate discord.gg/RPZHdfCd (thanks @GoAbiAryan). The first meeting will be next Friday, April 19, at noon EST. The plan is to discuss what can be exciting to work on.

