Li Ding

29 posts

@li_ding_

Researcher @GoogleDeepMind working on AGI Control. Previously PhD @manningcics, Intern @GoogleResearch @Meta, Researcher @MIT. More: https://t.co/UzDlMWR898

Mountain View · Joined October 2023
187 Following · 218 Followers
Li Ding @li_ding_
@breadli428 @NeurIPSConf Loved it! Please do put the full version online. You will do many talks. Hard to beat a crowd that refuses to leave.
Chenhao Li @breadli428
The toughest moment in a PhD:
>spend a year building smth you’re proud of
>travel across world with advisors’ support to share it
>when your moment finally comes
>your mic gets cut because the previous schedule ran late
Heartbroken, but thanks to whom stayed for me @NeurIPSConf
Ralf Römer @ralfroemer99

@breadli428 doing a great job finishing his presentation after everyone got kicked out of the room at the Embodied World Models Workshop @NeurIPSConf three minutes into his talk... #NeurIPS2025

Li Ding @li_ding_
Ran into @lexfridman by chance earlier this week and finally sat down today. Perfect way to wrap up #NeurIPS. It’s been 6 years (how time flies!) since we worked together at MIT. Great catching up on life, ML, and old times.
Li Ding @li_ding_
Broke my Twitter silence for this one. 📸 Officially “UMass bros” according to the legend himself, so I’ll take it. 😎 #NeurIPS #NeurIPS2025
Bryon Tjanaka @btjanaka
Happy to share that after a very hectic past couple of weeks, I have defended my thesis and am now #phdone! I am excited to start in my new role as a SWE @Waymo next year. Thank you to my committee and advisor @snikolaidis19 as well as my labmates @icaroslab, family, and friends!
Li Ding @li_ding_
Thrilled to announce I’ve successfully defended my PhD! 🎓 Deeply grateful to my advisor Lee Spector, my committee @scottniekum, @MajiSubhransu, @jeffclune, and all collaborators, friends, and family. Milestone achieved, excited for the next chapter!
Li Ding retweeted
Scott Niekum @scottniekum
Preferences in RLHF often come from many people with differing values. Ryan's work explores how to infer a set of representative reward functions that captures that diversity, so that we can better reason about risk and fairness in these settings.
Ryan Bahlous-Boldi @RyanBoldi

Excited to share our new paper on Pareto Optimal Preference Learning (POPL)! 🎉 POPL aims to better align AI with diverse human values by building diverse sets of reward functions or policies! arxiv.org/abs/2406.15599… Work done with @li_ding_ , Lee Spector and @scottniekum

Li Ding @li_ding_
POPL enhances the safety and fairness of RLHF by aligning agents and LLMs with diverse human values. It effectively addresses hidden contexts in preferences, ensuring risk-sensitive alignment without additional labeling. Led by @RyanBoldi, w/ Lee Spector and @scottniekum.
Ryan Bahlous-Boldi @RyanBoldi

Excited to share our new paper on Pareto Optimal Preference Learning (POPL)! 🎉 POPL aims to better align AI with diverse human values by building diverse sets of reward functions or policies! arxiv.org/abs/2406.15599… Work done with @li_ding_ , Lee Spector and @scottniekum

Li Ding @li_ding_
In a nutshell, new results highlight QDHF's strength in handling complex prompts for GenAI. While diffusion models often struggle with composing objects and attributes correctly, QDHF enhances diversity to explore various compositions, thus improving the quality of responses.
Li Ding @li_ding_
🚀Thrilled to release the QDHF tutorial in @pyribs! Big shoutout to @btjanaka for his meticulous editing and insightful feedback👏. Dive into the tutorial to explore how QDHF enhances GenAI models with diversified, high-quality responses and apply these insights to your projects!
Bryon Tjanaka @btjanaka

Ecstatic to announce the release of @pyribs 0.7.1! The absolute highlight of this release is the QDHF (Quality Diversity through Human Feedback) tutorial contributed by @li_ding_! The tutorial is available here and runs on Google Colab in ~1 hour: docs.pyribs.org/en/stable/tuto…

Li Ding @li_ding_
Hello #NeurIPS2023! I will present our work on Quality Diversity through Human Feedback tomorrow 12/15 at @aloeworkshop. Feel free to stop by Room 211-213 for our spotlight talk (4:15-4:30) or catch us during the poster session (12:45-1:45). Let's chat about learning and more!
Li Ding retweeted
RL_Conference @RL_Conference
Thrilled to announce the first annual Reinforcement Learning Conference @RL_Conference, which will be held at UMass Amherst August 9-12! RLC is the first strongly peer-reviewed RL venue with proceedings, and our call for papers is now available: rl-conference.cc.
Li Ding retweeted
Scott Niekum @scottniekum
Thrilled to announce the first annual Reinforcement Learning Conference @RL_Conference, which will be held at UMass Amherst August 9-12! RLC is the first strongly peer-reviewed RL venue with proceedings, and our call for papers is now available: rl-conference.cc. 🧵
Li Ding @li_ding_
Great work led by @RyanBoldi ! QD maintains diversity with pre-defined diversity metrics (or learned, e.g., QDHF!), but is it necessary? This paper proposes an alternative that uses MMO to solve deceptive RL tasks, and outperforms ME on QD-score w/o even optimizing towards it!
Ryan Bahlous-Boldi @RyanBoldi

📢 New Paper Alert! arxiv.org/abs/2311.02283 🔍 Navigating Deceptive Domains w/o Explicit Diversity Maintenance Conclusion: Objectives are all you need!🚀 Authors: Me, @li_ding_ and Lee Spector Accepted @ the #NeurIPS2023 Workshop on Agent Learning in Open Endedness 🧵👇
