Angelica Chen

131 posts

@_angie_chen

Gemini training @GoogleDeepMind, PhD from @NYUDataScience, previously @Princeton 🐅, angie-chen at 🦋

New York, NY · Joined February 2016
484 Following · 1.7K Followers
Kyunghyun Cho @kchonyc
please @_angie_chen fix it natively please
Kyunghyun Cho @kchonyc

@Google can't figure out how to remove the "[cite_start]" issue in their own gemini web app, so i wrote a browser extension to fix it myself ... 🤦 let me present the anticitestart extension; fully open-sourced and implemented by antigravity. expecting a zuck-level offer from google anytime now.

3 replies · 1 repost · 13 likes · 6.7K views
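For the curious, the fix itself is simple enough to sketch. Below is a minimal Python version of the same idea (the real project is a browser extension; this regex pass, the function name, and the exact marker pattern handled are assumptions for illustration, not the extension's actual code):

```python
import re

# Hypothetical post-processing pass: strip the stray "[cite_start]" markers
# (described in the tweet above) from text copied out of the Gemini web app.
CITE_START = re.compile(r"\[cite_start\]\s?")

def strip_cite_markers(text: str) -> str:
    """Remove "[cite_start]" markers and collapse doubled spaces left behind."""
    cleaned = CITE_START.sub("", text)
    return re.sub(r"  +", " ", cleaned)

if __name__ == "__main__":
    sample = "[cite_start]Attention is all you need. [cite_start]See Vaswani et al."
    print(strip_cite_markers(sample))
    # -> "Attention is all you need. See Vaswani et al."
```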
Angelica Chen retweeted
Lavender Jiang @lavenderjiang99
We built Lang1: a 100M–7B family of models specialized for hospital operations. After finetuning, Lang1-1B outperforms generalist models up to 671B, transfers to unseen tasks and another hospital, and is more data-efficient. 🧵Why small specialists win in healthcare.
[image]
1 reply · 8 reposts · 29 likes · 4.7K views
Angelica Chen retweeted
Richard Pang @yzpang_
🚨 Prompt Curriculum Learning (PCL): an efficient LLM RL training algorithm!
- We investigate factors that affect convergence: batch size, # prompts, # generations, prompt selection
- We propose PCL: a lightweight algorithm that *dynamically selects intermediate-difficulty prompts* using a learned value model
[image]
2 replies · 35 reposts · 170 likes · 24.2K views
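A minimal sketch of the selection step described above, assuming the learned value model maps each prompt to a predicted success probability and that "intermediate difficulty" means probabilities near 0.5 (the function names and interface here are illustrative, not the paper's API):

```python
import numpy as np

def select_intermediate_prompts(prompts, value_model, k, target=0.5):
    """Keep the k prompts whose predicted success probability is closest
    to `target`, i.e. the intermediate-difficulty prompts."""
    scores = np.array([value_model(p) for p in prompts])
    order = np.argsort(np.abs(scores - target))
    return [prompts[i] for i in order[:k]]

# Usage with a stand-in value model (a real one would be learned):
rng = np.random.default_rng(0)
prompts = [f"prompt-{i}" for i in range(100)]
fake_value_model = lambda p: rng.uniform()  # placeholder score in [0, 1]
print(select_intermediate_prompts(prompts, fake_value_model, k=8))
```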
Angelica Chen @_angie_chen
Working with Sadhika was one of the biggest highlights of my PhD and I couldn't be more excited to see her mentor the next generation of brilliant scientists! 🌟💫 Apply to work with her at UCSD!
Sadhika Malladi @SadhikaMalladi

Excited to share that I will be starting as an Assistant Professor in CSE at UCSD (@ucsd_cse) in Fall 2026! I am currently recruiting PhD students who want to bridge theory and practice in deep learning - see here: cs.princeton.edu/~smalladi/recr…

0 replies · 0 reposts · 8 likes · 1.1K views
Kyunghyun Cho @kchonyc
it is amazing to have @tallinzen as a colleague, as he inspires so many people, including myself and gemini 2.5 pro. here's the new position paper on Cardiology of Language Models, written almost entirely by Gemini 2.5 Pro with my promp ... nope ... supervision. (and thanks to @gregd_nlp for motivation!)
[two images]
Tal Linzen @tallinzen

introducing our new interpretability research paradigm, Cardiology of Language Models! it is based on a method we call the "stethoscope", where we train a linear classifier to discriminate between the LLM hidden states that represent a concept and those that do not!

9 replies · 12 reposts · 98 likes · 23.4K views
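A toy illustration of the "stethoscope" as described in the quoted tweet: a linear classifier trained to separate hidden states that represent a concept from those that do not. Random vectors stand in for real LLM hidden states here, so only the probe-fitting pattern carries over:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
d = 256  # hidden-state dimension (illustrative)
concept = rng.normal(0.5, 1.0, size=(500, d))  # states "with" the concept
other = rng.normal(0.0, 1.0, size=(500, d))    # states "without" it

X = np.vstack([concept, other])
y = np.array([1] * 500 + [0] * 500)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# The "stethoscope": a linear probe on hidden states.
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print(f"probe accuracy: {probe.score(X_te, y_te):.2f}")
```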
Xiao Ma @infoxiao
I've been automating parts of my research workflow. Sometimes amazing, sometimes I'm the engineer spending 3 hours to automate a 30-minute task. I think there might be a useful framework here: Context × Action
✅ Potential Sweet Spot (Low Context + High Action): "Run 10 evals across 20 checkpoints" → Clear task, lots of repetitive work
❌ Where It Seems to Break (High Context + Low Action): "Book me a flight but I prefer aisle seats, have Delta status, need to coordinate with friends..." Takes longer to explain than to just book it yourself.
Maybe this explains why so many AI features feel useless? The magic might happen when explanation is minimal but the task list is long.
[image]
18 replies · 55 reposts · 575 likes · 70.5K views
yi @agihippo
Happy birthday @vqctran
3 replies · 0 reposts · 6 likes · 2.5K views
Angelica Chen retweeted
Nathan C. Frey @nc_frey
LLMs vs Protein Design Tools: Who Wins? if you missed it at @icmlconf, here's the poster for the work led by @_angie_chen & @samuel_stanton_ that answers this question! Link to paper below 👇
[image]
2 replies · 9 reposts · 48 likes · 5.2K views
Angelica Chen retweeted
ML Collective @ml_collective
Starting in 30 min at 10am PT! @nsaphra presents at DLCT:
"Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs"
Angelica Chen, Ravid Shwartz-Ziv, Kyunghyun Cho, Matthew L. Leavitt, Naomi Saphra
arxiv.org/abs/2309.07311
Zoom 👇
[image]
3 replies · 3 reposts · 12 likes · 2.1K views
Angelica Chen @_angie_chen
We propose a new synthetic benchmark for highly constrained biophysical optimization tasks, along with a new semi-online RL algorithm for training an LLM to iteratively improve its own generations while using very few labels. Come chat with me and @samuel_stanton_!
1 reply · 0 reposts · 2 likes · 400 views
Angelica Chen @_angie_chen
Have you ever wondered how your specialized biomolecule engineering model compares with a general-purpose LLM on bio-optimization tasks? Come check out our work at ICML, in the East poster session (#E-2804) happening right now! #ICML2025
[image]
1 reply · 3 reposts · 31 likes · 6.1K views
Kyunghyun Cho @kchonyc
Gemini team, can you please fix your rendering to remove "[cite_start]" everywhere? i need to copy-paste the result into a new chat to have Gemini clean up these unnecessary and wrong markers. @orf_bnw get to work, please
5 replies · 0 reposts · 25 likes · 7K views
Angelica Chen retweeted
Jason Weston @jaseweston
🌉 Bridging Offline & Online RL for LLMs 🌉
📝: arxiv.org/abs/2506.21495
New paper shows, on verifiable & non-verifiable tasks:
- Online DPO & GRPO give similar performance.
- Semi-online (iterative) DPO, with a sync every s steps, is more efficient and also works very well.
- Offline DPO is way behind.
- Combining verifiable + non-verifiable works! Cross-transfer gains.
- Recipes for how to make this work.
🧵1/4
[image]
1 reply · 94 reposts · 452 likes · 68.2K views
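A schematic of the semi-online setup the thread describes, where the sampling model is re-synced to the trained policy every s steps (s = 1 recovers fully online training; very large s approaches the offline regime). Every model and update function below is a stub, not the paper's implementation:

```python
import copy

def generate_preference_pairs(model, prompts):
    """Stub: sample chosen/rejected responses from the generation model."""
    return [(p, f"{p}:chosen", f"{p}:rejected") for p in prompts]

def dpo_update(policy, batch):
    """Stub: one gradient step on the DPO loss; returns the updated policy."""
    return policy

def semi_online_dpo(policy, prompts, total_steps, sync_every):
    generator = copy.deepcopy(policy)  # frozen snapshot used for sampling
    for step in range(total_steps):
        if step % sync_every == 0:
            generator = copy.deepcopy(policy)  # periodic sync every s steps
        batch = generate_preference_pairs(generator, prompts)
        policy = dpo_update(policy, batch)
    return policy

policy = semi_online_dpo(policy=object(), prompts=["q1", "q2"],
                         total_steps=10, sync_every=5)
```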
Angelica Chen retweeted
Aran Komatsuzaki @arankomatsuzaki
Bridging Offline and Online Reinforcement Learning for LLMs Investigates the effectiveness of RL for finetuning LLMs when transitioning from offline to semi-online to fully online regimes for both verifiable and nonverifiable tasks.
[image]
3 replies · 42 reposts · 285 likes · 25.9K views