Jingxuan Fan

65 posts

Jingxuan Fan banner
Jingxuan Fan

Jingxuan Fan

@fjxdaisy

Harvard Neuroscience PhD / Harvard AM Master / MIT BCS ‘20 / ex-Meta, Amazon intern

Katılım Mart 2016
579 Takip Edilen300 Takipçiler
Jingxuan Fan
Jingxuan Fan@fjxdaisy·
Excited to share our new work. Thanks to all the collaborators!
Hanlin Zhang@_hanlin_zhang_

Learning from feedback is instrumental but human preference data can be expensive. How much reward supervision could we get from raw web text instead, without human labels? Our latest work, built on a year of incredible effort by @fjxdaisy, advances pure RLHF training across multiple models and tasks. Unsupervised Reward Modeling: split web docs into (prefix, true continuation); treat mismatched continuations in-batch as negatives; train w/ BT loss + score-centering. Findings: 📝 steady gains on RewardBench v1/v2 using just 11M tokens of math web text 📝 transfers across backbones (Llama-3.2 1B/3B, Qwen2.5 3B/7B Instruct) 📝 improves Best-of-N selection (math + safety) and provides a usable reward for GRPO policy optimization 📝 acts as a mid-training procedure that helps further RLHF 📑Paper: arxiv.org/abs/2603.02225 🌐Project: jingxuanf0214.github.io/reward-scaling Joint work with @lisali126, @ZhentingQi, @zdhnarsil, @xkianteb, @ShamKakade6

English
0
0
3
477
Jingxuan Fan retweetledi
Xianjun Yang
Xianjun Yang@xianjun_agi·
📢My New Paper: Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder TLDR: We proposed to use features from SAEs as a measure for data diversity&complexity and proved it's effectiveness on data selection for LLM tuning. arxiv.org/pdf/2502.14050
Xianjun Yang tweet media
English
7
37
180
18.6K
Jingxuan Fan
Jingxuan Fan@fjxdaisy·
@ylongqi Hi Longqi, I am interested in this opportunity and have emailed you my background. Thanks!
English
0
0
1
457
Longqi Yang
Longqi Yang@ylongqi·
Internship alert! We have an immediate part-time research intern opening at Microsoft’s Office of Applied Research to improve LLM reasoning. Please reach out if you or your students are interested!
English
48
39
469
52.5K
Jingxuan Fan retweetledi
Yung-Sung Chuang
Yung-Sung Chuang@YungSungChuang·
(1/5)🚨LLMs can now self-improve to generate better citations✅ 📝We design automatic rewards to assess citation quality 🤖Enable BoN/SimPO w/o external supervision 📈Perform close to “Claude Citations” API w/ only 8B model 📄arxiv.org/abs/2502.09604 🧑‍💻github.com/voidism/SelfCi…
Yung-Sung Chuang tweet mediaYung-Sung Chuang tweet media
English
12
74
314
39.4K
Jingxuan Fan retweetledi
Jiao Sun
Jiao Sun@sunjiao123sun_·
Mitigating racial bias from LLMs is a lot easier than removing it from humans! Can’t believe this happened at the best AI conference @NeurIPSConf We have ethical reviews for authors, but missed it for invited speakers? 😡
Jiao Sun tweet media
English
175
782
3.7K
2.2M
Huan Wang
Huan Wang@huan__wang·
We're hiring AI Research Interns for Summer 2025! Spend 3 months with us working on AI Agents, LLMs, Reasoning, Planning & more—with a focus on publishing high-quality academic papers. If you have a strong publication record, apply or DM me! #researchpaper #JobOpening #intern
English
63
94
913
183.5K
Jingxuan Fan retweetledi
Chen Sun 🤖
Chen Sun 🤖@ChenSun92·
1/N New paper alert: What happens when a new piece of knowledge is introduced into the training data and how long does it last while a large language model (LM) continues to train?
Chen Sun 🤖 tweet media
English
2
12
50
7.3K
Jingxuan Fan retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re presenting the first AI to solve International Mathematical Olympiad problems at a silver medalist level.🥈 It combines AlphaProof, a new breakthrough model for formal reasoning, and AlphaGeometry 2, an improved version of our previous system. 🧵 dpmd.ai/imo-silver
GIF
English
286
1.2K
4.6K
2M
Jingxuan Fan retweetledi
Shan Meltzer
Shan Meltzer@shanmeltzer·
Thrilled to announce that I have accepted a TT faculty offer at Vanderbilt University in the Dept. of Pharmacology @VandyPharm and Vanderbilt Brain Institute @VanderbiltBrain, starting in September 2024!! 🥳🥳 We will study mechanisms of somatosensory circuit assembly for touch and pain, in health and disease. We will be hiring at all levels!!
Shan Meltzer tweet media
English
78
33
749
97.6K
Jingxuan Fan retweetledi
Demis Hassabis
Demis Hassabis@demishassabis·
Excited to share #AlphaMissense our new AI system that can classify whether genetic mutations (missense variants) are benign or harmful - a critical step toward uncovering causes of many diseases, from cystic fibrosis to cancer. In @ScienceMagazine today dpmd.ai/AlphaMissenseDH
English
56
634
2.9K
450.9K
Jingxuan Fan retweetledi
Binxu Wang 🐱
Binxu Wang 🐱@WangBinxu·
Excited to have given a 2-day ML from Scratch tutorial on Transformers #GPT at Harvard Medical School this week! Thanks to the huge interest, it was one of the most popular of its kind! Check out Slides: tinyurl.com/yc3s4h23 Links to coding exercise: tinyurl.com/2p8absjw
Binxu Wang 🐱 tweet mediaBinxu Wang 🐱 tweet media
English
2
29
141
15.6K
Jingxuan Fan retweetledi
Cengiz Pehlevan
Cengiz Pehlevan@CPehlevan·
I will be missing #cosyne2022 this year, but my group will be there presenting all these great work! I-009 Neural network size balances representational drift and flexibility during Bayesian sampling @jzavatoneveth @canatar_a (1/3)
English
1
2
29
0