Jingxuan Fan

65 posts

@fjxdaisy

Harvard Neuroscience PhD / Harvard AM Master / MIT BCS ‘20 / ex-Meta, Amazon intern

Joined March 2016

579 Following 300 Followers

Jingxuan Fan@fjxdaisy·18 Mar

Excited to share our new work. Thanks to all the collaborators!

Hanlin Zhang@_hanlin_zhang_

Learning from feedback is instrumental but human preference data can be expensive. How much reward supervision could we get from raw web text instead, without human labels? Our latest work, built on a year of incredible effort by @fjxdaisy, advances pure RLHF training across multiple models and tasks.

Unsupervised Reward Modeling: split web docs into (prefix, true continuation); treat mismatched continuations in-batch as negatives; train w/ BT loss + score-centering.

Findings:
📝 steady gains on RewardBench v1/v2 using just 11M tokens of math web text
📝 transfers across backbones (Llama-3.2 1B/3B, Qwen2.5 3B/7B Instruct)
📝 improves Best-of-N selection (math + safety) and provides a usable reward for GRPO policy optimization
📝 acts as a mid-training procedure that helps further RLHF

📑Paper: arxiv.org/abs/2603.02225
🌐Project: jingxuanf0214.github.io/reward-scaling

Joint work with @lisali126, @ZhentingQi, @zdhnarsil, @xkianteb, @ShamKakade6
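
The recipe in the quoted post (prefix/continuation split, in-batch mismatched negatives, Bradley-Terry loss plus score-centering) can be sketched roughly as below. This is a minimal illustration, not the authors' implementation: the function names, the `split_frac` and `center_weight` parameters, and the squared-mean form of the centering penalty are all assumptions, and the score matrix stands in for the output of a real reward model.

```python
import numpy as np

def split_document(tokens, split_frac=0.5):
    """Split one tokenized document into (prefix, true continuation).
    split_frac is a hypothetical knob, not a value taken from the paper."""
    cut = max(1, int(len(tokens) * split_frac))
    return tokens[:cut], tokens[cut:]

def log_sigmoid(x):
    # Numerically stable log(sigmoid(x)) = -log(1 + exp(-x)).
    return -np.logaddexp(0.0, -x)

def in_batch_bt_loss(scores, center_weight=0.01):
    """Bradley-Terry loss with in-batch negatives plus a centering penalty.

    scores[i, j] is an assumed reward-model score for prefix i paired with
    continuation j. Diagonal entries are the true pairs; every off-diagonal
    entry is a mismatched continuation used as a negative.
    """
    scores = np.asarray(scores, dtype=float)
    n = scores.shape[0]
    pos = np.diag(scores)[:, None]        # score of the true pair, per prefix
    margins = pos - scores                # positive minus each candidate
    off_diag = ~np.eye(n, dtype=bool)     # keep only mismatched pairs
    bt = -log_sigmoid(margins[off_diag]).mean()
    # Assumed centering term: penalize the squared batch-mean score so the
    # reward scale does not drift. The paper's exact form may differ.
    centering = center_weight * scores.mean() ** 2
    return bt + centering
```

With an uninformative all-zero score matrix the Bradley-Terry term equals log 2, and it shrinks as true pairs are scored above the mismatched ones.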

Jingxuan Fan retweeted

Kempner Institute at Harvard University@KempnerInst·23 Apr

Are you at #ICLR2025? See the lineup of Kempner Institute presenters and check out their work! #ML #AI

Jingxuan Fan retweeted

Xianjun Yang@xianjun_agi·21 Feb

📢My New Paper: Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder TLDR: We proposed to use features from SAEs as a measure for data diversity & complexity and proved its effectiveness on data selection for LLM tuning. arxiv.org/pdf/2502.14050

Jingxuan Fan@fjxdaisy·20 Feb

@ylongqi Hi Longqi, I am interested in this opportunity and have emailed you my background. Thanks!

Longqi Yang@ylongqi·20 Feb

Internship alert! We have an immediate part-time research intern opening at Microsoft’s Office of Applied Research to improve LLM reasoning. Please reach out if you or your students are interested!

Jingxuan Fan retweeted

Yung-Sung Chuang@YungSungChuang·14 Feb

(1/5)🚨LLMs can now self-improve to generate better citations✅
📝We design automatic rewards to assess citation quality
🤖Enable BoN/SimPO w/o external supervision
📈Perform close to “Claude Citations” API w/ only 8B model
📄arxiv.org/abs/2502.09604
🧑‍💻github.com/voidism/SelfCi…

Jingxuan Fan@fjxdaisy·15 Feb

@CPehlevan Thanks Cengiz!

Cengiz Pehlevan@CPehlevan·15 Feb

@fjxdaisy congrats!

Jingxuan Fan@fjxdaisy·15 Feb

Glad to co-lead HARDMath (accepted to #ICLR2025)! HARDMath offers:
- a graduate-level benchmark uniquely targeting applied math techniques
- an automatic pipeline for high-quality problem/solution generation
For more details, check out Erik's thread below!

Erik Wang@erikyw26

We’re excited to share that HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics has been accepted to #ICLR2025! 🧵

Jingxuan Fan retweeted

Jiao Sun@sunjiao123sun_·14 Dec

Mitigating racial bias from LLMs is a lot easier than removing it from humans! Can’t believe this happened at the best AI conference @NeurIPSConf We have ethical reviews for authors, but missed it for invited speakers? 😡

Jingxuan Fan@fjxdaisy·5 Dec

@huan__wang I am interested! Have first author publication on LLM math reasoning & RL curiosity-driven exploration (homepage: jingxuanf0214.github.io).

Huan Wang@huan__wang·17 Nov

We're hiring AI Research Interns for Summer 2025! Spend 3 months with us working on AI Agents, LLMs, Reasoning, Planning & more—with a focus on publishing high-quality academic papers. If you have a strong publication record, apply or DM me! #researchpaper #JobOpening #intern

Jingxuan Fan retweeted

Chen Sun 🤖@ChenSun92·31 Oct

1/N New paper alert: What happens when a new piece of knowledge is introduced into the training data and how long does it last while a large language model (LM) continues to train?

Jingxuan Fan retweeted

Google DeepMind@GoogleDeepMind·25 Jul

We’re presenting the first AI to solve International Mathematical Olympiad problems at a silver medalist level.🥈 It combines AlphaProof, a new breakthrough model for formal reasoning, and AlphaGeometry 2, an improved version of our previous system. 🧵 dpmd.ai/imo-silver

Jingxuan Fan retweeted

Shan Meltzer@shanmeltzer·19 Jan

Thrilled to announce that I have accepted a TT faculty offer at Vanderbilt University in the Dept. of Pharmacology @VandyPharm and Vanderbilt Brain Institute @VanderbiltBrain, starting in September 2024!! 🥳🥳 We will study mechanisms of somatosensory circuit assembly for touch and pain, in health and disease. We will be hiring at all levels!!

Jingxuan Fan@fjxdaisy·18 Nov

@cosmo_shirley @PolymathicAI do you have research internship opportunities under Flatiron's summer intern program?

Shirley Ho@cosmo_shirley·18 Nov

We are hiring software-focused researchers at @PolymathicAI ! Our goal is to develop large foundation models for the sciences. apply.interfolio.com/136060

Jingxuan Fan retweeted

Demis Hassabis@demishassabis·20 Sep

Excited to share #AlphaMissense our new AI system that can classify whether genetic mutations (missense variants) are benign or harmful - a critical step toward uncovering causes of many diseases, from cystic fibrosis to cancer. In @ScienceMagazine today dpmd.ai/AlphaMissenseDH

Jingxuan Fan retweeted

Binxu Wang 🐱@WangBinxu·21 Apr

Excited to have given a 2-day ML from Scratch tutorial on Transformers #GPT at Harvard Medical School this week! Thanks to the huge interest, it was one of the most popular of its kind! Check out Slides: tinyurl.com/yc3s4h23 Links to coding exercise: tinyurl.com/2p8absjw

Jingxuan Fan retweeted

Shanshan Qin@ShanshanQin_·12 Jan

Our work on the representational drift is finally online! nature.com/articles/s4159…. Thanks to all my excellent collaborators @CPehlevan @fgh_shiva @dlipshutz @AnirvanMS @chklovskii

Jingxuan Fan retweeted

David Schneider@schneiderneuro·8 Jan

Excited to share this new work from @Audette_neuro! We found a prominent class of auditory cortex cells that signal prediction errors when mice hear something unexpected. 1/6

bioRxiv Neuroscience@biorxiv_neursci

Stimulus-specific prediction error neurons in mouse auditory cortex biorxiv.org/cgi/content/sh… #biorxiv_neursci

Jingxuan Fan retweeted

Aditya Nair ~ ആദി@neuronair·5 Jan

So excited to share my 1st first author paper with David Anderson & @Antihebbiann, with help from @TomomiKarigo, @scott_linderman, @SuryaGanguli & Mark Schnitzer in @CellCellPress!! We found an approx. line attractor that correlates with aggressiveness! cell.com/cell/fulltext/…

Jingxuan Fan retweeted

Cengiz Pehlevan@CPehlevan·17 Mar

I will be missing #cosyne2022 this year, but my group will be there presenting all this great work! I-009 Neural network size balances representational drift and flexibility during Bayesian sampling @jzavatoneveth @canatar_a (1/3)
