Seth Neel

1.1K posts

Seth Neel

@SethInternet

📍something new prev: rs @googleai (gemini data), asst professor @harvard (biz + cs affil), co-founder @welligence, PhD @penn

NYC x Bay Area Присоединился Ağustos 2016

730 Подписки1.8K Подписчики

Закреплённый твит

Seth Neel@SethInternet·23 Oca

Excited to see our work on leveraging datamodels for unlearning published in @iclr_conf — check out our blog post below for details!

Andrew Ilyas@andrew_ilyas

Machine unlearning ("removing" training data from a trained ML model) is a hard, important problem. Datamodel Matching (DMM): a new unlearning paradigm with strong empirical performance! w/ @kris_georgiev1 @RoyRinberg @smsampark @shivamg_13 @aleks_madry @SethInternet (1/4)

English

5.7K

Seth Neel ретвитнул

Jeff Dean@JeffDean·23 Ara

I'm delighted to jointly author this year-end summary of research advances with @DemisHassabis and James Manyika, on behalf of all of our colleagues across @GoogleDeepMind, @GoogleResearch and @Google. We look at research advances across eight different areas. These summaries are always fun to work on because one can reflect back on the breadth and depth of our collective work over the last year! blog.google/technology/ai/…

English

331

2.8K

472.2K

Seth Neel@SethInternet·17 Ara

@joemelko @gopalkraman Nice talk @joemelko !

English

143

Joe Melkonian@joemelko·17 Ara

please enjoy a video of me asking people to be more thoughtful about data work: thanks @gopalkraman for inviting me and to everyone who asked questions / came by to chat after :)

Gopal@gopalkraman

.@joemelko argues that data curation should be treated as an optimization problem, not guesswork. he walks us through how to learn ... how to learn.

English

4.3K

Seth Neel ретвитнул

Dylan Neel, MD PhD@dylanvneel·21 Eki

🚨 New Biomarker article 🧠 Analysis of early-stage #neuro M&A and partnerships in #biotech. open.substack.com/pub/biomarker/… #VC #Neuroscience

English

351

Seth Neel@SethInternet·17 Eki

@rzshokri @acm_ccs Congratulations that is incredible!

English

108

Reza Shokri@rzshokri·16 Eki

Grateful and happy to receive the ACM CCS @acm_ccs Test of Time Award for our “Privacy-Preserving Deep Learning” paper with Vitaly Shmatikov back in 2015. First reaction: “10 years, man!”

English

Seth Neel@SethInternet·18 Eyl

🏛️ Governance for the LLM era. Audit payout splits, credit for specific responses, and unlearning claims—without a million-dollar retrain. Check out our paper: arxiv.org/pdf/2508.10866 Joint work Ari Karchmer & Martin Pawelcyzk

English

198

Seth Neel@SethInternet·18 Eyl

🚨 Why unverifiable attributions are risky This computational disparity creates a critical trust gap. If payouts or compliance hinges on the accuracy of these attributions, model providers can in theory game naive checks based on MSE: • Repayment/underpayment: scale all scores down → everyone underpaid. • Favoritism: inflate a subset’s scores → friends overpaid, others shorted.

English

215

Seth Neel@SethInternet·18 Eyl

🧪New 📜in NeurIPS '26: data attribution could power data dividends, safety checks, debugging, but existing attribution methods are $$ for LLMs. We develop a new protocol that lets computationally weak third parties verify the accuracy of data attributions w/o computing them!🧵

English

613

Seth Neel@SethInternet·18 Eyl

✅ Our fix: cheap, rigorous verification. A two-message interactive protocol with PAC-Verification guarantees. The Verifier’s cost is O(1/ε²) (and independent of dataset size N), certifying the reported attribution is ε-close to optimal—closing gaming loopholes. We provide strong soundness guarantees, even against an arbitrarily malicious Prover with infinite compute.

English

Seth Neel@SethInternet·18 Eyl

🧩 Broad coverage. Our method applies to arbitrary linear functionals over {±1}^N, capturing predictive training-data attributions (e.g., datamodels, empirical influence) and extending to component attributions used in practice.

English

131

Seth Neel@SethInternet·18 Eyl

English

Seth Neel@SethInternet·7 Eyl

@deliprao Say a bit more?

English

Delip Rao e/σ@deliprao·7 Eyl

Ooh. This is a juicy result for uncertainty calibration in LLMs.

xuan (ɕɥɛn / sh-yen)@xuanalogue

TIL that the asymptotically unbiased estimator produced by MCMC can be turned into an unbiased estimator using only a *finite* number of iterations! The trick is v neat: Turn an infinite limit into a telescoping sum, then arrange to have all terms after some step t cancel out.

English

3.9K

Seth Neel@SethInternet·18 Ağu

@pratyushmaini congrats pratyush + team! this looks like amazing work!

English

Pratyush Maini@pratyushmaini·18 Ağu

15/Needless to say, such a massive undertaking could not have been accomplished without a stellar engineering team that helped us scale our work to trillions of tokens. If you are excited about this, join us jobs.ashbyhq.com/DatologyAI

English

1.7K

Pratyush Maini@pratyushmaini·18 Ağu

1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns. Today @datologyai shares BeyondWeb, our synthetic data approach & all the learnings from scaling it to trillions of tokens🧑🏼‍🍳 - 3B LLMs beat 8B models🚀 - Pareto frontier for performance

English

125

723

185.2K

Seth Neel ретвитнул

Google AI@GoogleAI·22 Tem

Gemini 2.5 Flash-Lite is now stable and generally available for developers and enterprise customers! ⚡ When designing a Gemini model, we think a lot about the tradeoffs between quality, cost, and latency. Previously with 2.0 Flash-Lite we optimized for cost-efficiency over latency. As we built our next iteration, we also wanted to push the boundaries on latency to see how fast we could get the model to think and respond. Resulting in 2.5 Flash-Lite, our fastest, most cost-efficient 2.5 model yet, with lower latency than both 2.0 Flash-Lite and 2.0 Flash on a broad sample of prompts. Try it out in ai.studio and @GoogleCloud Vertex AI.

English

687

258.9K

Seth Neel@SethInternet·6 May

@niloofar_mire @CMU_EPP @LTIatCMU @AIatMeta @kamalikac that is amazing niloofar huge congrats (!!!!)

English

529

Niloofar@niloofar_mire·6 May

📣Thrilled to announce I’ll join Carnegie Mellon University (@CMU_EPP & @LTIatCMU) as an Assistant Professor starting Fall 2026! Until then, I’ll be a Research Scientist at @AIatMeta FAIR in SF, working with @kamalikac’s amazing team on privacy, security, and reasoning in LLMs!

English

226

1.2K

152.2K

Seth Neel@SethInternet·1 May

@marvin_li03 🥳🥳🥳🥳

QME

Marvin Li@marvin_li03·1 May

Accepted as a spotlight at #icml2025! See you in Vancouver 🎉

Marvin Li@marvin_li03

New paper alert!🚨 What do LLM reasoning, diffusions, & jailbreaks have in common? 🤔 All exhibit critical windows📈--a sudden formation of distinctive features during sampling, e.g. correctness or toxicity. We present a unifying theory of critical windows for diffusion & LLMs.

English

1.7K

David Segar@DaveSegar·7 Nis

I recently joined @Nudge as medical director. We're building a world-changing device, with the goal of precisely and non-invasively modulating brain function to improve the lives of as many people as possible.

Nudge@nudge

Our mission is to develop the best technology for interfacing with the brain to improve people’s lives.

English

13K

Seth Neel@SethInternet·8 Nis

@DaveSegar @nudge 👀🔥

QME

134

Открыть

@DemisHassabis @GoogleDeepMind @GoogleResearch @Google @joemelko @gopalkraman @rzshokri @acm_ccs