Andrew Silva

1.3K posts

Andrew Silva

@andrewsilva9

Research Scientist @  | Previously @ Toyota Research Institute and Google | PhD from Georgia Tech. @andrewsilva9.bsky.social

Katılım Eylül 2010

261 Takip Edilen472 Takipçiler

Andrew Silva@andrewsilva9·6d

@tw_killian @BYU @BYUCS Amazing!! Congratulations Taylor, so excited for you 😁

English

Taylor W. Killian@tw_killian·6d

📣 There's never a "best" time to share important updates, especially after sitting on this for so long... I'm joining the faculty @BYU + @BYUCS this Summer as an Assistant Professor in preparation for the upcoming school year. Lots of excitement and a fair bit of nerves. 🧵

English

169

17.3K

Andrew Silva retweetledi

Amir Joudaki@AmirJoudaki·25 Nis

Neural nets don’t just forget. Sometimes, after long training, they lose the ability to learn at all. In our #ICLR2026 poster, we model Loss of Plasticity as gradient dynamics trapped in invariant manifolds: 🔴 frozen units, 🔵 cloned units. The video makes the traps visible.

English

611

100.3K

Andrew Silva retweetledi

Yizhe Zhang@YizheZhangNLP·7 Nis

1/6 The "Self-Improvement" Paradox Can an LLM get smarter using only its own raw, unverified outputs? No verifiers. No teachers. No RL. We found the answer is an emphatic YES. Introducing SimpleSD: Embarrassingly Simple Self-Distillation. By simply sampling solutions from a model with specific temperature and truncation settings and then fine tuning the model on those exact samples, Qwen3-30B jumped from 42.4% to 55.3% (30% improvement) on LiveCodeBench v6 just by training on its own samples! 🚀 The gain is universal across different model sizes (4B, 8B, 30B) and model families (Llama, Qwen). The harder the problem is, the larger the gain. 📈 Kudos to my amazing colleagues @onloglogn, @richard_baihe, @UnderGroundJeg, Navdeep Jaitly, @trebolloc. Check out the paper and code below! 👇 paper: arxiv.org/abs/2604.01193 code: github.com/apple/ml-ssd HF models: huggingface.co/collections/ap…

English

196

17.3K

Andrew Silva retweetledi

Michael Kirchhof@mkirchhof_·6 Mar

New paper 🥳 RL relies a lot on an agent’s capability to explore. Our strategy-guided exploration makes the agent find new solutions more efficiently. It learns faster, and in some environments its Pass@1 surpasses the base model’s Pass@128. 🧵1/6 📄 arxiv.org/abs/2603.02045

English

4.4K

Andrew Silva retweetledi

Jason Ramapuram@jramapuram·26 Şub

Autoregressive models dominate, but what if we treat multimodal generation as discrete order agnostic iterative refinement? Excited to share our systematic study on the design space of Tri-Modal Masked Diffusion Models (MDMs). We pre-trained the first Tri-Modal MDM from scratch on (text,), (image, text), and (audio, text). The same model can do ASR, TTS, T2I, captioning and native text generation. What I'm the most proud of in this work is the scientific rigor. Over 3,500 training runs. Principled hyperparameter transfer. Honest results. Carefully controlled ablations across multiple different axis of entanglement. A thread on our empirical findings (arXiV: arxiv.org/abs/2602.21472)

English

238

40K

Andrew Silva retweetledi

Eran Malach@EranMalach·17 Eki

SSMs promised efficient language modeling for long context, but so far seem to underperform compared to Transformers in many settings. Our new work suggests that this is not a problem with SSMs, but with how we are currently using them. Arxiv: arxiv.org/pdf/2510.14826 🧵

English

419

115.3K

Andrew Silva retweetledi

Yuyang Wang@YuyangW95·24 Eyl

New preprint & open-source! 🚨 “SimpleFold: Folding Proteins is Simpler than You Think” (arxiv.org/abs/2509.18480). We ask: Do protein folding models really need expensive and domain-specific modules like pair representation? We build SimpleFold, a 3B scalable folding model solely built on general-purpose transformers + flow matching, and is trained on 9M structures. SimpleFold supports easy deployment and efficient inference on consumer-level hardware with PyTorch/MLX (try it on your MacBook!) (1/n)

English

354

105.1K

Andrew Silva@andrewsilva9·26 Ağu

Today marks my first day (back) at Apple's MLR group, where I am starting as a research scientist exploring personalization and LLMs! I'm incredibly grateful for my time with TRI (and I can't wait to share that work once it's ready!), and so excited to get started at Apple!

English

3.2K

Andrew Silva@andrewsilva9·14 Ağu

@yoavgo I recently had one there for a month and a half maybe? I just waited and it cleared eventually…

English

265

(((ل()(ل() 'yoav))))👾@yoavgo·14 Ağu

did you ever got a paper "stuck" in the arxiv publication queue? how did you resolve it?

English

3.1K

Andrew Silva retweetledi

PALS NLP Workshop@pals_nlp_wrkshp·4 Ağu

We’re now accepting ARR commitments for PALS 2025! If your ARR-reviewed paper fits with our themes on LLM personalization, submit by September 4: openreview.net/group?id=EMNLP… See the full call for papers and topic details on our website: pals-nlp-workshop.github.io #EMNLP25 @emnlpmeeting

English

1.1K

Andrew Silva retweetledi

PALS NLP Workshop@pals_nlp_wrkshp·28 Tem

3 days remaining for direct submissions to PALS 2025! Share your findings or works in progress on LLM personalization here: openreview.net/group?id=EMNLP… See our website for the call for papers and information about relevant topics: pals-nlp-workshop.github.io #EMNLP25 @emnlpmeeting

English

543

Andrew Silva retweetledi

Rin Metcalf Susa@RinMetcalfSusa·15 Tem

📣 We are excited to present our work on inferring user preferences from writing samples at @icmlconf Poster Session 3 (Wed. 11:00AM - 1:30PM)! Come by to ✋ chat with us, 📄 learn about our method, and 💻 hear about our new interactive benchmark (🔗s below)!

English

490

Andrew Silva@andrewsilva9·12 Tem

Enjoyed this paper and the central idea, so I wrote a quick summary with some thoughts on future work: andrew-silva.github.io/posts/deepmind… Thanks for the great work @yanming_wan @jiaxing_jxwu @marwaabdulhai @LiorShan @natashajaques

Yanming Wan@yanming_wan

Personalization methods for LLMs often rely on extensive user history. We introduce Curiosity-driven User-modeling Reward as Intrinsic Objective (CURIO) to encourage actively learning about the user within multi-turn dialogs. 📜 arxiv.org/abs/2504.03206 🌎 sites.google.com/cs.washington.…

English

2.9K

Andrew Silva retweetledi

PALS NLP Workshop@pals_nlp_wrkshp·3 Tem

Our submission site is now live! Direct submissions for PALS 2025 can be made here: openreview.net/group?id=EMNLP… See our website for the call for papers and information about relevant topics: pals-nlp-workshop.github.io #EMNLP25 @emnlpmeeting

English

3.8K

Andrew Silva@andrewsilva9·25 Haz

@Amireskndri @pals_nlp_wrkshp @emnlpmeeting Hi Amir, thanks for catching this! The deadline is August 1, we have just fixed the website.

English

Amir Eskandari@Amireskndri·19 Haz

@pals_nlp_wrkshp @emnlpmeeting Hi, I noticed that the deadline is listed as July 18 on the website (#important-dates" target="_blank" rel="nofollow noopener">pals-nlp-workshop.github.io/#important-dat…), but this post mentions August 1. Could you please clarify which one is the correct deadline?

English

Andrew Silva retweetledi

PALS NLP Workshop@pals_nlp_wrkshp·10 Haz

Join us at @emnlpmeeting for: "Tailoring AI: Exploring Active and Passive LLM Personalization" 🎯🧠 To answer, when should LLMs personalize? What role do users play in LLM-personalization? 📅 Deadline Aug. 1 📝 Details in thread 🧵👇 #EMNLP2025 #LLM #AI #personalization 1/5

English

10.4K

Andrew Silva@andrewsilva9·31 Oca

@priontific Haha yes that is me! I have MLX_LM + PPO here github.com/andrew-silva/m…, but unfortunately I did not document it _super_ well (much of the documentation on how to run stuff is in the docstrings at the top of each file!). I haven't tried to reimplement GRPO yet though!

English

Priontific@priontific·28 Oca

@andrewsilva9 - any chance you’re the andrewsilva who made the MLX PPO cartpole example? I’m determined to get GRPO (or even just PPO) working in MLX_lm and I’ll be giving it a crack all week, but it’s already clear I’ll be needing help 😅

English

Priontific@priontific·27 Oca

Turns out that to get this to work, I'm gonna have to reimplement @huggingface 's GRPO trainer from the trl library into MLX... which I don't think I'll be able to do, even with Sonnet's help 😅 But I've put @cursor_ai into agentic mode and I'm gonna see what it can do lol

English

558

Andrew Silva retweetledi

James Smith@jamessealesmith·25 Oca

Today is my last day at Samsung Research America. I’m so grateful for the talented colleagues, exciting projects, and incredible mentorship from Yen-Chang Hsu that shaped the start of my career. Thank you, SRA, for this rewarding chapter—excited for what’s next!

English

409

Andrew Silva@andrewsilva9·4 Oca

@lulumeservey "Faithless is he that says farewell when the road darkens" - Gimli

English

Lulu Cheng Meservey@lulumeservey·4 Oca

Happy birthday to the immortal Professor Tolkien — may the hair on his toes never fall out! In annual tribute to one of the greatest geniuses of all time, here’s a collection of wisdom from his characters…

English

518

101.9K

Andrew Silva@andrewsilva9·31 Ara

@yoavgo I’ve noticed it also turns me into an integration engineer, spending a ton of time writing the bits of connector code between different AI generated modules. Which is also very unfun.

English

107

(((ل()(ل() 'yoav))))👾@yoavgo·31 Ara

ai co-coding will lead to the exact opposite: the ai will get the rewards from successfully solving all the microtasks, and we will only see the failures.

English

114

3.7K

(((ل()(ل() 'yoav))))👾@yoavgo·31 Ara

my prediction is that "ai improvements" wont replace programmers, but will make the job waaay less fun. i'll elaborate:

English

237

27.7K

Keşfet

@tw_killian @BYU @BYUCS @onloglogn @richard_baihe @UnderGroundJeg @trebolloc @yoavgo