Andrew Silva

1.3K posts

Andrew Silva

Andrew Silva

@andrewsilva9

Research Scientist @  | Previously @ Toyota Research Institute and Google | PhD from Georgia Tech. @andrewsilva9.bsky.social

Katılım Eylül 2010
261 Takip Edilen472 Takipçiler
Taylor W. Killian
Taylor W. Killian@tw_killian·
📣 There's never a "best" time to share important updates, especially after sitting on this for so long... I'm joining the faculty @BYU + @BYUCS this Summer as an Assistant Professor in preparation for the upcoming school year. Lots of excitement and a fair bit of nerves. 🧵
Taylor W. Killian tweet media
English
44
11
169
17.3K
Andrew Silva retweetledi
Amir Joudaki
Amir Joudaki@AmirJoudaki·
Neural nets don’t just forget. Sometimes, after long training, they lose the ability to learn at all. In our #ICLR2026 poster, we model Loss of Plasticity as gradient dynamics trapped in invariant manifolds: 🔴 frozen units, 🔵 cloned units. The video makes the traps visible.
English
16
52
611
100.3K
Andrew Silva retweetledi
Yizhe Zhang
Yizhe Zhang@YizheZhangNLP·
1/6 The "Self-Improvement" Paradox Can an LLM get smarter using only its own raw, unverified outputs? No verifiers. No teachers. No RL. We found the answer is an emphatic YES. Introducing SimpleSD: Embarrassingly Simple Self-Distillation. By simply sampling solutions from a model with specific temperature and truncation settings and then fine tuning the model on those exact samples, Qwen3-30B jumped from 42.4% to 55.3% (30% improvement) on LiveCodeBench v6 just by training on its own samples! 🚀 The gain is universal across different model sizes (4B, 8B, 30B) and model families (Llama, Qwen). The harder the problem is, the larger the gain. 📈 Kudos to my amazing colleagues @onloglogn, @richard_baihe, @UnderGroundJeg, Navdeep Jaitly, @trebolloc.  Check out the paper and code below! 👇 paper: arxiv.org/abs/2604.01193 code: github.com/apple/ml-ssd HF models: huggingface.co/collections/ap…
Yizhe Zhang tweet media
English
8
29
196
17.3K
Andrew Silva retweetledi
Michael Kirchhof
Michael Kirchhof@mkirchhof_·
New paper 🥳 RL relies a lot on an agent’s capability to explore. Our strategy-guided exploration makes the agent find new solutions more efficiently. It learns faster, and in some environments its Pass@1 surpasses the base model’s Pass@128. 🧵1/6 📄 arxiv.org/abs/2603.02045
Michael Kirchhof tweet media
English
3
13
67
4.4K
Andrew Silva retweetledi
Jason Ramapuram
Jason Ramapuram@jramapuram·
Autoregressive models dominate, but what if we treat multimodal generation as discrete order agnostic iterative refinement? Excited to share our systematic study on the design space of Tri-Modal Masked Diffusion Models (MDMs). We pre-trained the first Tri-Modal MDM from scratch on (text,), (image, text), and (audio, text). The same model can do ASR, TTS, T2I, captioning and native text generation. What I'm the most proud of in this work is the scientific rigor. Over 3,500 training runs. Principled hyperparameter transfer. Honest results. Carefully controlled ablations across multiple different axis of entanglement. A thread on our empirical findings (arXiV: arxiv.org/abs/2602.21472)
Jason Ramapuram tweet media
English
6
43
238
40K
Andrew Silva retweetledi
Eran Malach
Eran Malach@EranMalach·
SSMs promised efficient language modeling for long context, but so far seem to underperform compared to Transformers in many settings. Our new work suggests that this is not a problem with SSMs, but with how we are currently using them. Arxiv: arxiv.org/pdf/2510.14826 🧵
Eran Malach tweet media
English
6
84
419
115.3K
Andrew Silva retweetledi
Yuyang Wang
Yuyang Wang@YuyangW95·
New preprint & open-source! 🚨 “SimpleFold: Folding Proteins is Simpler than You Think” (arxiv.org/abs/2509.18480). We ask: Do protein folding models really need expensive and domain-specific modules like pair representation? We build SimpleFold, a 3B scalable folding model solely built on general-purpose transformers + flow matching, and is trained on 9M structures. SimpleFold supports easy deployment and efficient inference on consumer-level hardware with PyTorch/MLX (try it on your MacBook!) (1/n)
Yuyang Wang tweet media
English
12
87
354
105.1K
Andrew Silva
Andrew Silva@andrewsilva9·
Today marks my first day (back) at Apple's MLR group, where I am starting as a research scientist exploring personalization and LLMs! I'm incredibly grateful for my time with TRI (and I can't wait to share that work once it's ready!), and so excited to get started at Apple!
English
3
0
28
3.2K
Andrew Silva
Andrew Silva@andrewsilva9·
@yoavgo I recently had one there for a month and a half maybe? I just waited and it cleared eventually…
English
1
0
1
265
(((ل()(ل() 'yoav))))👾
did you ever got a paper "stuck" in the arxiv publication queue? how did you resolve it?
English
4
0
5
3.1K
Andrew Silva retweetledi
Rin Metcalf Susa
Rin Metcalf Susa@RinMetcalfSusa·
📣 We are excited to present our work on inferring user preferences from writing samples at @icmlconf Poster Session 3 (Wed. 11:00AM - 1:30PM)! Come by to ✋ chat with us, 📄 learn about our method, and 💻 hear about our new interactive benchmark (🔗s below)!
English
1
3
7
490
Amir Eskandari
Amir Eskandari@Amireskndri·
@pals_nlp_wrkshp @emnlpmeeting Hi, I noticed that the deadline is listed as July 18 on the website (#important-dates" target="_blank" rel="nofollow noopener">pals-nlp-workshop.github.io/#important-dat…), but this post mentions August 1. Could you please clarify which one is the correct deadline?
English
1
0
0
67
Andrew Silva retweetledi
PALS NLP Workshop
PALS NLP Workshop@pals_nlp_wrkshp·
Join us at @emnlpmeeting for: "Tailoring AI: Exploring Active and Passive LLM Personalization" 🎯🧠 To answer, when should LLMs personalize? What role do users play in LLM-personalization? 📅 Deadline Aug. 1 📝 Details in thread 🧵👇 #EMNLP2025 #LLM #AI #personalization 1/5
English
2
18
20
10.4K
Andrew Silva
Andrew Silva@andrewsilva9·
@priontific Haha yes that is me! I have MLX_LM + PPO here github.com/andrew-silva/m…, but unfortunately I did not document it _super_ well (much of the documentation on how to run stuff is in the docstrings at the top of each file!). I haven't tried to reimplement GRPO yet though!
English
1
0
1
44
Priontific
Priontific@priontific·
@andrewsilva9 - any chance you’re the andrewsilva who made the MLX PPO cartpole example? I’m determined to get GRPO (or even just PPO) working in MLX_lm and I’ll be giving it a crack all week, but it’s already clear I’ll be needing help 😅
English
1
0
3
83
Priontific
Priontific@priontific·
Turns out that to get this to work, I'm gonna have to reimplement @huggingface 's GRPO trainer from the trl library into MLX... which I don't think I'll be able to do, even with Sonnet's help 😅 But I've put @cursor_ai into agentic mode and I'm gonna see what it can do lol
Priontific tweet media
English
3
0
7
558
Andrew Silva retweetledi
James Smith
James Smith@jamessealesmith·
Today is my last day at Samsung Research America. I’m so grateful for the talented colleagues, exciting projects, and incredible mentorship from Yen-Chang Hsu that shaped the start of my career. Thank you, SRA, for this rewarding chapter—excited for what’s next!
James Smith tweet media
English
0
1
25
409
Andrew Silva
Andrew Silva@andrewsilva9·
@lulumeservey "Faithless is he that says farewell when the road darkens" - Gimli
English
0
0
2
56
Lulu Cheng Meservey
Lulu Cheng Meservey@lulumeservey·
Happy birthday to the immortal Professor Tolkien — may the hair on his toes never fall out! In annual tribute to one of the greatest geniuses of all time, here’s a collection of wisdom from his characters…
English
16
47
518
101.9K
Andrew Silva
Andrew Silva@andrewsilva9·
@yoavgo I’ve noticed it also turns me into an integration engineer, spending a ton of time writing the bits of connector code between different AI generated modules. Which is also very unfun.
English
0
0
0
107
(((ل()(ل() 'yoav))))👾
ai co-coding will lead to the exact opposite: the ai will get the rewards from successfully solving all the microtasks, and we will only see the failures.
English
11
0
114
3.7K
(((ل()(ل() 'yoav))))👾
my prediction is that "ai improvements" wont replace programmers, but will make the job waaay less fun. i'll elaborate:
English
17
10
237
27.7K