Sanjay Adhikesaven

19 posts

Sanjay Adhikesaven banner
Sanjay Adhikesaven

Sanjay Adhikesaven

@sadhikesaven

eecs @ucberkeleymet | research @berkeleynlp @allen_ai

Berkeley, CA Katılım Nisan 2023
149 Takip Edilen102 Takipçiler
Sabitlenmiş Tweet
Sanjay Adhikesaven
Sanjay Adhikesaven@sadhikesaven·
Imagine you fully post-trained "YourModel v1". Then, you've got better data — math, code, tool use, safety — and you want to improve it. Today, that usually means retraining the whole model. But what if new data could be added modularly, with a fixed cost each time?
Sanjay Adhikesaven tweet media
Ai2@allen_ai

Last year, we introduced FlexOlmo, a novel way to train parts of a model independently then combine them later. BAR builds on that idea for a harder problem: how to keep improving a model without having to retrain each time. 🧵

English
5
18
140
19.4K
Sanjay Adhikesaven retweetledi
Raj Patel
Raj Patel@babugi28·
Today, Human Archive is announcing our $8.2M seed round to model human embodied intelligence. Despite decades of research, we still barely understand ourselves. Our goal is to learn how humans interact with the world, and over the past 6 months, our team’s made enormous progress toward that alongside leading AI labs. learn more @TechCrunch techcrunch.com/2026/05/26/hum…
English
51
23
225
60.5K
Sanjay Adhikesaven retweetledi
Raj Patel
Raj Patel@babugi28·
Human Archive (YC W26) is a research lab modeling human embodied intelligence, and we’re hiring globally across 10 roles in hardware, software, and operations. We build cameras and sensors, deploy them globally at scale, and train models to better understand how humans interact with the physical world. Our goal is to replace manual labor, increase global abundance, shift human effort toward creativity and exploration, and advance how we understand the brain, human cognition, prosthetics, and rehabilitation. By joining now, you’ll join the founding team and work on the most important data project in human history. Software: Machine Learning Engineer (SF) Research Engineer (SF) Hardware: Head of Hardware Engineering (SF + China) Embedded / Electrical Engineer (China) Firmware Engineer (China) Mechanical Engineer (China) Head of Operations (China) Operations: Infrastructure Engineer (India) Software Engineer (India) Operations (Globally) Apply here: jobs.ashbyhq.com/humanarchive
English
12
15
177
19K
Sanjay Adhikesaven retweetledi
Ryan Yixiang Wang
Ryan Yixiang Wang@RyanYixiang·
MoEs are everywhere in frontier models, and they are deployed as a monolith system. But many applications only need a narrow slice of capabilities, e.g., math, code, biomedical, etc. So what if "modularity" is actually the missing opportunity for MoEs? Today, we're releasing EMO: an end-to-end pretrained MoE where modularity emerges naturally, enabling selective use of experts!
Ryan Yixiang Wang tweet media
Ai2@allen_ai

Today we’re releasing EMO, a new mixture-of-experts (MoE) model trained so modular structure emerges directly from data without human-defined priors. EMO can use a small subset of its experts for a given task while keeping near full-model performance. 🧵

English
7
73
532
113K
Sanjay Adhikesaven retweetledi
Jacob Morrison
Jacob Morrison@jacobcares·
How do you add new capabilities to a fully post-trained language model, without retraining from scratch, or losing what it already knows? We're excited to introduce Branch-Adapt-Route (BAR): train independent experts, merge them into an MoE, and upgrade them as needed.
Jacob Morrison tweet media
Ai2@allen_ai

Last year, we introduced FlexOlmo, a novel way to train parts of a model independently then combine them later. BAR builds on that idea for a harder problem: how to keep improving a model without having to retrain each time. 🧵

English
4
31
275
37.6K
Sanjay Adhikesaven
Sanjay Adhikesaven@sadhikesaven·
More broadly, BAR suggests a new way to build and improve upon LMs: not one monolithic pipeline that must be re-run for every update, but a modular system where experts can be trained, added, and upgraded independently.
English
1
0
0
336
Sanjay Adhikesaven
Sanjay Adhikesaven@sadhikesaven·
Imagine you fully post-trained "YourModel v1". Then, you've got better data — math, code, tool use, safety — and you want to improve it. Today, that usually means retraining the whole model. But what if new data could be added modularly, with a fixed cost each time?
Sanjay Adhikesaven tweet media
Ai2@allen_ai

Last year, we introduced FlexOlmo, a novel way to train parts of a model independently then combine them later. BAR builds on that idea for a harder problem: how to keep improving a model without having to retrain each time. 🧵

English
5
18
140
19.4K
Sanjay Adhikesaven retweetledi
Raj Patel
Raj Patel@babugi28·
My mom just sent me this video of my co-founder @shloke_patel and me pitching our mango business when we were 13. Now, in 50 days, we’ll be pitching at @ycombinator's Demo Day in SF. None of this would've been possible without our rockstar parents. Order some mangoes this summer at mangounited.com
Raj Patel tweet media
Raj Patel@babugi28

My co-founder @shloke_patel and I have worked on every major project of our lives together. In high school, we sold 16,000 mangoes, planted 100,000 trees, and started a microbiology startup. He went to @Stanford, I went to @UCBerkeley, then we both dropped out to do @ycombinator and build Human Archive where we’re collecting and labeling aligned multimodal robotics datasets at scale. It’s been 20 years of him pissing me off and I genuinely don’t know how much longer I can take ts someone please find him a girl 😂😂

English
40
71
939
215.7K
Deedy
Deedy@deedydas·
The best free class on the internet is Poker Theory at MIT. It's taught by portfolio managers at quant shops like Citadel, AQR, SIG and 2 WSOP bracelet winners. You play 5000 hands as practice and learn the math of making money from pros. And it's free.
Deedy tweet media
English
123
1.3K
20.5K
2.1M