Pradeep Dasigi (@pdasigi) - Twitter Profili | Zamantika Mersobahis Locabet

@HannaHajishirzi Working with you at Ai2 was a wonderful learning experience. Thank you for your leadership on so many impactful projects!

English

0

2

346

Hanna Hajishirzi@HannaHajishirzi·24 Mar

Life update here: Last week marked the end of my time at Ai2. Proud to have built releases like Olmo, Tülu, FlexOlmo, DRTulu, OLMoTrace, OlmoE, and datasets including Dolma and Dolci—and of how strongly we pushed for open models and open science. Our artifacts reached 33M+ downloads, including ~4M for Olmo 3. I believe Olmo has empowered researchers to push the boundaries of AI I’ll always be cheering on Ai2 and will continue to strongly support open-source, open-science AI. I’m deeply grateful for this chapter and excited for what comes next.

English

40

25

548

56.9K

Pradeep Dasigi retweetledi

Faeze Brahman@faeze_brh·16 Mar

Checkout our new meta-RL method to train strong agents! 💡instead of making new attempts from scratch, we design a simple Self-Reflection paradigm for agents to learn from their own previous experiences!

Teng Xiao@TengX6

🚀 New work: Meta-Reinforcement Learning with Self-Reflection LLM agents shouldn't just solve problems. They should learn from their own attempts. Most current RL methods optimize single independent trajectories. Each attempt starts from scratch, with no mechanism to improve across attempts. But intelligent systems should get better after trying once. This raises a fundamental question: How do we train models to learn from their own attempts? We believe Meta-Reinforcement Learning may be a key paradigm for training future LLM agents, enabling models to adapt and improve across attempts and environments. In this work we introduce MR-Search, a training paradigm built around: 🧠 In-Context Meta-Reinforcement Learning 🪞 Self-Reflection 🔁 Learning to learn at test time 📄 Paper: arxiv.org/abs/2603.11327 💻 Code: github.com/tengxiao1/MR-S…

English

0

3

33

3.7K

Pradeep Dasigi@pdasigi·16 Mar

Can we train agents to make multiple attempts at solving tasks and learn from previous attempts? New work led by @TengX6 and @1t4chiii shows a simple MetaRL approach, where the agent reflects upon its prior episodes, can do this.

Teng Xiao@TengX6

🚀 New work: Meta-Reinforcement Learning with Self-Reflection LLM agents shouldn't just solve problems. They should learn from their own attempts. Most current RL methods optimize single independent trajectories. Each attempt starts from scratch, with no mechanism to improve across attempts. But intelligent systems should get better after trying once. This raises a fundamental question: How do we train models to learn from their own attempts? We believe Meta-Reinforcement Learning may be a key paradigm for training future LLM agents, enabling models to adapt and improve across attempts and environments. In this work we introduce MR-Search, a training paradigm built around: 🧠 In-Context Meta-Reinforcement Learning 🪞 Self-Reflection 🔁 Learning to learn at test time 📄 Paper: arxiv.org/abs/2603.11327 💻 Code: github.com/tengxiao1/MR-S…

English

0

3

583

Pradeep Dasigi retweetledi

Nathan Lambert@natolambert·5 Mar

Excited to share the latest Olmo model: Olmo Hybrid. This is a model with gated delta net (GDN) layers in a 3:1 ratio with full attention. It follows lots of other developments like Qwen 3.5 and Kimi Linear. It's incredible timing to release a fully open model so people can study how these architecture changes impact the full stack. Personally, I learned a lot in making the post-training work. Even with the data being identical for pretraining, post-training is very different! In particular, the OSS tools for these new architectures is really limited. New architectures are much slower than standard transformers or popular models like DeepSeek MoEs. This is work that we can do together to keep pushing the frontier of efficient, open models. This work was led by @lambdaviking @tyleraromero and others. I got to play a smaller part in making post-training work, super fun project! I've written up a blog post that explains why this matters and hybrid models didn't work a few years ago when Mamba was super popular. Plus, this paper is a great entry point for modern deep learning / language modeling scaling theory. Enjoy and send feedback!

English

18

72

497

76.3K

Pradeep Dasigi retweetledi

Ai2@allen_ai·5 Mar

Introducing Olmo Hybrid, a 7B fully open model combining transformer and linear RNN layers. It decisively outperforms Olmo 3 7B across evals, w/ new theory & scaling experiments explaining why. 🧵

English

17

127

785

169.2K

Pradeep Dasigi@pdasigi·18 Şub

@sivareddyg Congratulations Siva!!

English

1

0

1

65

Siva Reddy@sivareddyg·18 Şub

Honored to be a Sloan Fellow. So grateful to my wonderful students, mentors, colleagues, friends and family, thank you! ❤️

Sloan Foundation@SloanFoundation

Congrats to the 126 early-career scholars awarded a 2026 Sloan Research Fellowship, whose creativity and innovation set them apart as the next generation of scientific leaders! Our Fellows represent 7 fields and 44 institutions across the US and Canada. sloan.org/fellowships/20…

English

32

14

111

13.4K

Pradeep Dasigi@pdasigi·10 Şub

Check out the super cool web interface of DR Tulu!

Shannon Shen@shannonzshen

Super excited to share our open interactive demo for DR Tulu-8B! It supports web and literature search with full transparency — you can see the model's thinking traces and tool outputs as it reasons through your query. 🔗 dr-tulu.org 📝 arxiv.org/abs/2511.19399

English

1

0

4

378

Pradeep Dasigi@pdasigi·6 Şub

@AkariAsai This is wonderful! Congratulations to you and the team!

English

0

124

Akari Asai@AkariAsai·4 Şub

Thrilled to share: OpenScholar - our work on scientific deep research agents for reliable literature synthesis -has been accepted to Nature! 🎉 Huge thanks to collaborators across institutions who made this possible!

English

33

227

1.3K

126.1K

Pradeep Dasigi retweetledi

Ai2@allen_ai·27 Oca

Introducing Ai2 Open Coding Agents—starting with SERA, our first-ever coding models. Fast, accessible agents (8B–32B) that adapt to any repo, including private codebases. Train a powerful specialized agent for as little as ~$400, & it works with Claude Code out of the box. 🧵

English

42

141

929

348K

Pradeep Dasigi@pdasigi·21 Oca

@saurabh_shah2 That's great! Congratulations Saurabh!

English

1

0

2

78

Saurabh Shah@saurabh_shah2·20 Oca

I’ve joined humans&! My last blog post explains why I think a human-centric approach is the missing piece in modern AI systems. I’m super psyched about the technical direction of the company. Perhaps even more important, though, is the team; the humans at humans&. My coworkers are completely and wholly wonderful. They’re brilliant, yes, but they’re also kind, funny, focused, and just about every other good adjective I can think of. Put simply: vibes are goooood. We’re bringing together wonderful people united by a much-needed mission to build something truly different. If that excites you, I’d love to chat.

humans&@humansand

Today we introduce humans&, a human-centric frontier AI lab. We believe AI can be reimagined, centering around people and their relationships with each other. At its best, AI should serve as a deeper connective tissue that strengthens organizations and communities

English

40

8

231

40.7K

Pradeep Dasigi@pdasigi·18 Oca

I have been trying out Telugu queries on the Indic LLM Arena over the last few days and most of the responses are surprisingly bad, with lots of hallucinations and sometimes even grammatical errors, even from strong (in English) models. Clearly there is a huge gap between English and Indian language capabilities, and evaluating this is very important. Do contribute if you care about making LLMs work for Indian languages.

AI4Bharat@ai4bharat

For AI to be truly inclusive, it must understand more than just grammar—it must understand context. @AI4Bharat at @iitmadras had launched the Indic LLM Arena. This isn't just another leaderboard; it’s a public utility for: ✅ Developers: Test your models against real-world Indian use cases. ✅ Enterprises: Find out which LLM actually resonates with your customers in rural India. ✅ Sovereignty: Building AI that respects our social fabric and safety norms. Be a part of this movement. Try the Arena today and help us rank the models that will power India's digital future. 👉 ai4bharat.iitm.ac.in/blog/indic-llm… #GenerativeAI #DigitalIndia #IITMadras #IndicLLM #indiaaiimpactsummit2026 @MiteshKhapra @anoopk @prajdabre @ravi_iitm @partha_p_t @ManishGuptaMG1 @meghtweets @dineshteewari1 @abapna @WSAI_IITM @OfficialINDIAai @EkStep_Org @PeoplePlusAI

English

1

30

3.4K

Pradeep Dasigi retweetledi

Wenting Zhao@wzhao_nlp·17 Oca

🌶️ Some (perhaps) spicy thoughts. It’s been a while since my last tweet, but I wanted to write about how disorienting it has been from academia to an LLM lab 😅 The kind of research I was trained to do during my PhD almost doesn’t exist here. The obsession with mathematical elegance and novelty is mostly gone. Everything is about scaling data and compute. For a while, that really got to me. At my lowest point, I felt like I’d lost interest in building LLMs altogether. I didn’t feel intellectually challenged anymore. What made this even stranger was that, at a technical level, things worked. If there was a capability I wanted to teach a model, scaling the right data and compute always got me there, no exception (so far). But recently, I found a way to reconcile with myself.. I realized the real competition isn’t in the ML recipe anymore. Most teams do roughly the same thing. What actually matters is how fast you can iterate, test ideas, and recover from mistakes. And that speed is mostly backed by infrastructure 🏗️ Faster loops, fewer bugs, better tooling. Seeing this made me excited again! Infra is its own deep, hard, and intellectually fun problem space. In 2026, I want to become an ML researcher who’s really good at infra. And I'll come back to ML problems with that edge, and will be excited to share what I find 😌

English

63

114

1.9K

201.5K

Pradeep Dasigi retweetledi

Ai2@allen_ai·16 Oca

SciArena update: our Olmo 3.1 32B Instruct scores 963.6 Elo overall at just $0.17/100 calls—ahead of OpenAI’s GPT-OSS-20B. In Engineering, it hits 1039.2 Elo, only 2.5 behind GPT-OSS-120B—a model ~4× its size. 🧵

English

1

3

14

1.9K

Pradeep Dasigi retweetledi

Partha Talukdar@partha_p_t·14 Oca

Indic LLM Arena needs you! 🇮🇳 Try out which LLM works best for your Indic language queries and vote for the winner! arena.ai4bharat.org

AI4Bharat@ai4bharat

For AI to be truly inclusive, it must understand more than just grammar—it must understand context. @AI4Bharat at @iitmadras had launched the Indic LLM Arena. This isn't just another leaderboard; it’s a public utility for: ✅ Developers: Test your models against real-world Indian use cases. ✅ Enterprises: Find out which LLM actually resonates with your customers in rural India. ✅ Sovereignty: Building AI that respects our social fabric and safety norms. Be a part of this movement. Try the Arena today and help us rank the models that will power India's digital future. 👉 ai4bharat.iitm.ac.in/blog/indic-llm… #GenerativeAI #DigitalIndia #IITMadras #IndicLLM #indiaaiimpactsummit2026 @MiteshKhapra @anoopk @prajdabre @ravi_iitm @partha_p_t @ManishGuptaMG1 @meghtweets @dineshteewari1 @abapna @WSAI_IITM @OfficialINDIAai @EkStep_Org @PeoplePlusAI

English

1

4

23

1.6K

Pradeep Dasigi retweetledi

Ai2@allen_ai·8 Oca

Olmo 3.1 32B Instruct is now on @openrouter, hosted by @DeepInfra. Built for real-world use: reliable instruction following & function calling for agentic workflows + research. Fully open & leading benchmark performance, ready to plug into your stack. 👇

English

3

4

33

8.3K

Pradeep Dasigi retweetledi

DeepInfra@DeepInfra·6 Oca

Now hosting @allen_ai Olmo-3.1-32B-Instruct on DeepInfra. Designed for solid reasoning and clean instruction following - great for research workflows. $0.20 in / $0.60 out per Mtoken

English

2

6

579

Pradeep Dasigi retweetledi

Ai2@allen_ai·18 Ara

Now you can use our most powerful models via API. Olmo 3.1 32B Think, our reasoning model for complex problems, is on @openrouter—free through 12/22. And Olmo 3.1 32B Instruct, our flagship chat model with tool use, is available through @huggingface Inference Providers. 👇

English

5

10

118

16.8K

Pradeep Dasigi retweetledi

Kyle Lo@kylelostat·17 Ara

olmo 3 paper finally on arxiv 🫡 thx to our teammates esp folks who chased additional baselines thx to arxiv-latex-cleaner and overleaf feature for chasing latex bugs thx for all the helpful discussions after our Nov release, best part of open science is progressing together!

English

12

99

467

55.2K

Pradeep Dasigi retweetledi

👋 Jan@jandotai·16 Ara

You can run Olmo 3.1 on Jan. Search for the model name on Jan Hub to get started 💜

Ai2@allen_ai

Olmo 3.1 is here. We extended our strongest RL run and scaled our instruct recipe to 32B—releasing Olmo 3.1 Think 32B & Olmo 3.1 Instruct 32B, our most capable models yet. 🧵

English

1

8

18

4.2K

Pradeep Dasigi retweetledi

Kyle Lo@kylelostat·12 Ara

lol so during neurips, we kept the RL run going and the model kept getting better 😂 Olmo 3.1 is a.. 🐡 32B Thinking, still best fully-open model to-date 🐠 32B Instruct, for ppl who hate long yapping, as good as qwen3 we added like 10 more pages to the paper too! thx for community feedback from convos at neurips: 🐟 more on our eval ideology 🦈 more baselines 🍣 more about RL Zero etc we picked final model (internally called moonlit surfer 🌛🏄) not just on bench scores but good vibes 🥰

Ai2@allen_ai

Olmo 3.1 is here. We extended our strongest RL run and scaled our instruct recipe to 32B—releasing Olmo 3.1 Think 32B & Olmo 3.1 Instruct 32B, our most capable models yet. 🧵

English

2

26

146

19.8K

Pradeep Dasigi

Keşfet