Ravi Theja

1.4K posts

@ravithejads

Applied AI @MistralAI Previously - @llama_index (LlamaIndex) Focused on LLMs, Agents, RAG, and fine-tuning LLMs.

SF · Joined December 2010
835 Following · 6.1K Followers
Pinned Tweet
Ravi Theja
Ravi Theja@ravithejads·
🔥 Releasing BRAG: High-Performance RAG Models Trained for $25 🚀
We’re thrilled to announce the launch of our RAG models, collectively known as BRAG, a series of SLMs (3 SLMs and 1 Ultra SLM) specifically trained for Retrieval Augmented Generation (RAG).
1️⃣ BRAG-Qwen2-7b-v0.1
2️⃣ BRAG-Llama-3.1-8b-v0.1
3️⃣ BRAG-Llama-3-8b-v0.1
4️⃣ BRAG-Qwen2-1.5b-v0.1
🌟 Our models outperform Cohere’s Command R+, Qwen2, Llama3.1, and Llama3 Instruct models and closely match the performance of GPT-4-Turbo and Nvidia’s ChatQA-1.5-8B on ChatRAG-Bench.
💵 Each model was trained for less than $25.
🔍 More interesting details on datasets and training procedure in our release technical report: shorturl.at/IrCE1
🙌 A huge thank you to @HamelHusain, @dan_s_becker, and @charles_irl for the credits on @modal_labs via the LLM Fine-tuning course.
This work was done in collaboration with @nlpguy_ under maximalists.ai
English
18
82
440
65.8K
Ravi Theja retweeted
Pratyush Choudhury (PC)
For a while, @aakrit & I've quietly done this at Activate: spend time with exceptional operators, CTOs & senior builders before there’s a deck, company, or fully formed idea.
Now opening it up.
Mixture of Experts - a small private dinner for people at the edge of founding.
First edition tomorrow in Bengaluru.
Request below on the Luma to join or DM if someone exceptional should be in the room. luma.com/1vmm7hq9
English
6
4
92
18.4K
Ravi Theja retweeted
tokenbender
tokenbender@tokenbender·
Ever wondered if you could extract capabilities and behaviors from neural networks and reuse, update, or route them as needed? We introduce low-rank circuit conditioning, a novel approach that preserves the model's output behavior while reshaping how an existing capability is represented. In the base model, standard compact recovery stalls at 29%. After conditioning, the same extraction pipeline reaches 91.33% autoregressive full-answer recovery from 5.05% of MLP channels. The evidence points to the possibility of extracting and using isolated capabilities, saving cost and latency while keeping high adaptability. Read our work to understand more - tokenbender.com/posts/honey-i-…
English
26
67
395
41.8K
Ravi Theja retweeted
Pratyush Choudhury (PC)
India’s AI future is being built by those who ship, not just study.
Today I’m announcing Activate Fellows - a summer program for 15 of India’s (and the world’s) best student builders to work inside the country’s leading AI startups.
Only 15 spots. And 8 days left to apply.
Details + link 🧵
English
9
20
181
14.6K
Chayenne Zhao
Chayenne Zhao@GenAI_is_real·
SGLang grew from an open-source inference project into RadixArk this year. Today we officially announce our $100M Seed, led by Accel and co-led by Spark Capital. The numbers are in the announcement (25K+ stars, 400K+ GPUs in deployment). I want to say something else. Frontier training and inference stacks have long been internal property of a few companies. Every AI team starting today has to rewrite scheduling, KV cache management, batched decoding, RL rollout — work that should already be commodity by now. SGLang covers the inference side. Miles covers large-scale RL and post-training. Together, the goal is letting a ten-person startup run training and serving at the same level as a frontier lab. Doing both as open source and production-grade — that's the entire reason I joined RadixArk. Thanks to Accel, Spark Capital, NVentures, Salience, HOF, Walden, AMD, MediaTek, LDV, Sky9 and the other institutional investors. Thanks to Igor Babuschkin, Lip-Bu Tan, Hock Tan, John Schulman, Soumith Chintala, Lilian Weng, William Fedus, Robert Nishihara, Logan Kilpatrick, Hao Zhang and the other angels. And thanks to every contributor in the SGLang community — your PRs and issues are what actually holds this up. What's left to build is much bigger than what we've already shipped.
RadixArk@radixark

Today, we are thrilled to officially launch RadixArk with $100M in Seed funding at a $400M valuation. The round was led by @Accel and co-led by @sparkcapital. RadixArk exists to make frontier AI infrastructure open and accessible to everyone. Today, the systems behind the most capable AI models are concentrated in a small number of companies. As a result, most AI teams are forced to rebuild training and inference stacks from scratch, duplicating the same infrastructure work instead of focusing on new models, products, and ideas. RadixArk was founded to change that. We are building an AI platform that makes it easier for teams to train and serve the best models at scale. RadixArk comes from the open-source community. We started with SGLang, where many of us are core developers and maintainers, and expanded our work to Miles for large-scale RL and post-training. We will continue contributing to both projects and working with the community to make them the strongest open-source infrastructure foundations for frontier AI. We would like to thank our long-term partners, contributors, and the broader SGLang community for believing in this mission. We're also grateful to @Accel and @sparkcapital, NVentures (Venture capital arm of @nvidia), Salience Capital, A&E Investment, @HOFCapital, @walden_catalyst, @AMD, LDVP, WTT Fubon Family, @MediaTek, Vocal Ventures, @Sky9Capital and our angel investors @ibab, @LipBuTan1, Hock Tan, @johnschulman2, @soumithchintala, @lilianweng, @oliveur, @Thom_Wolf, @LiamFedus, @robertnishihara, @ericzelikman, @OfficialLoganK, and @multiply_matrix among others. Thanks for the exclusive interview with @MeghanBobrowsky at @WSJ about our vision.

English
8
12
131
10.5K
Ravi Theja retweeted
Kaggle
Kaggle@kaggle·
ParseBench is now live on Kaggle Benchmarks! 🚀 Developed by @llama_index, this benchmark evaluates PDF-to-structured-data conversion, featuring ~2k human-verified pages from real enterprise docs across 5 capability dimensions. 🥇Gemini 3 Flash: 79.3% 🥈GPT 5.4: 72.9% 🥉Gemma 4 31B: 66.4%
English
5
20
118
15.4K
Ravi Theja retweeted
Jerry Liu
Jerry Liu@jerryjliu0·
We’re open sourcing the first document OCR benchmark for the agentic era, ParseBench.
Document parsing is the foundation of every AI agent that works with real-world files. ParseBench is a benchmark that measures parsing quality specifically for agent knowledge work:
✅ It optimizes for semantic correctness (instead of exact similarity)
✅ It has the most comprehensive distribution of real-world enterprise documents
It contains ~2,000 human-verified enterprise document pages with 167,000+ test rules across five dimensions that matter most: tables, charts, content faithfulness, semantic formatting, and visual grounding.
We benchmarked 14 known document parsers on ParseBench, from frontier/OSS VLMs to specialized parsers to LlamaParse. Here are some of our findings:
💡 Increasing compute budget yields diminishing returns - Gemini/gpt-5-mini/haiku gain 3-5 points from minimal to high thinking, at 4x the cost.
💡 Charts are the most polarizing dimension for evaluation. Most specialized parsers score below 6%, while some VLM-based parsers do a bit better.
💡 VLMs are great at visual understanding but terrible at layout extraction. GPT-5-mini/haiku score below 10% on our visual grounding task, all specialized parsers do much better.
💡 No method crushes all 5 dimensions at once, but LlamaParse achieves the highest overall score at 84.9%, and is the leader in 4 out of the 5 dimensions.
This is by far the deepest technical work that we’ve published as a company. I would encourage you to start with our blog and explore our links to Hugging Face to GitHub. All the details are in our full 35-page (!!) ArXiv whitepaper.
🌐 Blog: llamaindex.ai/blog/parsebenc…
📄 Paper: arxiv.org/abs/2604.08538…
💻 Code: github.com/run-llama/Pars…
📊 Dataset: huggingface.co/datasets/llama…
🎥 YouTube: youtube.com/watch?v=g5p7G-…
English
31
81
526
107.2K
Ravi Theja retweeted
LlamaIndex 🦙
LlamaIndex 🦙@llama_index·
LlamaIndex is proud to be named to the 2026 Enterprise Tech 30, #3 in the Early Stage category. The ET30 is an annual list by @Wing_VC and Eric Newcomer, voted on by 90+ leading investors and corporate development leaders. It recognizes the private companies with the most potential to shape the future of enterprise technology. Thank you to Wing Venture Capital and Eric Newcomer, and congratulations to all the companies honored this year.
English
3
7
24
5.8K
Ravi Theja retweeted
Boris Cherny
Boris Cherny@bcherny·
I wanted to share a bunch of my favorite hidden and under-utilized features in Claude Code. I'll focus on the ones I use the most. Here goes.
English
553
2.5K
23.2K
3.9M
Ravi Theja retweeted
Mistral AI
Mistral AI@MistralAI·
Mistral AI is headed to San Jose for @NVIDIA GTC! 🚀 We’re demoing our newest frontier models, sharing our vision for the future of enterprise AI, and unveiling some big news you won't want to miss. 📍 Visit us to see the latest innovations in action. 🗓️ Check out our sessions and book a meeting: link in 🧵
English
11
27
217
21K
Ravi Theja retweeted
Pratyush Kumar
Pratyush Kumar@pratykumar·
📢 Open-sourcing the Sarvam 30B and 105B models! Trained from scratch with all data, model research and inference optimisation done in-house, these models punch above their weight in most global benchmarks plus excel in Indian languages. Get the weights at Hugging Face and AIKosh. Thanks to the good folks at SGLang for day 0 support, vLLM support coming soon. Links, benchmark scores, examples, and more in our blog - sarvam.ai/blogs/sarvam-3…
English
206
1.3K
6.8K
743.9K
Ravi Theja retweeted
Ramsri Goutham Golla
Ramsri Goutham Golla@ramsri_goutham·
Navarasa in Deepmind's Gemmaverse 🚀 The work @ravithejads and I did in building Navarasa, an Indic instruction-finetuned model on top of Google's Gemma catering to 15 Indian languages, has been featured in Deepmind's Gemmaverse!
English
5
2
22
1.5K
Ravi Theja retweeted
Tanishq Kumar
Tanishq Kumar@tanishqkumar07·
I've been working on a new LLM inference algorithm. It's called Speculative Speculative Decoding (SSD) and it's up to 2x faster than the strongest inference engines in the world. Collab w/ @tri_dao @avnermay. Details in thread.
English
135
454
4.1K
610.3K
Ravi Theja retweeted
Pratyush Kumar
Pratyush Kumar@pratykumar·
Drop 13/14: The 30B and 105B models, benchmarks, and HF links will all come. But today it is a drop about people. About how our team of just 15 folks gave it their all to do what many doubted was doable, i.e. train usefully large, globally competitive models from scratch in India. This team of 15 has now firmly launched @sarvam into its second innings. Yes, we can! @_mohit_singla @anand_404 @kediaharshit9 @AashaySachdeva @sumanthd17 @ArpitDwivedi100 @HarveenChadha @rkal4 @sushil_khyalia @ManavSinghal157 @sohampetkar Missing from the picture - @selfawareatom @AnnaUpreti Anand @MeghMakwan33973 Utkarsh
English
198
730
5.3K
309.9K
Ravi Theja retweeted
Pratyush Kumar
Pratyush Kumar@pratykumar·
Drop 12/14: Models, products, impact - today something different, very different. Launching Sarvam Kaze, our foray into getting our models into your hands with our devices - designed and built here in India!
English
128
561
3.3K
348.4K
Ravi Theja retweeted
Pratyush Kumar
Pratyush Kumar@pratykumar·
Drop 9/14: Today we are introducing Sarvam Studio, our product to help creators go multilingual. One piece of content, every corner of India. With AI video dubbing, Studio generates high-fidelity dubs in 11 Indian languages. In an expert study, participants preferred Sarvam Studio for overall quality and production readiness. With agentic document translation, Studio excels in contextually translating long-form content across genres. Our evaluations demonstrate that readers strongly preferred the output from Studio across different genres.
English
79
335
1.8K
255.1K
Ravi Theja retweeted
Sumanth
Sumanth@sumanthd17·
I selected one of the most challenging Telugu dialogues, widely celebrated for its powerful delivery, and it absolutely nailed it 🔥 @SarvamForDevs
English
18
41
314
21.5K
Ravi Theja retweeted
Ramsri Goutham Golla
Ramsri Goutham Golla@ramsri_goutham·
Finally launching my solo SaaS AiArtist.io 🚀 The AI motion graphics generator every creator has wished existed! Every frame in this launch video was created using only text prompts on the same platform. Launch promo: $29 for a full year. No subscriptions!
English
9
6
31
3.7K
Ravi Theja retweeted
Hugging Models
Hugging Models@HuggingModels·
Open-source, multilingual, real-time speech-to-text model that streams audio and transcribes it into text with low latency across multiple languages. Try now: huggingface.co/spaces/mistral…
English
3
25
208
15.5K