Ravi Theja

1.4K posts

@ravithejads

Applied AI @MistralAI. Previously @llama_index (LlamaIndex). Focused on LLMs, Agents, RAG, and fine-tuning LLMs.

Bangalore, India · Joined December 2010
815 Following · 6.1K Followers
Pinned Tweet
Ravi Theja (@ravithejads)
🔥 Releasing BRAG: High-Performance RAG Model Trained for $25 🚀
We’re thrilled to announce the launch of our RAG models, collectively known as BRAG, a series of SLMs (3 SLMs and 1 Ultra SLM) specifically trained for Retrieval Augmented Generation (RAG).
1️⃣ BRAG-Qwen2-7b-v0.1
2️⃣ BRAG-Llama-3.1-8b-v0.1
3️⃣ BRAG-Llama-3-8b-v0.1
4️⃣ BRAG-Qwen2-1.5b-v0.1
🌟 Our models outperform Cohere’s Command R+, Qwen2, Llama3.1, and Llama3 Instruct models and closely match the performance of GPT-4-Turbo and Nvidia’s ChatQA-1.5-8B on ChatRAG-Bench.
💵 Each model was trained for less than $25.
🔍 More details on the datasets and training procedure are in our technical report: shorturl.at/IrCE1
🙌 A huge thank you to @HamelHusain, @dan_s_becker, and @charles_irl for the credits on @modal_labs via the LLM Fine-tuning course.
This work was done in collaboration with @nlpguy_ under maximalists.ai
18 replies · 81 reposts · 441 likes · 65.5K views
Ravi Theja retweeted
Mistral AI (@MistralAI)
Mistral AI is headed to San Jose for @NVIDIA GTC! 🚀 We’re demoing our newest frontier models, sharing our vision for the future of enterprise AI, and unveiling some big news you won't want to miss. 📍 Visit us to see the latest innovations in action. 🗓️ Check out our sessions and book a meeting: link in 🧵
11 replies · 28 reposts · 214 likes · 16.8K views
Ravi Theja retweeted
Pratyush Kumar (@pratykumar)
📢 Open-sourcing the Sarvam 30B and 105B models! Trained from scratch with all data, model research and inference optimisation done in-house, these models punch above their weight in most global benchmarks plus excel in Indian languages. Get the weights at Hugging Face and AIKosh. Thanks to the good folks at SGLang for day 0 support, vLLM support coming soon. Links, benchmark scores, examples, and more in our blog - sarvam.ai/blogs/sarvam-3…
208 replies · 1.3K reposts · 6.9K likes · 724.9K views
Ravi Theja retweeted
Ramsri Goutham Golla (@ramsri_goutham)
Navarasa in Deepmind's Gemmaverse 🚀 The work @ravithejads and I did in building Navarasa, an Indic instruction-finetuned model on top of Google's Gemma catering to 15 Indian languages, has been featured in Deepmind's Gemmaverse!
5 replies · 2 reposts · 22 likes · 1.3K views
Ravi Theja retweeted
Tanishq Kumar (@tanishqkumar07)
I've been working on a new LLM inference algorithm. It's called Speculative Speculative Decoding (SSD) and it's up to 2x faster than the strongest inference engines in the world. Collab w/ @tri_dao @avnermay. Details in thread.
133 replies · 455 reposts · 4K likes · 599.1K views
Ravi Theja retweeted
Pratyush Kumar (@pratykumar)
Drop 13/14: The 30B and 105B models, benchmarks, and HF links will all come. But today it is a drop about people. About how our team of just 15 folks gave it their all to do what many doubted was doable, i.e. train usefully large, globally competitive models from scratch in India. This team of 15 has now firmly launched @sarvam into its second innings. Yes, we can! @_mohit_singla @anand_404 @kediaharshit9 @AashaySachdeva @sumanthd17 @ArpitDwivedi100 @HarveenChadha @rkal4 @sushil_khyalia @ManavSinghal157 @sohampetkar Missing from the picture: @selfawareatom @AnnaUpreti Anand @MeghMakwan33973 Utkarsh
198 replies · 742 reposts · 5.3K likes · 307.9K views
Ravi Theja retweeted
Pratyush Kumar (@pratykumar)
Drop 12/14: Models, products, impact - today something different, very different. Launching Sarvam Kaze, our foray into getting our models into your hands with our devices - designed and built here in India!
131 replies · 573 reposts · 3.4K likes · 346.2K views
Ravi Theja retweeted
Pratyush Kumar (@pratykumar)
Drop 9/14: Today we are introducing Sarvam Studio, our product to help creators go multilingual. One piece of content, every corner of India. With AI video dubbing, Studio generates high-fidelity dubs in 11 Indian languages. In an expert study, participants preferred Sarvam Studio for overall quality and production readiness. With agentic document translation, Studio excels in contextually translating long-form content across genres. Our evaluations demonstrate that readers strongly preferred the output from Studio across different genres.
83 replies · 341 reposts · 1.8K likes · 253.9K views
Ravi Theja retweeted
Sumanth (@sumanthd17)
I selected one of the most challenging Telugu dialogues, widely celebrated for its powerful delivery, and it absolutely nailed it 🔥 @SarvamForDevs
18 replies · 42 reposts · 316 likes · 21.5K views
Ravi Theja retweeted
Ramsri Goutham Golla (@ramsri_goutham)
Finally launching my solo SaaS AiArtist.io 🚀 The AI motion graphics generator every creator has wished existed! Every frame in this launch video was created using only text prompts on the same platform. Launch promo: $29 for a full year. No subscriptions!
9 replies · 6 reposts · 31 likes · 3.6K views
Ravi Theja retweeted
Hugging Models (@HuggingModels)
Open-source, multilingual, real-time speech-to-text model that streams audio and transcribes it into text with low latency across multiple languages. Try now: huggingface.co/spaces/mistral…
3 replies · 25 reposts · 213 likes · 15.5K views
Ravi Theja retweeted
Mistral AI for Developers (@MistralDevs)
Voxtral can now directly stream audio input into text output. Perfect for:
- Live subtitles
- Language learning apps
- Note-taking tools
- And more!
Made a demo for you to try directly on Hugging Face!
26 replies · 85 reposts · 832 likes · 72.5K views
Ravi Theja (@ravithejads)
Releasing Voxtral Transcribe 2 from @MistralAI - two next-generation speech-to-text models delivering state-of-the-art transcription quality, speaker diarization, and ultra-low latency. This new family includes: 🔹 Voxtral Mini Transcribe V2 - Industry-leading batch transcription with diarization, word-level timestamps, context biasing, and support for 13 languages. 🔹 Voxtral Realtime - Built for live applications with configurable latency down to sub-200 ms, making it ideal for voice agents and real-time experiences. (Open weights under the Apache 2.0 license.) You can also explore an interactive audio playground in Mistral Studio to instantly test diarization and timestamp features. 👉 Learn more here: voxtral--mistral-website.netlify.app/news/voxtral-t…
Mistral AI (@MistralAI)

Introducing Voxtral Transcribe 2, next-gen speech-to-text models by @MistralAI. State-of-the-art transcription, speaker diarization, sub-200ms real-time latency. Details in 🧵

0 replies · 0 reposts · 8 likes · 803 views
Ravi Theja retweeted
LMSYS Org (@lmsysorg)
You can now run any Diffusers pipeline directly in SGLang, combining SGLang’s optimized inference stack with the flexibility and rich optimization options of Hugging Face Diffusers 🤗🔥 This partnership truly connects open-source diffusion inference optimization with the Hugging Face ecosystem, making it easier to build and deploy efficient, production-ready generative applications. Huge kudos to @adarshxs for leading this effort!
Sayak Paul (@RisingSayak)

You can run ANY pipeline from Diffusers in @sgl_project and benefit from the open tooling for optimized inference in the space 🔥 Combine SGLang's optims + Diffusers' flexible options for optims to suit your needs 🤗 Kudos to @adarshxs for leading the work here!

0 replies · 4 reposts · 37 likes · 3.9K views
Ravi Theja retweeted
Sayak Paul (@RisingSayak)
You can run ANY pipeline from Diffusers in @sgl_project and benefit from the open tooling for optimized inference in the space 🔥 Combine SGLang's optims + Diffusers' flexible options for optims to suit your needs 🤗 Kudos to @adarshxs for leading the work here!
2 replies · 9 reposts · 33 likes · 8.5K views
sheetal chauhan (@sheetalchauhan5)
bit late to the party, but here's my 2025 unwrapped:
- quit my job. walked out mentally and creatively exhausted after a decade across big tech + startups
- three weeks into a supposed “career sabbatical”, randomly signed up for PeakXV's consumer AI hackathon. ended up winning it w. @SinghVijit14477 🏆
- joined @southpkcommons. immediate raise in ambition. still no clarity on what i’d do post the 6-month sabbatical
- spent the next few weeks trying to fall back in love with the joy and art of creation. my weekend side-projects finally had a term with "vibe coding"
- tinkered with a bunch of apps: a) a personal system of records for the agent-native world (pitched to a few folks at MS, dropped within a week), b) a personalized podcast app (notebookLM × seinfeld / dwight schrute; shipped on reddit, got a lot of AI slop flak lol), c) and a few more consumer x social experiments inspired by a weird mix of generative agents stanford paper, game engines, OSS text to 3D models
- tried my hand at perfumery and pottery. 'twas kinda nice.
- got back into cooking and badminton. AND, finally overcame my childhood fear of drowning in a pool - learnt how to swim! LOVED IT!
- caught up with @NamrataRajagop and started jamming on a common frustration/question: 'how do we enable experimental research at scale in india?'
- we set up @except_raised with a few more friends. wrote a white paper. shared it around. no response lol
- we said “fuck it”. ran a research-first weekend hackathon. people showed up (BOTH DAYS). loved it. wanted more.
- so, we got bolder. experimented with micro-grants with help from amazing partners @emergent_vc and @ankitcc at SPC india. the quality of ideas we saw was the highlight of my year. more here: exceptionraised.com
- back to tinkering, went deeper into the future of media, creativity, and a new kind of creator blending models, code, and taste!
- met @YashGargK at SPC. launched @lets_compose with our first short story, crafted by our dear friend @neervj who i'd just met a week ago lol. 150k views in under 48 hours! wild.
- raised some $$
- went to china and SF. overwhelmed by ambition. felt like a lunatic in india and oddly too rational in SF. went through weeks of doubt and imposter syndrome
- came back with bigger dreams and a LOT of inspiration. still very early at @lets_compose, but December has been one of the most creatively fulfilling months in a long time
excited for 2026. deeply grateful for all the friends, creative partners, and early supporters. @spc_india LFG! 🚀✨
13 replies · 1 repost · 98 likes · 9.1K views
Ravi Theja retweeted
Ivan Fioravanti ᯅ (@ivanfioravanti)
Mistral Vibe CLI running Devstral-small-2-2512 on M3 Ultra 512GB with LM Studio as backend 🔥 The first part of the video plays at 1x and the rest at 2x. Great speed overall: 27 toks/s on average, as tracked on the LM Studio console. Details to run it below 🧵
11 replies · 36 reposts · 296 likes · 49.8K views