Subimal Bhattacharjee
7K posts

Subimal Bhattacharjee
@subimal
Cyber, Data Science, Defence, NorthEast India : author, columnist & docu filmmaker! Dir #Netaji@125, #InspiteoftheFence, Prod https://t.co/AGXV93yVnP

















At the launch of "The Digital Decades" by Subimal Bhattacharjee ji, reflected on India’s journey to 1 billion internet users today. From VSNL dial-up to world-leading 5G rollout, from PCO queues to 21+ billion monthly UPI transactions, Bharat’s transformation, especially in the last 11 years led by PM @narendramodi ji has been swift, inclusive and unprecedented. The digital decade ahead belongs to Bharat. 🇮🇳











Sarvam just built India's first full-stack sovereign AI stack that actually works for 1.4 billion people Hype-chasers will miss this - but Sarvam just built something special yesterday at the IndiaAI Summit. Unfortunately, a lot of the discourse continues to miss the real story. Let's take a stock of the state of the union first: (1) Most people outside India (and even many inside) don’t fully appreciate the invisible ceilings we operate under here. H100 and Blackwell clusters are still not commercially stocked at a meaningful scale. (2) US export caps haven't helped, squeezing supply even further. (3) Indian teams literally queue for hours – sometimes days – of A100/H100/Blackwell time that US and Chinese labs get on tap. (4) The IndiaAI Mission provides shared compute, but it comes with strict allocation queues and governance. (5) Data is the even harder long-tail nightmare. Indic languages plus heavy code-mixing across 22 scheduled tongues form a tiny fraction of global corpora. You can’t simply scrape your way to high-quality pretraining data the way English-centric labs do. (6) Any serious local team must first build its own corpus – months of curation, cleaning, deduplication, and synthetic generation – before the very first gradient step. (7) Talent pipeline for HPC-scale MoE training, edge optimisation, and state-space architectures is still forming Despite all of this, the entire effort was pulled off w/ a core team of just 15 engineers & a meager corpus of ~4k GPUs - this is a REAL feat Yet they shipped India’s first credible sovereign full-stack in one coordinated go. Let's take a look at what all Sarvam actually built: (1) A 30B MoE model trained from scratch on 16T pure Indic tokens, 32k context length, ~1B active parameters per token – purpose-engineered for real-time voice conversations and agentic loops that feel completely native in Hinglish or any regional tongue. (2) A 105B MoE model (128k context, ~9B active parameters) reaching GLM-4.5-Air class performance on complex reasoning and long-form tasks - the practical walk-phase semi-frontier model that punches far above its headline size. (3) A 3B state-space Vision model that sets new SOTA on Indic OCR, tables, charts, and even historic Devanagari manuscripts – linear scaling that lets it handle 50-page mixed-language documents where transformers would choke on memory. (4) Sub-350MB edge models that finally make everything truly offline and population-scale: 74M Saaras STT with automatic language ID running 8.5× real-time on Snapdragon 8 Gen 3 (TTFT under 300 ms), 24M Bulbul TTS with natural voice cloning from just one hour of audio inside a 60MB footprint, and 150M bidirectional translation covering 110 language pairs across 10 Indic languages + English with zero English pivot. Smart choices everywhere that scream first-principles engineering. They chose a proven high-sparsity MoE backbone, layered Multi-Headed Latent Attention for massive KV-cache compression wins & partnered with NVIDIA’s Nemotron co-design for both training stability (MoE reinforcement learning is notoriously unstable) & 4× inference throughput on Blackwell. This is real pretraining plus RL solved under constraints that would make most global teams blink. The 105B isn’t 1T-parameter fireworks, but it is the walk-phase model that actually lands on ₹8k feature phones & smart glasses. That is exactly how you reach semi-frontier capability in 2026 w/o burning years on wheel-reinvention Model adoption is always long-tail. You need to ship multiple non-frontier quality pieces until the one that truly owns the dimensions we care about arrives. Sarvam just handed every Indian founder, builder, SME & policymaker a stack that actually works for farmers checking fertiliser prices in their dialect, street vendors negotiating deals in Hinglish, government departments processing 22-language documents & forms w/o any cloud round-trips, and millions more in everyday vernacular scenarios. This isn’t hype. This isn’t nationalism. It’s recognising a genuine engineering feat under constraints that most of the world never has to face – compute scarcity, data fragmentation, talent pipeline still maturing. A cracked team of engineers gave it their all over the past several weeks to do what many doubted as not doable in/from India - built usefully large, globally competitive models from scratch in India. India's own AI moment is arriving & all the stuff done by this amazing team tells us, "Yes, India can & India will" 👏 @SarvamAI, @pratykumar @vivek_raghavan, @_mohit_singla, @anand_404, @kediaharshit9, @AashaySachdeva, @sumanthd17, @ArpitDwivedi100, @HarveenChadha, @rkal4, @sushil_khyalia, @ManavSinghal157, @sohampetkar, @selfawareatom, @AnnaUpreti, @MeghMakwan33973 & the rest of the team

At the AI Impact Summit, the Indian Army’s panel on “Defence Perspective in AI” attracted a wide audience, bringing together military leadership, industry and academia for a focused dialogue on responsible AI in defence, duly moderated by Dr Subimal Bhattacharjee. Lieutenant General Vipul Shinghal, DCOAS (IS&T), outlined leadership challenges in the AI era, while Lieutenant General Harsh Chhibber, DGIS, emphasised human agency in the application of force. Major General Pawan Anand (Retd) highlighted ethical considerations and international humanitarian law. Industry and academic insights were shared by Dr Vikram Jayaram on technological sovereignty, Prof Ganesh Ramakrishnan on India’s sovereign AI stacks, and Ms Madhumita Mohapatra on AI-driven logistics optimisation. The session highlighted a shared approach to operational transformation, grounded in responsible and ethical AI. #AtmanirbharBharat #PeoplePlanetProgress #ResponsibleAI #AIinDefence #DigitalIndia @DefenceMinIndia @SpokespersonMoD @OfficialINDIAai



