Rishikesh (ऋषिकेश)

975 posts

Rishikesh (ऋषिकेश)

@ai_rishikesh

LLM Researcher, Audio Generation, TTS & Image domain Open-Source enthusiast | Backend Dev | Photographer | Boston Celtics 🏀 | Man Utd FC ⚽️

New Delhi, India Katılım Temmuz 2020

608 Takip Edilen371 Takipçiler

Rishikesh (ऋषिकेश)@ai_rishikesh·5d

Mamba 3 🚀 just dropped and the official code is full of CUDA/Triton kernels. So I wrote a clean, readable code from-scratch PyTorch version to actually understand what's going on. Covers SISO + MIMO, trapezoidal scan, and RoPE — all in one file📑. code: github.com/rishikksh20/ma…

English

Rishikesh (ऋषिकेश)@ai_rishikesh·6d

Have implemented the Qwen 3.5 0.8B model. Code: github.com/rishikksh20/qw…

Qwen@Alibaba_Qwen

🚀 Introducing the Qwen 3.5 Small Model Series Qwen3.5-0.8B · Qwen3.5-2B · Qwen3.5-4B · Qwen3.5-9B ✨ More intelligence, less compute. These small models are built on the same Qwen3.5 foundation — native multimodal, improved architecture, scaled RL: • 0.8B / 2B → tiny, fast, great for edge device • 4B → a surprisingly strong multimodal base for lightweight agents • 9B → compact, but already closing the gap with much larger models And yes — we’re also releasing the Base models as well. We hope this better supports research, experimentation, and real-world industrial innovation. Hugging Face: huggingface.co/collections/Qw… ModelScope: modelscope.cn/collections/Qw…

English

105

Rishikesh (ऋषिकेश)@ai_rishikesh·16 Mar

It might be time to revisit and rethink each component of the Transformer stack. Solid paper from Kimi.ai🫡

Kimi.ai@Kimi_Moonshot

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…

English

Rishikesh (ऋषिकेश)@ai_rishikesh·7 Mar

Good progress... 🙌

Pratyush Kumar@pratykumar

📢 Open-sourcing the Sarvam 30B and 105B models! Trained from scratch with all data, model research and inference optimisation done in-house, these models punch above their weight in most global benchmarks plus excel in Indian languages. Get the weights at Hugging Face and AIKosh. Thanks to the good folks at SGLang for day 0 support, vLLM support coming soon. Links, benchmark scores, examples, and more in our blog - sarvam.ai/blogs/sarvam-3…

English

100

Rishikesh (ऋषिकेश) retweetledi

Liang Zheng@LiangZheng_06·6 Mar

sharing my slides on end-to-end diffusion model training. we are moving too fast to realise that there are in fact lots of fundamental problem unsolved. drive.google.com/file/d/14XQv_r… REPA-E: github.com/End2End-Diffus… REPA-E VAE family: huggingface.co/REPA-E iREPA: github.com/End2End-Diffus… @sainingxie @1jaskiratsingh

English

273

14.4K

Rishikesh (ऋषिकेश) retweetledi

Karan Thakkar@Carankt·25 Şub

@ai_rishikesh Cooked! Congratulations DubPro AI

Shark Tank India@sharktankindia

What if your voice could speak every language? This pitcher just made it possible! 🌐🦈 #dubpro.ai Watch The New Episodes Of Shark Tank India Season 5, Streaming Now Mon-Fri, 8 PM on Sony LIV. Watch Free On Mobile Only. @AnupamMittal @amangupta0303 @namitathapar @vineetasng

Română

Rishikesh (ऋषिकेश)@ai_rishikesh·17 Şub

🫨

Hugging Models@HuggingModels

NVIDIA just dropped PersonaPlex-7B 🤯 A full-duplex voice model that listens and talks at the same time. No pauses. No turn-taking. Real conversation. 100% open source. Free. Voice AI just leveled up. huggingface.co/nvidia/persona…

ART

Rishikesh (ऋषिकेश)@ai_rishikesh·23 Oca

It makes Premium Voice tokens obsolete, and a cheap voice agent is no longer an imagination. KUDOS TO Alibaba 🎊🎊🎊

Qwen@Alibaba_Qwen

Qwen3-TTS is officially live. We’ve open-sourced the full family—VoiceDesign, CustomVoice, and Base—bringing high quality to the open community. - 5 models (0.6B & 1.8B) - Free-form voice design & cloning - Support for 10 languages - SOTA 12Hz tokenizer for high compression - Full fine-tuning support - SOTA performance We believe this is arguably the most disruptive release in open-source TTS yet. Go ahead, break it and build something cool. 🚀 Everything is out now—weights, code, and paper. Enjoy. 🧵 Github: github.com/QwenLM/Qwen3-T… Hugging Face: huggingface.co/collections/Qw… ModelScope: modelscope.cn/collections/Qw… Blog: qwen.ai/blog?id=qwen3t… Paper: github.com/QwenLM/Qwen3-T… Hugging Face Demo: huggingface.co/spaces/Qwen/Qw… ModelScope Demo: modelscope.cn/studios/Qwen/Q… API: alibabacloud.com/help/en/model-…

English

Rishikesh (ऋषिकेश) retweetledi

DailyPapers@HuggingPapers·22 Oca

Qwen just dropped Qwen3-TTS on Hugging Face Voice cloning from 3s of audio, 10-language support, and 97ms streaming latency for ultra-realistic speech generation

English

210

19K

Rishikesh (ऋषिकेश)@ai_rishikesh·15 Ara

👀

Mark Kretschmann@mark_k

Gemma 4 incoming from @GoogleDeepMind (new open source model!)

ART

Rishikesh (ऋषिकेश)@ai_rishikesh·15 Ara

Amazing 🤯 the inference speed, quality, emotions and portability

fal@fal

🚨 Chatterbox Turbo is now live on fal! 🗣️ Ultra-fast, open-source text-to-speech built for real-time voice AI ⚡ Up to 6× faster-than-real-time 🎭 Paralinguistic tags for non-verbal reactions: [sigh], [chuckle], [laugh], [gasp] + more 🎙️ Instant voice cloning from ~5 seconds of audio. Reactions stay in the same voice

English

Rishikesh (ऋषिकेश)@ai_rishikesh·12 Ara

👀

OpenBMB@OpenBMB

VoxCPM Technical Report is here! ⚡️ We’re taking realistic speech generation to the next level of Efficiency. 📉 ✨ Highlights: 🚫 Tokenizer-Free: Pure end-to-end continuous modeling for high fidelity. 🧠 Hierarchical Design: TSLM + RALM ensures both stability & expressivity. ⚡ Blazing Fast: Achieves an RTF as low as 0.17 on consumer GPUs. Dive into the tech that makes it possible: 🔗 Technical Report: arxiv.org/abs/2509.24650 🤗 Model: huggingface.co/openbmb/VoxCPM… 🎮 Demo: huggingface.co/spaces/openbmb… #AI #TTS #OpenBMB #VoxCPM #OpenSource

ART

Rishikesh (ऋषिकेश)@ai_rishikesh·10 Ara

@rdesh26 @Carankt

QAM

304

Desh Raj@rdesh26·10 Ara

>hiring RS interns for summer 2026 (12 to 24 weeks) >work on audio/speech + LLMs at MSL >should be a PhD candidate with relevant experience >apply through metacareers.com/profile/job_de… or DM me

English

127

11.2K

Rishikesh (ऋषिकेश)@ai_rishikesh·9 Ara

Keep 👀 on them

Vignesh Ravichandran@viggy28

Your clients and prospects are already talking to AI. Why aren’t they talking to yours? 🤔 That changes today with @myclone_is - your knowledge, your clients, your AI. Let me tell the quiet part out loud: for consultants, advisors, and coaches, you are the product. We spent 4 months with 450+ consultants & coaches and captured it in a 2-min film about Jo. Who is ready to use AI to scale themselves? #MyClone

English

Rishikesh (ऋषिकेश)@ai_rishikesh·8 Ara

🤩

steven@Tu7uruu

Just dropped on HF: YODAS2-Sido a multilingual, massive-scale speech dataset. > 67+ languages with balanced speaker diversity > High-quality, natural conversational audio > Ideal for ASR, TTS, speech-to-speech, and audio agents > Clean annotations with ready-to-train splits > Strong fit for multimodal LLM alignment work You can easily load it with Hugging Face’s datasets library!

ART

Rishikesh (ऋषिकेश)@ai_rishikesh·4 Ara

Qwen 2.5 0.5 B 🥳 still shows capability in different form... huggingface.co/microsoft/Vibe…

English

254

Rishikesh (ऋषिकेश)@ai_rishikesh·26 Kas

@mufaddal_vohra Bro pick Rashford when asked to choose between Ronaldo and Messi... Now sounds like Eric Ten Hag ....

English

Mufaddal Vohra@mufaddal_vohra·26 Kas

Gautam Gambhir said, “I’m same guy under whom India did well in England, won the Champions Trophy and won the Asia Cup”.

English

2.3K

700

15.4K

3.8M

Rishikesh (ऋषिकेश)@ai_rishikesh·26 Kas

@news24tvchannel ASAP हाँ

News24@news24tvchannel·26 Kas

क्या गंभीर को कोच के पद से हटा देना चाहिए ◆ कमेंट में लिखिए - हाँ या ना #GautamGambhir | Gautam Gambhir

हिन्दी

711

7.8K

446.7K

Rishikesh (ऋषिकेश)@ai_rishikesh·26 Kas

Mr. GG please leave Indian Cricket Team ASAP.. You get a team who won T20 WC, no. 1 in ODI and WTC finalist Test team... All you have to do is just coach but you bring your ego and stupidity in to the team. @BCCI need some action against Coach and Selectors

English