Varun

903 posts


@Smart_enouf

💻 Software Developer | Passionate about coding & innovation | NSUT '24 📚 | AI/ML enthusiast | Pupil @ Codeforces (1213)

Delhi · Joined February 2019
5.1K Following · 312 Followers
HeyGen @HeyGen
We solved character consistency. Forever Avatar V captures you in 15 seconds and holds your identity across every video. Change the look, outfit, and setting to create unlimited versions of you. RT + comment "AvatarV" below and I'll DM 100 credits to test it out (must follow)
Replies: 1.5K · Reposts: 1K · Likes: 2.2K · Views: 647K
Varun reposted
Sameer Allana @HitmanCricket
Hate him as much as you want, he only has love to offer. MS Dhoni is not only a great cricketer but also a great human being.
Replies: 54 · Reposts: 657 · Likes: 4.6K · Views: 107.9K
Varun reposted
Kimi.ai @Kimi_Moonshot
Zhilin at GTC: Introducing Attention Residuals

Learning selective memory, rather than mechanically accumulating everything, is the beauty of attention. Many of you have probably read Attention Is All You Need, the 2017 Transformer paper that brought "human-like" attention into the model's field of view. From that point on, models no longer simply read everything mechanically; they began to develop a sense of what matters more and what matters less across the text, choosing to retain the more important information.

Recently, Kimi applied this idea of attention to the temporal dimension, then rotated it 90 degrees into the model's depth dimension. This gives the model attention not only over time, but also throughout the transmission of information across layers — a more intelligent way to understand and process information.
Replies: 49 · Reposts: 155 · Likes: 1.4K · Views: 107.5K
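The 2017 mechanism the thread refers to can be sketched in a few lines of NumPy. This is a toy illustration of token-level scaled dot-product attention only, not Kimi's depth-wise "attention residuals", whose details are not given in the tweet:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # Scaled dot-product attention from "Attention Is All You Need":
    # each query position assigns a weight to every key position, so the
    # model keeps more of what matters and less of what doesn't.
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    weights = softmax(scores, axis=-1)  # each row sums to 1
    return weights @ V, weights

# toy self-attention: 4 tokens, embedding dimension 8
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
out, w = attention(x, x, x)
```

Because each row of `weights` is a probability distribution over the input tokens, the output at every position is a selectively weighted mixture rather than a mechanical sum of everything.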
Varun reposted
कुंभकरण @_kumbhkaran
Almost 18,000 students graduate from the IITs every year. Around 7,000 are from the top IITs, and about 1,000 of those are from the CS/EC branches. Of those 1,000, fewer than 5% get an international offer; the remaining 95% stay here. They earn and pay taxes here, yet we still have not built a system that supports them if they dare to attempt something challenging.

It is not IITians alone; it is the opportunities created by the US that allow IITians to grow there. Meanwhile, in India, we are busy finding new ways to distribute freebies to win the next election — freebies funded by the taxes of someone working day and night while living in a 10×15 room in Bengaluru or Gurgaon.
Replies: 26 · Reposts: 236 · Likes: 2.5K · Views: 140.5K
Shyam Meera Singh @ShyamMeeraSingh
Reading this news reminded me of something the Greek philosopher Rajeev Talwar once said: "A country whose scholars themselves…"
Bar and Bench @barandbench

The person who is an accused is praying for protection? You are a suspected accused. You are trying to sensationalise the issue: Uttarakhand High Court to gym owner ‘Mohammad’ Deepak Kumar

Replies: 95 · Reposts: 1K · Likes: 5.9K · Views: 184.3K
MANTU KUMAR @Mantu_kumar91
The moment you learn a skill, it's already obsolete 😑
[attached image]
Replies: 62 · Reposts: 41 · Likes: 1.3K · Views: 104.1K
Varun reposted
Mayank Pratap Singh @Mayank_022
I coded a Speech-to-Text model from scratch.

Here is the blog for the same: blogs.mayankpratapsingh.in/chapters/speec…

No APIs. No pre-trained models. Just PyTorch, an A100 GPU, and hours of debugging.

This started months ago. I wanted to understand how machines hear. Not a surface-level understanding; I wanted to build the whole thing myself. So I built it piece by piece: autoencoders, VAEs, VQ-VAEs, Residual Vector Quantization, and CTC loss. Each one took days to get right.

Trained for 3 hours on 13,100 audio clips. Got complete garbage. Changed the tokenizer from BPE to character-level. Rechecked everything. Asked @neural_avb, who has built STT models before. His answer: these models are tricky to train and need days of compute, not hours. Cut the dataset to 200 clips. After 2 hours, actual words appeared. Overfitted? Absolutely. But watching noise turn into recognizable English was satisfying.

The blog walks through my process:
- Audio fundamentals and waveform representation
- Why attention breaks on raw audio
- Convolutional downsampling
- Transformer encoder with positional encoding
- Vector Quantization, straight-through estimator, and RVQ
- CTC loss and greedy decoding
- Full training loop with VQ loss warmup
- What went wrong and what finally worked

Resources:
- Blog: blogs.mayankpratapsingh.in/chapters/speec…
- Code: github.com/Mayankpratapsi…

More resources:
- CTC loss: distill.pub/2017/ctc/
- @neural_avb videos: youtube.com/@avb_fj
- SoundStream paper: arxiv.org/abs/2107.03312
- LJ Speech dataset: keithito.com/LJ-Speech-Data…
- wav2vec paper: arxiv.org/abs/2006.11477
- RVQ blog: drscotthawley.github.io/blog/posts/202…

Next up: I've already trained two TTS architectures from scratch. A video post about those is coming soon. But first, I'm dropping a visual breakdown of Vision Transformers, covering how they work and how to fine-tune them.

Follow me @Mayank_022 if you're into audio deep learning. Repost so others can find this.
Replies: 27 · Reposts: 57 · Likes: 619 · Views: 50.4K
sachin. @sachinyadav699
drop your portfolio- need inspo
Replies: 31 · Reposts: 1 · Likes: 21 · Views: 2K
Mahesh Chulet @mchulet
✋✋ Monday again!! Time to promote your product. 🚀 Share your product URL
Replies: 91 · Reposts: 1 · Likes: 35 · Views: 2.2K
Sayan @thesayannayak
Pitch your startup in 1 line
Replies: 193 · Reposts: 3 · Likes: 86 · Views: 7.5K
Omri Dan @OmriBuilds
Pitch your startup in 3 words
Replies: 548 · Reposts: 4 · Likes: 194 · Views: 24K
John @ionleu
drop ur startup link
Replies: 443 · Reposts: 0 · Likes: 138 · Views: 13.5K
Varun @Smart_enouf
@IrfanPathan Good point, and the analogy is even better 😂
Replies: 0 · Reposts: 0 · Likes: 3 · Views: 579
Irfan Pathan @IrfanPathan
Deliver zakat to the needy the same way you sneak a burger to your wife in a joint family.
Replies: 792 · Reposts: 1.1K · Likes: 15K · Views: 561.7K
Aman @Amank1412
Microsoft open sourced an inference framework that runs a 100B parameter LLM on a single CPU.
Replies: 12 · Reposts: 18 · Likes: 254 · Views: 28.8K
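The tweet likely refers to Microsoft's bitnet.cpp, an inference framework for 1-bit/ternary LLMs. What makes CPU inference of such models feasible is that weights quantized to {-1, 0, +1} turn matrix-vector products into additions and subtractions, with no floating-point multiplies and a fraction of the memory. A toy sketch of the idea (illustrative only, not how any specific framework is implemented):

```python
# Ternary-weight matrix-vector product: every weight is -1, 0, or +1,
# so the dot product reduces to accumulating +x, -x, or nothing.

def ternary_matvec(W, x):
    """y = W @ x where every W[i][j] is in {-1, 0, +1}."""
    y = []
    for row in W:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi   # add instead of multiply
            elif w == -1:
                acc -= xi   # subtract instead of multiply
        y.append(acc)
    return y

W = [[1, 0, -1],
     [-1, 1, 1]]
x = [0.5, 2.0, -1.5]
print(ternary_matvec(W, x))  # -> [2.0, 0.0]
```

In a real framework the ternary weights would also be bit-packed (well under 2 bits per weight), which is what lets a 100B-parameter model fit in ordinary RAM.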