Bhiman Kumar Baghel

203 posts

@bhinubkb

CS PhD Student @ University of Pittsburgh. Ex-Lead Engineer, Speech & Natural Language Processing @ Samsung Research, Bangalore. MTech in CSE @ IIT Kharagpur

Pittsburgh, PA · Joined June 2015
67 Following · 38 Followers
Pinned Tweet
Bhiman Kumar Baghel @bhinubkb
🚨 “How DeepSeek-R1 beat OpenAI at reasoning: a deep dive” 🧵
1/ I’ve launched a 3-part breakdown. In Part 1:
– LLaMA 3 vs DeepSeek-R1-zero
– MoE vs Dense
– RLHF
Let’s dive in 🧵
Full Video: 🔗 youtu.be/iLBdjINKR0U
[Image]
Replies 1 · Reposts 0 · Likes 1 · Views 99
Bhiman Kumar Baghel retweeted
sui ☄️ @birdabo
this story is absolutely insane 🤯
> tech guy with zero biology background.
> his dog got terminal cancer.
> vets said 1 - 6 months left.
> bro said nah not on my watch.
> asked ChatGPT for a treatment plan.
> sequenced tumor DNA for $3k.
> used AlphaFold AI to model mutated proteins.
> designed world’s first personalized mRNA vaccine for a dog.
> partnered with universities to synthesize it.
> ethics approval took 3 months.
> vaccine design took 2 months.
> first injection December 2025.
> tumors shrank 75% within weeks.
> dog happy.
> universities confirmed it worked.
> now designing version 2 for remaining tumor.
AI + a guy determined to save his dog just outperformed the pharma industry 💀
the cure for cancer will be open source.

vittorio @IterIntellectus
this is actually insane
> be tech guy in australia
> adopt cancer riddled rescue dog, months to live
> not_going_to_give_you_up.mp4
> pay $3,000 to sequence her tumor DNA
> feed it to ChatGPT and AlphaFold
> zero background in biology
> identify mutated proteins, match them to drug targets
> design a custom mRNA cancer vaccine from scratch
> genomics professor is “gobsmacked” that some puppy lover did this on his own
> need ethics approval to administer it
> red tape takes longer than designing the vaccine
> 3 months, finally approved
> drive 10 hours to get rosie her first injection
> tumor halves
> coat gets glossy again
> dog is alive and happy
> professor: “if we can do this for a dog, why aren’t we rolling this out to humans?”
one man with a chatbot, and $3,000 just outperformed the entire pharmaceutical discovery pipeline.
we are going to cure so many diseases.
I dont think people realize how good things are going to get

Replies 341 · Reposts 3.9K · Likes 36.7K · Views 2.3M
Bhiman Kumar Baghel @bhinubkb
🧵 9/ 🎓 I’m Bhiman, a CS PhD student @ Pitt, studying model editing, LLM reasoning, and efficient fine-tuning.
As I go through my PhD journey, I share what I learn. Let’s learn together.
👉 Follow me: x.com/bhinubkb #phdlife
Replies 0 · Reposts 0 · Likes 0 · Views 35
Bhiman Kumar Baghel @bhinubkb
🧵 5/
LLaMA 3:
→ 405B parameters (all active for every token)
DeepSeek-V3-base:
→ 671B total
→ Only about 37B active per token via MoE routing
⚡ That’s a massive saving in compute per token during training and inference, although all 671B parameters still have to be stored.
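A quick back-of-the-envelope check of those numbers, as a rough sketch. The parameter counts are the ones quoted in the tweet; the "about 2 FLOPs per active parameter per token" figure is a common rule-of-thumb assumption, not something stated in the thread, and it ignores attention cost and memory traffic.

```python
# Parameter counts quoted in the thread, in billions.
LLAMA3_TOTAL_B = 405.0     # dense: every parameter is used for every token
DEEPSEEK_TOTAL_B = 671.0   # MoE: parameters stored in total
DEEPSEEK_ACTIVE_B = 37.0   # MoE: parameters actually used per token


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters that take part in one token's forward pass."""
    return active_b / total_b


def approx_forward_flops_per_token(active_b: float) -> float:
    """Rule-of-thumb assumption: roughly 2 FLOPs per active parameter per token."""
    return 2.0 * active_b * 1e9


moe_fraction = active_fraction(DEEPSEEK_ACTIVE_B, DEEPSEEK_TOTAL_B)
dense_to_moe = (
    approx_forward_flops_per_token(LLAMA3_TOTAL_B)
    / approx_forward_flops_per_token(DEEPSEEK_ACTIVE_B)
)

print(f"Active share of the MoE model per token: {moe_fraction:.1%}")  # 5.5%
print(f"Dense 405B vs 37B active, per-token compute: {dense_to_moe:.1f}x")  # 10.9x
```

Under these assumptions the MoE model touches about 5.5% of its weights per token, and the dense 405B model needs roughly 11 times the per-token compute.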
Replies 1 · Reposts 0 · Likes 0 · Views 49
Bhiman Kumar Baghel @bhinubkb
🧵 4/ Starting with the architectural difference …
💡 Instead of a dense decoder-only transformer (like LLaMA 3), DeepSeek-R1-zero is built on a Mixture of Experts (MoE) model.
🧠 Only a subset of experts is active per token → faster inference + modular specialization.
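To make "only a subset of experts is active" concrete, here is a minimal sketch of generic top-k routing in plain Python. It is not DeepSeek's actual router: the toy experts, the router scores, and the choice to softmax over the selected scores are all illustrative assumptions.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, k):
    """Pick the k highest-scoring experts and return [(expert_index, weight), ...]."""
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    # Assumption: gate weights are a softmax over the selected scores only.
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))


def moe_layer(token, experts, router_logits, k=2):
    """Weighted sum of the outputs of the k selected experts; the rest never run."""
    out = [0.0] * len(token)
    for idx, weight in route_token(router_logits, k):
        expert_out = experts[idx](token)
        out = [o + weight * v for o, v in zip(out, expert_out)]
    return out


# Hypothetical toy experts: each one scales the token by a different constant.
experts = [lambda t, s=s: [s * v for v in t] for s in (1.0, 2.0, 3.0, 4.0)]
token = [1.0, -1.0]
router_logits = [0.1, 2.0, -1.0, 2.0]  # hypothetical router scores for this token

selected = route_token(router_logits, k=2)
output = moe_layer(token, experts, router_logits, k=2)
print(selected)  # experts 1 and 3, each with weight 0.5
print(output)    # [3.0, -3.0]
```

With 4 experts and k=2, half of the expert weights are skipped for this token. Per the numbers in the thread, DeepSeek-V3-base runs about 37B of its 671B parameters per token.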
[Image]
Replies 1 · Reposts 0 · Likes 0 · Views 32
Bhiman Kumar Baghel @bhinubkb
🧵 3/ 🔧 Let’s start with the training pipeline. Conventional LLMs typically follow:
1. Pre-training (language learning)
2. Supervised fine-tuning
3. RLHF (reinforcement learning from human feedback)
DeepSeek-R1 uses a twist on this…
[2 images]
Replies 1 · Reposts 0 · Likes 0 · Views 44
Bhiman Kumar Baghel @bhinubkb
🧵 2/ 📌 Why is DeepSeek-R1 a big deal?
It matched or narrowly beat OpenAI’s o1 (their reasoning-focused model) on several math & code benchmarks, traditionally among the hardest LLM tasks.
And it did it using mostly reinforcement learning.
[Image]
Replies 1 · Reposts 0 · Likes 0 · Views 30
Bhiman Kumar Baghel retweeted
Elon Musk @elonmusk
Do these expenditures seem like a good use of your taxes?
Replies 9.9K · Reposts 39.4K · Likes 212.5K · Views 39.2M
Bhiman Kumar Baghel retweeted
Elon Musk @elonmusk
[Image]
Replies 43.6K · Reposts 98.6K · Likes 826K · Views 99.9M
Bhiman Kumar Baghel retweeted
Sasha Rush @srush_nlp
Updating all my NeurIPS papers.
[Image]
Replies 37 · Reposts 163 · Likes 2.4K · Views 250.4K
Backpacking Daku @outofofficedaku
Delhi ✈️ New York Return
Jan - Feb - Mar 2025
Rs. 57K 🎒💼
Fly with British Airways
Share | RT #DakuFlightDeals
[Image]
Replies 4 · Reposts 2 · Likes 60 · Views 13K
Alvin @sondesix
We've seen green or pink lines before, but have you seen white lines on a phone's screen before? 🤔
[Image]
Replies 22 · Reposts 4 · Likes 175 · Views 12.1K
Akshat Shrivastava @Akshat_World
Construction of Burj Khalifa began in 2004. In 20 years, it helped transform Dubai's economy. Lakhs of tourists come to see Burj Khalifa, most of them international, which adds more money to Dubai's economy.
Now, Indian politicians look at this case study and say, "let's build giant structures, tourists will come, jobs will be added, and money will flow like a river."
In fact, many politicians in India have tried emulating this model. For example, in Noida, Mayawati spent 2600 Crore of public money on building numerous statues. Did it transform Noida's economy? Absolutely not.
What people miss is that a structure is only the centerpiece. Planned infrastructure needs to be built around it (that's the hard part). Near Burj Khalifa, for example:
- Women can go out in the middle of the night (safe)
- It is easy to get a cab or travel (good transport)
- If you have a baby, you can use a baby stroller and walk for kilometres.
A developer called EMAAR not only built Burj Khalifa but also did the masterplan development for the entire Downtown. Their vision was to give tourists a great experience, and they succeeded.
Unfortunately, in India the vision is to build a structure and make reels and vlogs with it (take media credit).
Therefore, we have reached a point where even our newly constructed Parliament is leaking, despite spending several crores.
Replies 329 · Reposts 853 · Likes 6.8K · Views 559.9K
Punekar News @punekarnews
🚨 Disturbing Video Alert 🚨
#Pune: A tragic incident occurred in Colony No. 2, Ganeshnagar, near Bopkhel, Pimpri Chinchwad, Pune, where a minor girl lost her life when a gate fell on her on Wednesday (31st July 2024). The heartbreaking event was captured on CCTV.
Parents, keep a watch on your children, ensure the safety of your surroundings, and regularly check for potential hazards to prevent such accidents.
#Pune #TragicIncident #Ganeshnagar #PimpriChinchwad #Heartbreaking #StaySafe
Replies 207 · Reposts 324 · Likes 889 · Views 298K