mehul

35 posts

mehul banner
mehul

mehul

@emptysaysstuff

one more day at a time... i love robots, vision, llms and jokes

Mumbai, India Katılım Şubat 2022
76 Takip Edilen28 Takipçiler
mehul retweetledi
mehul
mehul@emptysaysstuff·
LLM training wastes 4GB per batch on a tensor it throws away immediately. i built the fix for JAX, a maintainer reviewed the code, and its live on PyPI now. here’s how it works.
mehul tweet media
English
4
3
39
3.3K
Tirth Gada
Tirth Gada@tirth_gada·
EchoJEPA learns ultrasound physics, not just hearts. So I asked: can cardiac echo SSL features transfer to a completely different organ? Built a small experiment to find out. Thread
English
3
5
31
567
mehul
mehul@emptysaysstuff·
cross-entropy loss creates a score for every token in the vocabulary (128K+ for Llama-3). you use it once and discard it. the fix: process it in chunks, keep a running total. same math, 97% less memory. opened a PR on JAX, the issue author asked for more features, built those too
mehul tweet media
English
1
1
11
324
vrushtee
vrushtee@vruga_·
wrote a blog on how i got started with robotic learning. how lerobot, a 3d printed arm and arxiv is *more* than enough to work on frontier robotics research.
vrushtee tweet media
English
18
50
727
37.6K
mehul
mehul@emptysaysstuff·
i love how videos like these pop up on my feed to just make me realize how useless i am as a student of engineering... creds: youtube.com/watch?v=aHFo-7…
YouTube video
YouTube
mehul tweet media
English
0
0
1
51
mehul retweetledi
Robots Digest 🤖
Robots Digest 🤖@robotsdigest·
π0.6 from Physical Intelligence is NOT another robot gimmick. It’s a Vision-Language-Action model trained with real sensory feedback to learn from its own experiences, not just imitate. That’s embodied learning, not scripted motion.
English
24
133
905
41.2K