Sabitlenmiş Tweet
Nathan Yan
85 posts

Nathan Yan
@OfficialNathanY
I post about VLA content because I find it cool 17 Prev @Roboflow, @ultralytics
Katılım Ağustos 2023
516 Takip Edilen302 Takipçiler

I added KV caching and INT8 KV quantization to our transformer inference, improving throughput by 35x.
All of this was done from scratch in Rust + CUDA, on top of a homemade ML framework.
On a 4-token prompt with 252 generated tokens:
- Original: 0.76 tok/s
- KV cache fp32: 27.21 tok/s
- KV cache int8 (quantized): 27.29 tok/s
Try it out yourself here: mni-ml.github.io/demos/kv-cache/
In practice:
- KV caching gave us about a 35x end-to-end speedup
- INT8 KV cache kept roughly the same speed as fp32 but cut KV cache memory by 3.78x
FP32 cache used 4.5 MB in this run while the INT8 cache used only 1.19 MB
This simple change to inference created a huge impact on performance. To learn more about the KV cache and other optimizations like this, check out the blog at mni.ml!
English
Nathan Yan retweetledi

Nathan Yan retweetledi

1/ Introducing CLAW 🦀 — a pipeline for scalable generation of language-annotated whole-body motion data for the Unitree G1 humanoid.
Joint work with @ThomasYuxinChen at MSC Lab, UCberkeley
Code: github.com/JianuoCao/CLAW
Tech Report: arxiv.org/pdf/2604.11251
English

@rickyramosx trained a navigation MLP head in Isaac Lab and used that as the base model. Then attached Paligemma as for vision and language capabilities (basically took inspiration from Pi0)
Need to check the amount of trajectories I trained it on lol dont have it top of head rn
English

@OfficialNathanY Super cool stuff, what was the base model and how many trajectories did you train on
English

Robots don’t feel, they just follow positions.
That’s why they fail in the real world.
This paper shows what changes:
It lets robots feel force while acting.
Not just where to move, but how hard to push.
That’s the difference between demo and real work.
Thanks for sharing, @OfficialNathanY !
📍Paper: arxiv.org/pdf/2603.15169
Project: sites.google.com/view/force-vla…
Would love to hear what my friend @Klajd_Lika thinks about this.
——
Weekly robotics and AI insights.
Subscribe free: 22astronauts.com
English

@KrishivThakuria when mythos drops Ima see u coding in mountainpeaks bro
English

I’ll be joining @sentra_app in San Francisco as a Software Engineering Intern!!!
If you’ll be in the Bay Area this summer, let’s connect. (Hmu, I need to find a roommate 😭)

English













