Justin Strong

4.7K posts

Justin Strong banner
Justin Strong

Justin Strong

@GPTJustin

Excited about startups and robots | cooking up something new | Does a lot of hackerhouses

San Francisco, CA Katılım Mart 2015
2.5K Takip Edilen2.3K Takipçiler
Sabitlenmiş Tweet
Justin Strong
Justin Strong@GPTJustin·
Living in a hackerhouse is the best way to immerse yourself in the San Francisco AI community 🍊 Orange House 2.0 is looking for a new roommate Even if you’re not looking for housing you’ll still like hearing about the fun events we host
Justin Strong tweet mediaJustin Strong tweet mediaJustin Strong tweet media
English
18
8
101
23.3K
Justin Strong
Justin Strong@GPTJustin·
Training Pi0.5 to control the SO-100 is 65% complete and the model has seen 6,500 robot demonstrations across hundreds of different tasks. Loss is trending down from from 0.39 to 0.08 and expected to complete tomorrow. I started this project while staying in @eczachly's creator house for NVIDIA GTC, running local scale ups on my GPU rig to validate before committing to the cloud run. After driving my computer 300 miles home I discovered it no longer had functioning internet or keyboard IO. Todays task will be fixing the computer before training completes so I am able to test the model on my SO-100. I'll post results tomorrow
Justin Strong tweet mediaJustin Strong tweet mediaJustin Strong tweet media
English
3
5
49
3.2K
Justin Strong
Justin Strong@GPTJustin·
Is Claude really dumb for you today?
English
1
0
3
207
Sergey Levine
Sergey Levine@svlevine·
Didn't think that "drones with grippers" was in the cards for a likely embodiment for pi models, but there we have it. It's literally a flying gripper. But believe it or not, pi models have been used on even stranger embodiments before...
Stanford MSL@StanfordMSL

π, But Make It Fly ✈️ We fine-tuned π0, a VLA model pretrained entirely on manipulators, to fly a drone that picks up objects, navigates through gates, and composes both skills from language commands.

English
7
30
371
34.2K
Stanford MSL
Stanford MSL@StanfordMSL·
π, But Make It Fly ✈️ We fine-tuned π0, a VLA model pretrained entirely on manipulators, to fly a drone that picks up objects, navigates through gates, and composes both skills from language commands.
English
10
31
290
76.1K
Justin Strong
Justin Strong@GPTJustin·
@VMises76153 What was the data scale you used? If you created this robot yourself I’m guessing you just don’t have anywhere near enough data
English
0
0
0
8
Von Mises
Von Mises@VMises76153·
There's multiple explanations for what I found with the most likely being that I need to freeze some other set of weights because my training destroyed whatever the vla had learned. Perhaps I just need to translate the action space differently Maybe my latency from video is too high Maybe I just don't have anywhere near enough data to do this Maybe the lighting was bad There's really no telling it would take years to work through all those variables and there's a lot of things out there that can be tried. I am not a machine learning researcher, rather just a person trying to make a product and leverage the state of the art in robotic models.
English
1
0
1
13
Justin Strong
Justin Strong@GPTJustin·
The best robotics model cant control the most accessible robot arm. I’m finetuning π0.5 on 5m frames of so101 to produce an open source generalist model that anyone can run at home. I have filtered the HuggingFace community VLA dataset down to 5.4m frames of so100 data with 215 unique tasks. I’ve frozen the VLA weights to finetune only the action expert to teach π0.5 the so100 embodiment without any loss of generality. This approach is an unproven experiment as I can find no similar attempt to teach π0.5 a new embodiment while preserving generality. Training is underway, I’ll be posting updates as we make progress
Justin Strong tweet media
English
12
11
190
8.2K
Justin Strong
Justin Strong@GPTJustin·
@embedrapp As much as I hate the AI reply, I’m curious what is special about your IDE, Claude code is weak in the low level motor control department
English
1
0
1
126
Embedr
Embedr@embedrapp·
@GPTJustin finetuning on 5m frames at home?? the absolute mad lad energy here. if you need an IDE that actually understands the microcontrollers driving those joints without crashing, check out embedr.app. following for the updates.
English
1
0
2
213
Justin Strong
Justin Strong@GPTJustin·
Hey Pepijn! Are you involved in the HuggingFaceVLA community dataset? One issue I ran into was unexpectedly hitting hf rate limits downloading the 690gb dataset, I was very surprised it took 24hrs of rented h100 time to download. Next time I’ll have a better plan around dataset download
English
1
0
1
69
Justin Strong
Justin Strong@GPTJustin·
@VMises76153 That’s really interesting, what were the details of your approach or learnings?
English
1
0
0
77
Von Mises
Von Mises@VMises76153·
@GPTJustin I've tried retraining it on a new embodiment that is even more radically different than that arm and I got nowhere and it's probably that anyone else who's tried got nowhere and didn't post anything about it. But I really really really hope it works so I'll be watching
English
1
0
3
234
Justin Strong
Justin Strong@GPTJustin·
@TetraxYT Oh that’s really interesting, do you have a repo I could check out or any details?
English
0
0
0
79
Abhinav Palle
Abhinav Palle@TetraxYT·
@GPTJustin We fine tuned pi0 using Lora on 160k on a completely new embodiment. Forget generalization it succeeds at a task end to end 1 out of 32 times.
English
1
0
2
188
Ayo
Ayo@theonlyAyo·
@GPTJustin Nice, following this closely.
English
1
0
1
172
Justin Strong
Justin Strong@GPTJustin·
@soft_servo I see some task specific fine tunes on HF, but wanted something that generalizes
English
1
0
0
83
softservo
softservo@soft_servo·
@GPTJustin haven’t seen anyone successfully fine tune pi0.5 on the so101 (probably due to being much smaller, having only 5dof, and imprecision of the hardware)
English
1
0
1
108
Justin Strong retweetledi
Jack Vial
Jack Vial@jackvial89·
The π*0.6 RECAP value network training is starting to look good! It's only about 15 minutes into training. I spent most of last week studying the paper and making notes on the value network and advantage conditioning. I'm training on a 40 episode pick and place dataset for so101. 20 are successful, 20 I purposely failed trying to imitate some of the model failure modes I have seen.
Jack Vial tweet media
Jack Vial@jackvial89

i'm working on an implementation of π*0.6 RECAP. going to start with a simplified version of the full pipeline

English
8
13
116
10.8K