Justin Strong

4.7K posts

@GPTJustin

Excited about startups and robots | cooking up something new | Does a lot of hackerhouses

San Francisco, CA · Joined March 2015

2.5K Following · 2.3K Followers

Pinned Tweet

Justin Strong@GPTJustin·25 Mar

Living in a hackerhouse is the best way to immerse yourself in the San Francisco AI community. 🍊 Orange House 2.0 is looking for a new roommate. Even if you’re not looking for housing, you’ll still like hearing about the fun events we host.

Justin Strong@GPTJustin·6h

@harv_net @EcZachly They’re physically the same, no problem

h-a-r-v 🦄@harv_net·2d

@GPTJustin @EcZachly Does it work with SO-101 too?

Justin Strong@GPTJustin·3d

Training Pi0.5 to control the SO-100 is 65% complete, and the model has seen 6,500 robot demonstrations across hundreds of different tasks. Loss is trending down from 0.39 to 0.08, and training is expected to complete tomorrow. I started this project while staying in @eczachly's creator house for NVIDIA GTC, running local scale-ups on my GPU rig to validate before committing to the cloud run. After driving my computer 300 miles home, I discovered it no longer had functioning internet or keyboard IO. Today's task will be fixing the computer before training completes so I am able to test the model on my SO-100. I'll post results tomorrow.

Justin Strong@GPTJustin·6h

@basecampbernie I’m so confused how this happens

Base Camp Bernie@basecampbernie·7h

@GPTJustin Dumb for a week for me.. TurboDerp SlopCannon

Justin Strong@GPTJustin·7h

Is Claude really dumb for you today?

Justin Strong@GPTJustin·1d

@svlevine What is the stranger embodiment?

Sergey Levine@svlevine·1d

Didn't think that "drones with grippers" was in the cards for a likely embodiment for pi models, but there we have it. It's literally a flying gripper. But believe it or not, pi models have been used on even stranger embodiments before...

Stanford MSL@StanfordMSL

π, But Make It Fly ✈️ We fine-tuned π0, a VLA model pretrained entirely on manipulators, to fly a drone that picks up objects, navigates through gates, and composes both skills from language commands.

Justin Strong@GPTJustin·1d

@StanfordMSL Finetuning seriously undersells the work here

Stanford MSL@StanfordMSL·2d

Justin Strong@GPTJustin·1d

@StanfordMSL This is really fascinating, why did you choose pi0?

Justin Strong@GPTJustin·3d

@VMises76153 What was the data scale you used? If you created this robot yourself I’m guessing you just don’t have anywhere near enough data

Von Mises@VMises76153·3d

There are multiple explanations for what I found, the most likely being that I need to freeze some other set of weights because my training destroyed whatever the VLA had learned. Perhaps I just need to translate the action space differently. Maybe my latency from video is too high. Maybe I just don't have anywhere near enough data to do this. Maybe the lighting was bad. There's really no telling; it would take years to work through all those variables, and there are a lot of things out there that can be tried. I am not a machine learning researcher, just a person trying to make a product and leverage the state of the art in robotic models.

Justin Strong@GPTJustin·4d

The best robotics model can’t control the most accessible robot arm. I’m finetuning π0.5 on 5M frames of SO-101 to produce an open source generalist model that anyone can run at home. I have filtered the HuggingFace community VLA dataset down to 5.4M frames of SO-100 data with 215 unique tasks. I’ve frozen the VLA weights to finetune only the action expert, to teach π0.5 the SO-100 embodiment without any loss of generality. This approach is an unproven experiment, as I can find no similar attempt to teach π0.5 a new embodiment while preserving generality. Training is underway; I’ll be posting updates as we make progress.

Justin Strong@GPTJustin·3d

@embedrapp As much as I hate the AI reply, I’m curious what is special about your IDE. Claude Code is weak in the low-level motor control department.

Embedr@embedrapp·3d

@GPTJustin finetuning on 5m frames at home?? the absolute mad lad energy here. if you need an IDE that actually understands the microcontrollers driving those joints without crashing, check out embedr.app. following for the updates.

Justin Strong@GPTJustin·3d

@IsaacSin12 Thanks!

Isaac Sin@IsaacSin12·3d

@GPTJustin cool idea

Justin Strong@GPTJustin·3d

Hey Pepijn! Are you involved in the HuggingFaceVLA community dataset? One issue I ran into was unexpectedly hitting HF rate limits while downloading the 690 GB dataset. I was very surprised it took 24 hours of rented H100 time to download. Next time I’ll have a better plan for the dataset download.

Pepijn@pepijn2233·3d

@GPTJustin Let us know if we can help

Justin Strong@GPTJustin·3d

@VMises76153 That’s really interesting, what were the details of your approach or learnings?

Von Mises@VMises76153·3d

@GPTJustin I've tried retraining it on a new embodiment that is even more radically different than that arm and I got nowhere, and it's probable that anyone else who's tried got nowhere and didn't post anything about it. But I really, really, really hope it works, so I'll be watching.

Justin Strong@GPTJustin·3d

@TetraxYT Oh that’s really interesting, do you have a repo I could check out or any details?

Abhinav Palle@TetraxYT·3d

@GPTJustin We fine-tuned pi0 using LoRA on 160k on a completely new embodiment. Forget generalization: it succeeds at a task end to end 1 out of 32 times.

Justin Strong@GPTJustin·3d

@theonlyAyo Thanks! I’ll try to keep updates going

Ayo@theonlyAyo·3d

@GPTJustin Nice, following this closely.

Justin Strong@GPTJustin·3d

@soft_servo I see some task specific fine tunes on HF, but wanted something that generalizes

softservo@soft_servo·3d

@GPTJustin haven’t seen anyone successfully fine-tune pi0.5 on the so101 (probably due to it being much smaller, having only 5 DoF, and the imprecision of the hardware)

Justin Strong@GPTJustin·3d

@lenadroid Thanks! I’m stoked for it to finish training

Lena Hall@lenadroid·3d

@GPTJustin this is extremely cool

Justin Strong@GPTJustin·4d

@DhruvDiddi @Solo__Tech I’m not, what’s solo tech?

Dhruv Diddi@DhruvDiddi·4d

@GPTJustin Nice! Are you @Solo__Tech ?

Justin Strong@GPTJustin·4d

@jackvial89 This is so cool! Looking forward to seeing the results

Justin Strong retweeted

Jack Vial@jackvial89·6d

The π*0.6 RECAP value network training is starting to look good! It's only about 15 minutes into training. I spent most of last week studying the paper and making notes on the value network and advantage conditioning. I'm training on a 40-episode pick-and-place dataset for so101. 20 episodes are successful; in the other 20 I purposely failed, trying to imitate some of the model failure modes I have seen.

Jack Vial@jackvial89

i'm working on an implementation of π*0.6 RECAP. going to start with a simplified version of the full pipeline
