Ville🤖

1.9K posts

@VilleKuosmanen

gentleman scientist 🤖 @voyagerobotics

London, UK · Joined May 2020
784 Following · 3K Followers
Pinned Tweet
Ville🤖@VilleKuosmanen·
A new AI model achieved my highest repeatability so far, at around an 85% success rate. If the objects are at the centre and easily visible and reachable, this increases further
19 replies · 15 reposts · 221 likes · 26.1K views
Ville🤖@VilleKuosmanen·
ok so we're gonna need a bigger disk
[image]
1 reply · 0 reposts · 3 likes · 290 views
Ville🤖@VilleKuosmanen·
@ivanburazin @DrataHQ the previous startup I worked at had the same journey: a dedicated expert worked on SOC 2 for over a year. I was surprised how many startups had it so quickly, now we know!
0 replies · 0 reposts · 0 likes · 1.7K views
Ivan Burazin@ivanburazin·
It took our compliance officer close to a year to get SOC 2 done. In the meantime, I saw even 3-month-old startups getting it with ease. After a point I got frustrated: “Why the hell are we using @DrataHQ? Let's try these guys and get it done!” He kept saying “It's just not possible. Leave me alone, I'll get it done.” I was still furious but let him do it his way. It seems he was right after all.
[image]
34 replies · 21 reposts · 581 likes · 65.7K views
Ville🤖@VilleKuosmanen·
@JulianSaks my repo was a bit behind but they did release a few commits recently
0 replies · 0 reposts · 1 like · 71 views
JulianSaks@JulianSaks·
@VilleKuosmanen Didn’t see any big changes in recent commits? Am I missing something? 😅
1 reply · 0 reposts · 0 likes · 98 views
Ville🤖@VilleKuosmanen·
I really want Cursor and other teams building models not controlled by the big labs to succeed and drive competition on API pricing! Cursor just started with their own models, so it's probably better to judge the gradient of model improvement rather than absolute performance
0 replies · 0 reposts · 0 likes · 111 views
Ville🤖@VilleKuosmanen·
would love to say good things about Composer 2 Flash but 1st impression is... not good 🫠
2 replies · 0 reposts · 1 like · 348 views
Ville🤖@VilleKuosmanen·
testing the official TOPReward implementation against the one I vibe-coded in an evening, looks like fixing the few discrepancies produces a slightly better and more robust reward signal 🦾 @DJiafei
[two images]
0 replies · 0 reposts · 7 likes · 917 views
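The tweet above describes cross-checking a hand-rolled reward implementation against the official one. That workflow can be sketched generically: evaluate both implementations on the same batch of inputs and measure their largest disagreement. The real TOPReward code is not shown in the thread, so `reference_reward`, `reimplemented_reward`, and their inputs below are purely hypothetical stand-ins.

```python
import random

# Hypothetical stand-ins: the real TOPReward code is not shown in the
# thread, so both reward functions below are illustrative only.
def reference_reward(progress: float, velocity: float) -> float:
    # Toy reward: progress toward the goal, lightly penalising speed.
    return progress - 0.1 * velocity ** 2

def reimplemented_reward(progress: float, velocity: float) -> float:
    # A re-implementation with a deliberate bug in the velocity penalty.
    return progress - 0.2 * velocity ** 2

def max_discrepancy(ref, impl, n=1000, seed=0):
    """Evaluate both implementations on the same random inputs and
    return the largest absolute disagreement between them."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(n):
        p, v = rng.uniform(0.0, 1.0), rng.uniform(-1.0, 1.0)
        worst = max(worst, abs(ref(p, v) - impl(p, v)))
    return worst

print(max_discrepancy(reference_reward, reimplemented_reward))
```

Driving this discrepancy to zero on a shared batch of inputs is a cheap way to confirm two implementations agree before trusting either as a training signal.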
Ville🤖@VilleKuosmanen·
@chetan_ If I strap a GoPro on my cat will you buy the data? Useful for training robot dogs maybe 🤔
1 reply · 0 reposts · 1 like · 52 views
Ville🤖@VilleKuosmanen·
has anyone implemented an open-source version of @ToyotaResearch's LBMs? Would love to see more tests comparing LBM-style scaled-up diffusion policies against pre-trained foundation models like pi 🤔
[image]
3 replies · 0 reposts · 16 likes · 1.7K views
Ville🤖@VilleKuosmanen·
@chetan_ also wash and clean stations for grippers and hands
1 reply · 0 reposts · 1 like · 95 views
Chetan@chetan_·
someone needs to come out with windshield wipers for robot cameras
5 replies · 0 reposts · 11 likes · 601 views
Ville🤖@VilleKuosmanen·
me, heads down building, watching everyone else have fun at GTC 🥲
[image]
2 replies · 0 reposts · 8 likes · 418 views
Ville🤖@VilleKuosmanen·
@animesh_garg @bznotes I like the custom screwdriver end effector integrated into the policy! The policy needs to adapt to different end-effectors, not just grippers, and tolerate changes in the wrist camera angle (the camera is farther away and points at a sharper angle than on the gripper)
0 replies · 0 reposts · 0 likes · 143 views
Animesh Garg@animesh_garg·
looking at this video, this is mostly position-guided control (the object was dropped rather than placed!). I believe the state of the art is a little beyond this, but to their credit this was a live running demo at a conference, so they likely picked an example with higher reliability.
1 reply · 0 reposts · 10 likes · 1.8K views
Ville🤖@VilleKuosmanen·
@ClementDelangue For the Hub, ways to pay for higher rate limits without upgrading to Enterprise would be great (e.g. an optional paid add-on to Pro). Building on top of LeRobotDatasets can require downloading lots of data in spikes, which burns through resolver requests quickly 😅
[image]
0 replies · 0 reposts · 0 likes · 59 views
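For context on the rate-limit pain described above, the standard client-side mitigation is retrying with exponential backoff and jitter. A minimal sketch: `RateLimitError` and `fake_download` are hypothetical stand-ins for whatever a real Hub client raises and does, not part of any actual API.

```python
import random
import time

class RateLimitError(Exception):
    """Stand-in for whatever error a client raises on HTTP 429."""

def with_backoff(fn, max_retries=5, base_delay=1.0):
    """Retry fn() with exponential backoff plus jitter, the usual
    client-side answer to hitting a rate limit during download spikes."""
    for attempt in range(max_retries):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_retries - 1:
                raise  # give up after the last attempt
            # Delay doubles each attempt; jitter spreads out retries.
            time.sleep(base_delay * 2 ** attempt * (1 + random.random()))

# Demo: a fake download that is rate-limited twice before succeeding.
calls = {"n": 0}
def fake_download():
    calls["n"] += 1
    if calls["n"] < 3:
        raise RateLimitError
    return "dataset shard"

print(with_backoff(fake_download, base_delay=0.01))  # dataset shard
```

Backoff only smooths over short spikes; a paid higher rate limit, as the tweet asks for, is the real fix for sustained heavy downloads.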
clem 🤗@ClementDelangue·
5. Anything else you want me to know, challenge, or pay attention to?
4 replies · 0 reposts · 10 likes · 4.6K views
clem 🤗@ClementDelangue·
Just sent these questions to the HF team after paternity leave - would love the community's take too 👇
8 replies · 4 reposts · 120 likes · 40.2K views
Ville🤖@VilleKuosmanen·
@mstockton One correction: LLMs are getting good at coding thanks to RL post-training, not because they have access to lots of code tokens during pre-training. Coding has some of the easiest verifiable success/fail conditions (along with maths), which makes it ideal for RL post-training
2 replies · 0 reposts · 4 likes · 910 views
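The point about verifiability can be made concrete: for a coding task, the reward is simply "do the unit tests pass?". A minimal sketch of such a binary reward; the function name and harness here are my own illustration, not any lab's actual RL pipeline.

```python
import subprocess
import sys
import tempfile

def verifiable_code_reward(solution: str, tests: str, timeout: float = 10.0) -> float:
    """Binary reward for a model-written solution: 1.0 if its unit
    tests pass in a subprocess, 0.0 on any failure, crash, or timeout."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(solution + "\n\n" + tests + "\n")
        path = f.name
    try:
        result = subprocess.run([sys.executable, path],
                                capture_output=True, timeout=timeout)
        return 1.0 if result.returncode == 0 else 0.0
    except subprocess.TimeoutExpired:
        return 0.0

# A model-generated candidate and the tests that define success:
solution = "def add(a, b):\n    return a + b\n"
tests = "assert add(2, 3) == 5\nassert add(-1, 1) == 0\n"
print(verifiable_code_reward(solution, tests))  # 1.0: the tests pass
```

Maths works the same way (check the final answer), which is why both domains dominate RL post-training: the success signal is cheap, objective, and automatable.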
Matt Stockton@mstockton·
This is also very bitter-lesson-pilled. My stream of consciousness here:
- We found out that LLMs are *extremely* good at writing code. There is just so much code in the training data, and it turns out that matters a lot for a thing that generates next tokens.
- Code can solve lots of problems. Maybe almost all of the ones we have, if the code is assembled correctly.
- Based on the two things above, we can now solve lots of problems, and generating code is the highest-utility thing a model can do right now.
- But people don't want to look at code, and in fact putting code in their view causes friction. Code is the wrong level of abstraction for most people.
- Foundation model companies know this, and are actively working to find the right abstraction layer that is still code but not presented as code to the user.
- Thus, Claude Cowork, which improves the value of the abstraction layer for *most* users while compromising the utility *just* a little bit. It improves the reach of these tools without giving up that much value.
- This has all happened in a period of only 3 months, because we are at the exponential.
- Thus, it's just going to happen again, in a shorter timeframe.
So basically, yes and no. Some of this stuff is 'wasted learning' because things won't be the same in 2 months. But you can't not do it, because it's learning that feeds directly into the next thing. It's very weird that you both *need* to learn the new thing to take advantage of the new new thing, and yet the new thing is only going to get you 3 months. Welcome to the exponential, I guess.
Brett Caughran@FundamentEdge

Having spent a lot of the last two weeks experimenting with different user interfaces for Claude Code (including via terminal, Cursor, and VS Code), one of my primary observations is how frustrating it can be to get up to a baseline level of operability with these tools. Just the basics have taken me many hours, many YouTube videos, and numerous times cursing into an LLM-as-tutor, “I said explain it to me like a 12-year-old”. Maybe I'm dense; I'm certainly not very technically proficient, and my brain goes haywire staring at any coding language, to be quite honest. But perhaps I'm a litmus test of the average end user of these tools (a user base that needs someone to help them build their Bloomberg launchpad, after all), and in that sense I doubt rolling out VS Code across many users in an investment organization is a viable plan to drive consistent adoption of AI.

The good news is how fast the user experience is evolving. I contrast that maddening experience with the much more user-friendly experience of using Claude Co-Work, and even Claude Code directly in the desktop app, and particularly the incredible usefulness of agentic work platforms like Perplexity Computer as a delightful and powerful user experience (not sponsored), and wonder if that is indicative of what's to come: multi-LLM agentic workflow tools tied to your select MCPs/APIs, your internal data and notes, and trained via natural language to create a Skills architecture that works the way you work (much like investors set up their Bloomberg launchpads right now, i.e. in a heterogeneous fashion unique to their specific workflow). Do I lose something by not being close to the code? I'm not sure I even know what that means (but all the arrogant YouTubers tell me it's important). But I know I'm shocked by the usefulness of the outputs I'm getting out of Claude Co-Work and Perplexity Computer, and that's what matters to me.

For that reason, I wonder if learning VS Code for investors is akin to learning to construct prompts by hand in summer 2025, i.e. mostly wasted learning that will quickly be subsumed by the next iteration of technology, at least for the vast majority of users (i.e. investors in a front-office seat). It seems like we are very close to a level of intuitive operability with agentic work systems.

13 replies · 12 reposts · 292 likes · 80.1K views
Ville🤖@VilleKuosmanen·
Defence tech funding per capita in the Nordics (via Yle). 🇫🇮 Finland is one of the few Western countries maintaining conscription, and it has a historically strong industrial base and hardware successes. A natural fit for defence tech!
[image]
0 replies · 1 repost · 7 likes · 347 views