Sabitlenmiş Tweet
metr0x
613 posts

metr0x
@metrox_eth
Monkey building robots Co-founder @SHOW_ROBOTICS | Prev. Founder @SODAmerch | Prev. Ops lead, Support lead @SnapshotLabs
Thailand, deep in robots Katılım Aralık 2024
507 Takip Edilen551 Takipçiler

Collected 3h of data in ~5h wall time with VR teleop today. Honestly feels way better than leader arms. There's never anything in your way even with weird motions, and being able to leave an arm frozen when not in use is huge.
What do you think of the setup? Tried to make it as comfortable as possible ;)
English


@earthtojake @ed0henderson Curious too, I am looking to upgrade from Roarm, I worry about the wobble tho.
English
metr0x retweetledi

@YuXiang_IRVL Awesome ! Curious about your pi0.5 fine-tune setup. Did you start from the base PI checkpoint or from a robot-specific pretrained checkpoint (so101 or similar)? Trying to figure out the right starting point for a RoArm M3 stacking task.
English

Among the policies we evaluated so far (ACT, DiT, SmolVLA, pi0, pi0.5), fine-tuned pi0.5 achieves the best performance on VLA-REPLICA.
The trend is consistent with recent simulation benchmarks such as RoboLab. The policy behaviors in the real world:
irvlutd.github.io/VLAReplica/sce…
Yu Xiang@YuXiang_IRVL
Strongly agree. In VLA-REPLICA irvlutd.github.io/VLAReplica/, we explicitly evaluate all three aspects: • object location variation • different object instances • background clutter We design the test scenes such that they are different from the demonstration distribution
English

Added a sim page to the @SHOW_ROBOTICS workshop UI today.
It's a work in progress. MuJoCo scene with six 25mm cubes laid out in a 2×3 grid. Two cameras (wrist + front).
It plays back synthetic stacking trajectories so I can watch the arm pick and place before committing GPU time to a training run. If the motion looks wrong here, no point training on it.
Most of the day went into inverse kinematics and URDF partial implementation of RoArm Gripper B.
Next: decompose the Gripper B mesh into separate STLs (base, jaws, linkage) and rig the six pivots so the jaws actually open and close in sim.
English

@khanbinan007 @SHOW_ROBOTICS Thanks ! You are always supportive, I appreciate it 🫡
English

Ye we are still early, nothing is standardized, an arxiv paper is not a good format for datasets conditions documentation, but it’s often the only source of truth. I would say chance to mismatch something is pretty much 100% 😀 It’s also hard to know what a pre-trained expert is actually capable of. Any particular edge use case is a flip of a coin. In example I am not seeing much VLA cube stacking demos, sota not clear, teleop not fully solved, force feedback an leader / torque control on motors is a must for delicate tasks. We are building at the frontiers.
English

@metrox_eth Interesting how the bottleneck ended up being systems engineering rather than the policy itself. Timing mismatches and I/O latency seem massively underestimated in robotics evals. How often do you think “model failures” are actually infrastructure and runtime issues?
English

20 pick-and-place episodes on the RoArm M3. 95% success rate. The policy was trained a week ago. We unlocked it today by fixing the runtime.
The signal: FPS counter on the eval dashboard at 12-14 while the model was trained at 20fps. Every eval was running at 60-70% of training-time inference frequency. Distribution mismatch baked in.
Profiled the robot loop. send() to the servo controllers was blocking the main thread for 20-110ms per step. Refactored to an AsyncArmWorker: serial I/O on a dedicated thread, main loop latency drops to ~0ms. 20fps stable.
Hardware: added a PCIe card with 4 Renesas USB controllers, cameras and arms on isolated buses. Removed the USB contention inflating send() variance.
Last mile: base servo offset +3° clockwise from training calibration. Tuned, re-evaled. ACT v3 025k policy, 20 consecutive episodes at 95%.
Gripper still has a residual timing quirk. Minor at this success rate, fix later.
VR teleop was gated behind 70% baseline. Cleared. SmolVLA v6 (100k full finetune, unfrozen encoder) finished cooking tonight. Next on the bench.
English

I built a real-world-to-simulation demo using a @RealSenseai stereo camera mounted on a little #ROS #AMR robot to feed my skeletal pose into a humanoid robot in RVIZ and Gazebo ❤️🤖. Stop be the RealSense booth at @Robotics_Summit & Expo next week to see it in action!
English

@CryptoBE2 $show will follow @metrox_eth working way too hard rn
English






