Rex Zhang
749 posts

Rex Zhang
@RexDQZhang
Building mode, AI/ Robotics @PathOn_Robotics, previously researcher @AmazonScience, PhD @UCBerkeley, undergrad @PKU1898, 1k+ citations
San Francisco, CA Katılım Kasım 2014
38 Takip Edilen931 Takipçiler

Welcome to the official PathOn Robotics 'Testing Facility' (aka the robot gym). Also known as the corner of my basement right next to my gym.
We've been running our custom sensor/software on the Go2 for our initial GTM. Bypassing the factory brain and injecting an enterprise-grade OS requires zero fancy office space—just a clear floor, a bright light, and absolute focus.
Hyper capital-efficient build mode. Let's keep shipping.🤖🛠️


English

Six months in: tight team, design partners, a working product, live demos — all from a dining table and a basement. Every dollar goes into the build.
Sharing it all openly from here. 🚀
#buildinpublic #solofounder #robotics
English

Excited to share the first version of our in-house dexterous hand design at @pathon_robotics ! 🦾
We're building a unified dexterous manipulation pipeline that works across multiple hand platforms — both off-the-shelf hands and our own design, which
we're now prototyping.
Next week, we'll integrate eFlesh, a low-cost tactile sensor, to give the system rich contact feedback for fine manipulation. @Raunaqmb
The goal: a hardware-agnostic pipeline where you can plug in your preferred hand + sensor stack and get dexterous manipulation out of the box.
More updates coming soon 👇
English

Fully open-sourced — reproduce the whole upgrade:
🧩 STL + STEP for every part
📋 Bill of Materials
🔧 Step-by-step assembly guide
📸 Print orientation diagrams
github.com/PathOn-AI/path…
What should we grasp next? 🤖
English
Rex Zhang retweetledi

Introducing Hi Robot – Hierarchical Interactive Robot
Our first step at @physical_int towards teaching robots to listen and think harder.
A 🧵 on how we make robots more steerable 👇
English

My Chinese name is Danqing. Uber/Lyft always pronounced it as either "DanKing" or "DanQueen." I started telling people yeah I'm DanKing
When I started my company I needed an English name people would remember. Tried "Danni" first. Too soft. Not me.
I always loved the name Rex 🦖 But it's a guy's name. Spent time looking for a female version.
Then I thought — who says I can't be Rex? I'm already DanKing 👑
So Rex it is. So Rex it is.
English
Rex Zhang retweetledi

The power of the Claw, in the palm of a robot hand. Agentic robotics is here! Today, we open-source CaP-X: vibe agents, alive in the physical world. They incarnate as robot arms and humanoids with a rich set of perception APIs, actuation APIs, and auto synthesize skill libraries as they go. CaP-X is a strict superset of our old stack, because policies like VLAs are “just” API calls as well. It solves many tasks zero-shot that a learned policy would struggle with.
And we are doing much more than vibing. CaP-X is our most systematic, scientific study on agentic robotics so far:
- We build a comprehensive agentic toolkit: perception (SAM3 segmentation, Molmo pointing, depth, point cloud), control (IK solvers, grasp planner, navigation), and visualization (EEF, mask overlays) that work across different robots.
- CaP-Gym: LLM’s first Physical Exam! 187 manipulation tasks across RoboSuite, LIBERO-PRO, and BEHAVIOR. Tabletop, bimanual, mobile manipulation. Sim and real. Can’t wait to see the gradients flow from CaP-Gym to the next wave of frontier LLM releases.
- CaP-Bench: we benchmark 12 frontier LLMs/VLMs (Gemini, GPT, Opus, Qwen, DeepSeek, Kimi, and more) across 8 evaluation tiers. We systematically vary API abstraction level, agentic harness, and visual grounding methods. Lots of insights in our paper.
- CaP-Agent0: a training-free agentic harness that matches or exceeds human expert code on 4 out of 7 tasks without task-specific tuning.
- CaP-RL: if you get a gym, you get RL ;). A 7B OSS model jumps from 20% to 72% success after only 50 training iterations. The synthesized programs transfer to real robots with minimal sim-to-real gap.
3 years ago, our team created Voyager, one of the earliest agentic AI that plays and learns in Minecraft continuously. Its key ideas — skill libraries, self-reflection loops, and in-context planning — have since influenced many modern agentic designs.
Today, the agent graduates from Minecraft and gets a real job. It’s April Fool’s, but this Claw is getting its hands dirty for real!
Link in thread:
English
Rex Zhang retweetledi
Rex Zhang retweetledi

🤲Tactile sensing is powerful for robot manipulation, but hardware is still difficult to access, reproduce, and scale.
🎯That’s why we built FlexiTac: an open-source, low-cost, and scalable tactile sensing solution designed for real robotic systems.
• Project page: flexitac.github.io
We hope FlexiTac can help democratize tactile sensing for robotics research. (1/n)
English








