Jackson-Yuan

612 posts

Jackson-Yuan

@Yuanvis

Co-founder @ Rayvo | POV × AI × Robotics Building the real-world data engine for embodied AI — from egocentric capture to scalable training.

SF SZ Katılım Ekim 2024

171 Takip Edilen3.7K Takipçiler

Jackson-Yuan@Yuanvis·5d

@adamraudonis @builddotai @DeepReach_AI @foxglove @GenrobotAI @gi_labs @labelbox @MeckaAI @micro1_ai @microagi @Neuracore_AI @Nominal_io @rerundotio @roboto_ai @EgoScale

QAM

194

Adam Raudonis@adamraudonis·5d

The robot data ecosystem is growing! Anyone I miss? @builddotai @DeepReach_AI @foxglove @GenrobotAI @gi_labs humanarchive.ai @labelbox @MeckaAI @micro1_ai @microagi @Neuracore_AI @Nominal_io oceanveo.ai orbifold.ai @rerundotio @roboto_ai @ropedia_ai @scale_AI senseirobotics.com surgehq.ai T* (stealth) tryasimov.ai

English

120

16.1K

Jackson-Yuan@Yuanvis·16 Mar

@spiderr_MON soon!

English

ʙʀᴀɪɴғᴜᴇʟx@spiderr_MON·15 Mar

@Yuanvis Wen Rayvo?

English

Jackson-Yuan@Yuanvis·15 Mar

Hot take: the bottleneck is no longer access to egocentric video. The real bottleneck is turning raw Egocentric behavior into training-ready data for robotics. More video doesn’t automatically mean more usable data.

English

116

Jackson-Yuan@Yuanvis·15 Mar

We recently refreshed our website at egoscale.com, with a few concrete examples of the data layer we believe robotics will need next.

English

Jackson-Yuan@Yuanvis·4 Mar

@grok is scaling Egocentric Data the definitive 'GPT-3.5 moment' for Embodied AI, and the best way to finally break the scaling law for robotics?

EgoScale@EgoScale

Not just video. Training-ready 3D human behavior. From raw POV to structured 3D trajectories & actions. Built for embodied models.

English

Jackson-Yuan retweetledi

EgoScale@EgoScale·26 Şub

We’ve reached 25K hours of real-world egocentric (POV) human activity data. Covering multiple agents × environments × strategies: the same goal, different paths; the same scene, different decisions. If your model must generalize, diversity is essential.

English

236

29.4K

Jackson-Yuan@Yuanvis·26 Şub

@DrJimFan you definitely should try Egoscale.ai

English

Jim Fan@DrJimFan·25 Şub

We trained a humanoid with 22-DoF dexterous hands to assemble model cars, operate syringes, sort poker cards, fold/roll shirts, all learned primarily from 20,000+ hours of egocentric human video with no robot in the loop. Humans are the most scalable embodiment on the planet. We discovered a near-perfect log-linear scaling law (R² = 0.998) between human video volume and action prediction loss, and this loss directly predicts real-robot success rate. Humanoid robots will be the end game, because they are the practical form factor with minimal embodiment gap from humans. Call it the Bitter Lesson of robot hardware: the kinematic similarity lets us simply retarget human finger motion onto dexterous robot hand joints. No learned embeddings, no fancy transfer algorithms needed. Relative wrist motion + retargeted 22-DoF finger actions serve as a unified action space that carries through from pre-training to robot execution. Our recipe is called "EgoScale": - Pre-train GR00T N1.5 on 20K hours of human video, mid-train with only 4 hours (!) of robot play data with Sharpa hands. 54% gains over training from scratch across 5 highly dexterous tasks. - Most surprising result: a *single* teleop demo is sufficient to learn a never-before-seen task. Our recipe enables extreme data efficiency. - Although we pre-train in 22-DoF hand joint space, the policy transfers to a Unitree G1 with 7-DoF tri-finger hands. 30%+ gains over training on G1 data alone. The scalable path to robot dexterity was never more robots. It was always us. Deep dives in thread:

English

145

283

1.7K

269.2K

Jackson-Yuan@Yuanvis·24 Şub

@Monad_APAC AI & Robotics are the future

English

Monad APAC@Monad_APAC·23 Şub

A recap from the Rebel in Paradise hacker house in Shenzen 🇨🇳 AI on Monad is just getting started.

English

110

19.3K

Jackson-Yuan retweetledi

ChatGPT@ChatGPTapp·9 Şub

ZXX

104

1.6K

241.5K

Jackson-Yuan@Yuanvis·4 Şub

@Gingiris1031 @monad I think this is just the founder path. Not solving your problem, but solving whatever the company needs next — including the weird, unsexy stuff. Once you accept that, it hurts less. And sometimes even becomes fun.

English

Gingiris@Gingiris1031·3 Şub

@Yuanvis @monad Honestly, it’s wild how the pure joy of just solving your own stuff gets swallowed fast by all the "other" hats you gotta wear. Kinda makes you wonder—how do you keep that spark alive when the floor sweeping almost becomes part of the gig? 🤔

English

Jackson-Yuan@Yuanvis·4 Eyl

Now I know why I worked like a machine gun during founder residency. @monad Back in the company, half of my time is factory runs, team issues, business dinners — one step away from sweeping the floor myself. (Guess that’s another way to collect POV.) In residency, I only solved my own problems. That was pure joy.

English

1.1K

Jackson-Yuan@Yuanvis·24 Oca

@jinglingcookies Couldn’t agree more. The gap is still real-world behavior and long-horizon adaptation. We’ve been exploring egocentric, in-the-wild data as one piece of that puzzle — curious to see how sim + real data converge. Thanks for the shoutout

English

cookies (🍪,🍪) | 饼妹@jinglingcookies·24 Oca

insightful post on the state of robotics and its main bottleneck with the limitation being real world understanding and adaptive intelligence (acting on data received), am excited to see the emergence of: 1) firms that collect data of robotics in real world e.g. in manufacturing plants 2) firms that train models in highly diverse simulated environments (sim-2-real is still a question here) @Yuanvis is working on something exciting with @rayvo_xyz; anticipating great things from the team this year :)

Rui Ma@ruima

Today I was catching up with a friend who’s spent decades in China tech and now advises several Chinese robotics companies, and both of us were confused how far perceptions lag reality. In their travels this month, they met with a Japanese CEO of an industrial robotics company and also the head of humanoid robotics at a major global consulting firm. Both said that Chinese companies were just working on dancing robots, Unitree-style. There was very little awareness that, whether or not it’s the right long-term direction, humanoid robots are already in scaled production and operating on factory lines in China today. In fact we plan to visit such lines in April and almost did last week but couldn’t make it work logistically. Additionally, while most of our time was spent on new energy and manufacturing, we also visited a few robotics companies. They were uniformly bullish that robots will replace a meaningful amount of human labor, and in many factory settings, that’s already clearly happening. Some lines were truly mostly devoid of people. The question that always comes up in these conversations BTW is hands. Dexterity. Tactile sensing. People from outside the industry tend to assume this is the hardest part. What I’ve now heard repeatedly across different companies is that, from a hardware perspective, this problem is largely solved. Highly sensitive robot hands already exist. They can handle delicate objects without deforming soft materials, and in some ways they’re already superhuman, able to detect tiny changes in texture, temperature, and weight. You see them everywhere in demo mode at trade shows and exhibitions. These systems aren’t always economical yet, but there’s strong confidence they’ll become cost-effective soon, across many more models. The real bottleneck now is intelligence. Without it, you’re left with a very precise machine that isn’t autonomous and can’t be used in a general-purpose way. Much of the hardware people imagine for future humanoid robots already exists. What’s missing in a big way is real-world understanding and fast, adaptive intelligence. We’re going deeper on this next. We’re partnering with the Shenzhen Robotics Association and attending their conference in April, and we’re putting together a robotics-focused trip from April 20th to 24th. Link in comments (and also pinned to my profile).

English

982

Jackson-Yuan retweetledi

EgoScale@EgoScale·21 Oca

We’re nearing 5,000 hours of real-world egocentric POV manipulation data. Collected across different people and real-world environments, with varied object layouts and execution styles, all from natural, unscripted first-person behavior.

English

911

Jackson-Yuan retweetledi

DogeDesigner@cb_doge·21 Oca

"AI and robotics is a supersonic tsunami, this is really gonna be the most radical change that we've ever seen." 一 Elon Musk

English

451

521

2.4K

79.9K

Jackson-Yuan@Yuanvis·16 Oca

Thanks! We now have hundreds of early users contributing egocentric data on an ongoing basis. Our first batch of datasets is in delivery, and we’re starting to scale both users and data collection more systematically. We’re on track to reach hundreds of thousands of hours of real-world egocentric data in this quarter.

English

MZ 🔶@mzhid0x·15 Oca

@Yuanvis What's the update of your progress rn 😉

English

Jackson-Yuan@Yuanvis·15 Oca

This is a promising direction — using large-scale egocentric (POV) data to push robots toward real-world generalization.

Skild AI@SkildAI

Humans learn by watching. Robots should too.

English

195

Jackson-Yuan@Yuanvis·27 Ara

@DataLust_xyz Exactly, and it takes patience and conviction to keep doing the right thing before it becomes obvious.

English

DataLust@DataLust_xyz·27 Ara

@Yuanvis 100% true, gotta build for the next 3-6 months, not the last 3-6 months

English

Jackson-Yuan@Yuanvis·27 Ara

Progress in tech isn’t linear — it’s discontinuous. One proof point changes everything. Suddenly, conversations get easier. Interest accelerates. Momentum shows up. This is why the best founders aim to be 2–3 months ahead of the market. After a long wait, we’re finally there.

English

251

Jackson-Yuan@Yuanvis·22 Ara

After experiencing FSD firsthand, I’m convinced this is the most real “robot” in production today. Not because of a single model breakthrough, but because it’s trained on massive amounts of real-world data — collected continuously, then fed back to serve users better. Reality-trained systems scale differently.

Tesla AI@Tesla_AI

FSD is trained on billions of real-world miles, including power outages 📸 @edgecase411

English

289

Jackson-Yuan@Yuanvis·21 Ara

Honest question I keep coming back to: How long will it actually take for robots to learn human-level everyday actions well enough to be genuinely useful at home?

EgoScale@EgoScale

Why does the same household task keep breaking robots? Because the world changes. At EgoScale, we collect thousands of unscripted egocentric household videos, capturing how the same task varies across objects, materials, geometries, and lighting conditions. With explicit task structure and subtask-level annotations, this data helps robots move beyond a single environment and generalize to the real world. Here is a simple example from the wild: wiping a table while holding another object, a natural bimanual coordination pattern captured by EgoScale. #VLA #Humanoid

English

233

Jackson-Yuan@Yuanvis·19 Ara

@thisismyhat data and model are all importants

Italiano

Brian Cheung@thisismyhat·18 Ara

With a sufficient model of the world, any data can be useful data

Physical Intelligence@physical_int

This also shows up in the representations learned by the model. We plot the model’s representations of human and robot images. As pre-training is scaled up, the representation of humans and robots become more aligned: to a scaled-up model, human videos "look" like robot demos.

English

3.6K

Keşfet

@adamraudonis @builddotai @DeepReach_AI @foxglove @GenrobotAI @gi_labs @labelbox @MeckaAI