
Helen Jiang

@helenqjiang
Skild AI | Nvidia | Robotics Ph.D. @ CMU | Computer Science B.S. @ Stanford

We have acquired Zebra Technologies’ robotics arm (formerly Fetch Robotics). This is what happens when orchestration meets intelligence -- a major step toward fully autonomous warehouses. More robots. More environments. One unified brain.


Vision-language models are getting better every day. Can we use them to improve image compression? Yes! During my internship with @GoogleDeepMind and @GoogleResearch, we designed VLIC, a diffusion autoencoder post-trained with VLM preferences. Our preprint is out today! A 🧵:
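For readers curious what "post-training with VLM preferences" might look like mechanically, here is a toy PyTorch sketch, not the actual VLIC recipe: a placeholder autoencoder is nudged toward reconstructions that a stand-in preference score rates highly. `TinyAutoencoder` and `vlm_preference_reward` are invented for illustration.

```python
# Toy sketch (NOT the VLIC implementation): tune a placeholder
# autoencoder to maximize a stand-in "VLM preference" score.
import torch
import torch.nn as nn

class TinyAutoencoder(nn.Module):
    """Invented placeholder for the diffusion autoencoder."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Conv2d(3, 8, 3, stride=2, padding=1)
        self.dec = nn.ConvTranspose2d(8, 3, 4, stride=2, padding=1)

    def forward(self, x):
        return torch.sigmoid(self.dec(torch.relu(self.enc(x))))

def vlm_preference_reward(recon, original):
    """Stand-in preference score (higher = preferred). A real VLM
    judge would replace this; here it is just negative MSE."""
    return -((recon - original) ** 2).mean(dim=(1, 2, 3))

model = TinyAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)

for _ in range(10):                    # toy post-training loop
    batch = torch.rand(4, 3, 32, 32)   # fake image batch
    recon = model(batch)
    # Gradient ascent on the differentiable toy reward. Real VLM
    # preferences are typically non-differentiable and would need
    # RL-style or reward-weighted updates instead.
    loss = -vlm_preference_reward(recon, batch).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```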


Modern AI is confined to the digital world. At Skild AI, we are building towards AGI for the real world, unconstrained by robot type or task — a single, omni-bodied brain. Today, we are sharing our journey, starting with early milestones, with more to come in the weeks ahead.

Our Mission: Artificial General Intelligence grounded in the physical world. We believe AGI that can truly understand and reason in the real world can only be built through grounding in the physical world.

Our Vision: Any robot, Any task, One brain. We tackle robotics in its full generality – building a continually improving, omni-bodied brain that can control any hardware for any task.

Who are we? A passionate group of scientists & engineers driven by our shared vision. We have been researching AI and robotics for more than a decade. Our team includes pioneers of self-supervised learning, curiosity-driven exploration, end-to-end sim2real for visual locomotion, dexterous manipulation, learning from human videos, robot parkour, and many more. Many of these works have won awards at top-tier AI and robotics conferences. Our team has also built production-ready systems at Anduril, Tesla, Nvidia, Meta, Kitty Hawk, Google, Everyday Robots, and Amazon. Join us in our mission to build the robot brains of tomorrow.


1/ Exciting news, academia Twitter! 🎓🎧 A new episode of #TalkingPapersPodcast is live where I dive deep into a fresh approach to camera pose estimation. My guest? The remarkable @jasonyzhang2, a PhD student at @CMU_Robotics. Tune in 👉 youtu.be/KgHwv3Nf8rg


[1/6] What representation comes to mind when you think of a ‘camera’? Perhaps an extrinsic + intrinsic matrix? In our ICLR (oral) paper, we instead infer a distributed representation where each pixel is associated with a ray, and show SoTA results for few-view pose estimation.
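For intuition, the distributed representation is easy to write down: instead of a single (K, R, t), every pixel carries a ray. Below is a minimal NumPy sketch of the classical-to-ray conversion using standard pinhole geometry; the paper's contribution is inferring such rays directly from images, and the function name here is my own.

```python
import numpy as np

def camera_to_rays(K, R, t, H, W):
    """Convert a classical camera (intrinsics K, world-to-camera
    rotation R, translation t) into one ray per pixel: a shared
    origin (the camera center) plus a unit direction, world frame."""
    u, v = np.meshgrid(np.arange(W) + 0.5, np.arange(H) + 0.5)
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)      # (H, W, 3) homogeneous pixels
    dirs = pix @ np.linalg.inv(K).T @ R                   # back-project, rotate to world
    dirs /= np.linalg.norm(dirs, axis=-1, keepdims=True)  # unit directions
    center = -R.T @ t                                     # camera center in world frame
    return np.broadcast_to(center, dirs.shape), dirs

# Tiny example: 4x6 image, identity pose.
K = np.array([[100.0, 0.0, 3.0], [0.0, 100.0, 2.0], [0.0, 0.0, 1.0]])
origins, dirs = camera_to_rays(K, np.eye(3), np.zeros(3), H=4, W=6)
print(origins.shape, dirs.shape)  # (4, 6, 3) (4, 6, 3)
```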


🤖 Robotics often faces a chicken and egg problem: no web-scale robot data for training (unlike CV or NLP) b/c robots aren't deployed yet & vice-versa. Introducing VRB: Use large-scale human videos to train a *general-purpose* affordance model to jumpstart any robotics paradigm!
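Since VRB is pitched as a jumpstart for any robotics paradigm, a hypothetical consumer-side sketch may help: an affordance model that returns where to contact and how to move afterwards, which a downstream stack can use to seed exploration or imitation. The class, shapes, and dummy predictor below are all invented for illustration, not VRB's actual API.

```python
# Hypothetical interface sketch (invented names/shapes, not VRB's API).
from dataclasses import dataclass
import numpy as np

@dataclass
class Affordance:
    contact_heatmap: np.ndarray    # (H, W): where to interact
    post_contact_traj: np.ndarray  # (T, 2): how to move after contact

def predict_affordance(image: np.ndarray) -> Affordance:
    """Stand-in for the learned model; returns a dummy prediction."""
    H, W, _ = image.shape
    heat = np.zeros((H, W))
    heat[H // 2, W // 2] = 1.0
    traj = np.linspace([W // 2, H // 2], [W // 2 + 10, H // 2], num=5)
    return Affordance(heat, traj)

# A downstream robot stack could seed exploration/imitation with this:
img = np.zeros((64, 64, 3), dtype=np.uint8)
aff = predict_affordance(img)
contact = np.unravel_index(aff.contact_heatmap.argmax(), aff.contact_heatmap.shape)
print("contact pixel (row, col):", contact)
print("first post-contact waypoint (x, y):", aff.post_contact_traj[0])
```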


[1/4] Camera poses are essential for (neural) 3D reconstruction. But what about sparse-view settings where obtaining these via COLMAP isn’t feasible? Our ECCV paper tackles this using an energy-based formulation for predicting relative rotation (jasonyzhang.com/relpose)
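The energy-based trick is compact enough to sketch: assign an energy E to candidate relative rotations and keep the lowest, noting that p(R) proportional to exp(-E(R)) yields a full distribution that can express the multi-modal uncertainty common in sparse views. The energy function below is a dummy standing in for the learned network, which conditions on image features of the two views.

```python
# Sketch of energy-based relative rotation estimation. The energy here
# is a dummy; the learned model conditions on features of both images.
import numpy as np
from scipy.spatial.transform import Rotation

def energy(R_rel, feats_a=None, feats_b=None):
    """Stand-in for the learned energy: lower = more plausible.
    Toy choice: prefer rotations close to the identity."""
    return float(np.linalg.norm(R_rel - np.eye(3)))

# Evaluate a dense set of candidate rotations (here: random samples).
candidates = Rotation.random(4096, random_state=0).as_matrix()
scores = np.array([energy(R) for R in candidates])

R_best = candidates[scores.argmin()]           # point estimate
probs = np.exp(-scores); probs /= probs.sum()  # p(R) ~ exp(-E(R))
print("best candidate energy:", scores.min())
```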