Yiming Dou

78 posts

Yiming Dou

@_YimingDou

Ph.D. student at Cornell | Computer Vision, Multimodal, Robotics

Shanghai ↔️ New York Katılım Mart 2022

966 Takip Edilen767 Takipçiler

Sabitlenmiş Tweet

Yiming Dou@_YimingDou·13 Haz

Ever wondered how a scene sounds👂 when you interact👋 with it? Introducing our #CVPR2025 work "Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes" -- we make 3D scene reconstructions audibly interactive! yimingdou.com/hearing_hands/

English

8.2K

Yiming Dou retweetledi

Paul Liang@pliang279·18 Haz

Despite much progress in AI, the ability for AI to 'smell' like humans remains elusive. Smell AIs 🤖👃can be used for allergen sensing (e.g., peanuts or gluten in food), hormone detection for health, safety & environmental monitoring, quality control in manufacturing, and more. As a step towards AI for smell, our group is releasing **SmellNet,** a massive open dataset to advance AI smell-recognition in real-world settings. Using portable gas and chemical sensors, we collected 180,000 time steps of 50 substances (spanning nuts, spices, herbs, fruits, and vegetables) with 50 hours of data. SmellNet enables the training of AI models for real-time classification of substances based on their smell alone - see video below, where even subtle differences between cumin, cloves, and oregano can be detected. Check out our paper and open-source data & code for the smell AI revolution! paper: arxiv.org/abs/2506.00239 data & code: github.com/MIT-MI/SmellNet w Dewei, Carol, David @ddvd233 @medialab @MITEECS

English

128

15.2K

Yiming Dou retweetledi

Linyi Jin@jin_linyi·14 Haz

Hello! If you are interested in dynamic 3D or 4D, don't miss the oral session 3A at 9 am on Saturday: @zhengqi_li will be presenting "MegaSaM" I'll be presenting "Stereo4D" and @QianqianWang5 will be presenting "CUT3R"

English

1.5K

Yiming Dou@_YimingDou·14 Haz

@zizhpan Haha thanks😆

English

Zizheng Pan@zizhpan·13 Haz

@_YimingDou Very interesting research! It feels so funny when you collecting the data (the scene where you're patting the sofa) 🤣.

English

530

Yiming Dou@_YimingDou·13 Haz

English

8.2K

Yiming Dou retweetledi

Ayush Shrivastava@ayshrv·13 Haz

Excited to share our CVPR 2025 paper on cross-modal space-time correspondence! We present a method to match pixels across different modalities (RGB-Depth, RGB-Thermal, Photo-Sketch, and cross-style images) — trained entirely using unpaired data and self-supervision. Our approach learns correspondences through contrastive random walks across visual modalities. #CVPR2025 (1/6)

English

120

Yiming Dou retweetledi

Jeongsoo Park@jespark0·13 Haz

Can AI image detectors keep up with new fakes? Mostly, no. Existing detectors are trained using a handful of models. But there are thousands in the wild! Our work, Community Forensics, uses 4800+ generators to train detectors that generalize to new fakes. #CVPR2025 🧵 (1/5)

English

1.8K

Yiming Dou@_YimingDou·13 Haz

Wonderful collaboration with Wonseok Oh, Yuqing Luo, @antoniloq, @andrewhowens!!!

English

181

Yiming Dou@_YimingDou·13 Haz

Combining with our previous #CVPR2024 work TaRF (yimingdou.com/TaRF/), we create an immersive 3D scene reconstruction that allows users to interact with it using sight👀, touch👆 and sound👂.

English

171

Yiming Dou retweetledi

Daniel Geng@dangengdg·12 Haz

Hello! If you like pretty images and videos and want a rec for CVPR oral session, you should def go to Image/Video Gen, Friday at 9am: I'll be presenting "Motion Prompting" @RyanBurgert will be presenting "Go with the Flow" and @ChangPasca1650 will be presenting "LookingGlass"

English

5.3K

Yiming Dou retweetledi

Chris Rockwell@_crockwell·25 Nis

Ever wish YouTube had 3D labels? 🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with camera pose! Applications include camera-controlled video generation🤩and learned dynamic pose estimation😯 Download: huggingface.co/datasets/nvidi…

English

177

42.9K

Yiming Dou retweetledi

Yuanchen Ju@ju_yuanchen·22 Nis

🧩#CVPR2025🌷Introducing Two By Two✌️: The First Large-Scale Daily Pairwise Assembly Dataset with SE(3)-Equivariant Pose Estimation. 🤖2BY2 helps robots master daily 3D assembly tasks—like plugging sockets or arranging flowers—across diverse objects! 🐨Co-lead by @yuqi_Beijing

English

10.8K

Yiming Dou@_YimingDou·27 Mar

Thanks to @OpenAI, got a chance to grow up again in Ghibli anime🤗

English

624

Yiming Dou retweetledi

Sarah Jabbour@SarahJabbour_·15 Oca

I’m on the PhD internship market for Spr/Summer 2025! I have experience in multimodal AI (EHR, X-ray, text), explainability for image models w/ genAI, clinician-AI interaction (surveyed 700+ doctors), and tabular foundation models. Please reach out if you think there’s a fit!

English

6.5K

Yiming Dou retweetledi

Yuanchen Ju@ju_yuanchen·8 Ara

🍌We present DenseMatcher！ 🤖️DenseMatcher enables robots to acquire generalizable skills across diverse object categories by only seeing one demo, by finding correspondences between 3D objects even with different types, shapes, and appearances.

English

116

24.2K

Yiming Dou retweetledi

Daniel Geng@dangengdg·4 Ara

What happens when you train a video generation model to be conditioned on motion? Turns out you can perform "motion prompting," just like you might prompt an LLM! Doing so enables many different capabilities. Here’s a few examples – check out this thread 🧵 for more results!

English

146

673

94.5K

Yiming Dou retweetledi

Junyi Zhang@junyi42·7 Eki

Excited to share MonST3R! -- a simple way to estimate geometry from unposed video of dynamic scene We achieve competitive results on several downstreams (video depth, camera pose) and believe this is a promising step toward feed-forward 4D reconstruction monst3r-project.github.io

English

138

726

131.4K

Yiming Dou retweetledi

Zichen Wang@Zichen2501·30 Eyl

Differentiable rendering made SIMPLE❗️ Differentiating physically based renderers is hard: Dirac-delta discontinuities arise at object silhouette. Our #SIGGRAPHAsia2024 work shows how a simple relaxation can rescue the day, enabling easy 3D reconstruction and relighting! (1/N)

English

349

45.3K

Keşfet

@ddvd233 @medialab @MITEECS @zhengqi_li @QianqianWang5 @zizhpan @antoniloq @andrewhowens