

Ryousuke Yamada

@FragileGoodwill
Research Scientist @ AIST | Visiting Postdoc @ UTN | HQ @ cvpaper.challenge



[new CVPR'26 paper] 🔄 SSL works great when you have tons of data. But in 3D… we don’t. High-quality 3D scans are expensive, slow, and hard to scale. So what if we could pretrain 3D models without any real 3D scans? 1/













🚀 New arXiv preprint! PowerCLIP is the first method to align **powersets of image region subsets with textual phrase structures**, enabling fine-grained compositional and robust image-text understanding beyond simple global or token-to-patch alignment.

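A bit late to mention, but I was invited to contribute to the Overseas Research Study Abroad Advent Calendar 2025 and wrote a post!! adventar.org/calendars/12626 I wrote about house hunting and setting up life on my own in Germany (Nuremberg) 🇩🇪 It covers my real experiences on the ground, including temporary housing. note.com/ryousukeeee/n/…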

