
Dídac Surís
213 posts

Dídac Surís
@Surisdi
Research Scientist @AIatMeta. Previously a Computer Vision PhD student at @Columbia. Amateur guitarist. Tweets in Catalan, Spanish or English


the hardest problem in computer vision? occlusion - it's always occlusion

pix2gestalt: Amodal Segmentation by Synthesizing Wholes paper page: huggingface.co/papers/2401.14… synthesizes whole objects from only partially visible ones, enabling amodal segmentation, recognition, and 3D reconstruction of occluded objects

ViperGPT: Visual Inference via Python Execution for Reasoning abs: arxiv.org/abs/2303.08128 project page: viper.cs.columbia.edu




Do you have some home videos you’d like to add music to? Tomorrow at #CVPR2022 we present “It’s Time for Artistic Correspondence in Music and Video”! video: youtu.be/A4g30USxI0Q website and paper: musicforvideo.cs.columbia.edu w/ @cvondrick, Bryan Russell, @justin_salamon












It’s time to discuss: what is the best structure for representing videos and what is the way forward in video understanding? We are eager to hear your views at our #ICCV2021 workshop on Structured Representations for Video Understanding Submission: Aug 27 sites.google.com/view/srvu-iccv…










