

Gabriele Goletto
33 posts

@GGoletto
Research Scientist @ Microsoft (Computer Vision).





Check out our CVPR 2025 paper: arxiv.org/abs/2504.01961. Work with Dilara Gokay, Joseph Heyward, @ChuhanZhang5 , @DanielZoran_ , Viorica Pătrăucean, @joaocarreira , @dimadamen and Andrew Zisserman, @GoogleDeepMind















🔔Can VLMs spatially refer objects in Ego? Can VLMs understand interactions? Which hand is holding an object and what's the object in the left hand? We show current VLMs struggle in interactions and release new data & models for HOI-Ref in Ego sid2697.github.io/hoi-ref/ On ArXiv🧵


📢 "An Outlook into the Future of Egocentric Vision" 44 pages + 385 references survey now available on @openreviewnet We invite comments/suggestions/corrections from researchers for 30days. Major contributions will be acknowledged [instructions in 🧵] openreview.net/forum?id=V3974… 1/4







