Gabriele Goletto

33 posts

Gabriele Goletto

@GGoletto

Research Scientist @ Microsoft (Computer Vision).

Katılım Temmuz 2014

216 Takip Edilen148 Takipçiler

Gabriele Goletto retweetledi

Oier Mees@oier_mees·19 Eyl

After two fantastic years at @UCBerkeley I'm thrilled to share that I've joined @Microsoft in Zurich🇨🇭to pioneer the next generation of multimodal foundation models to drive agents 🤖 that can seamlessly interact across the digital and physical worlds 🌍 We are hiring! 🧵

English

523

55.6K

Gabriele Goletto retweetledi

Gabriele Trivigno@gabTrivv·2 Haz

🚀 As #CVPR2025 week kicks off, meet SANSA: Semantically AligNed Segment Anything 2 We turn SAM2 into a semantic few-shot segmenter: 🧠 Unlocks latent semantics in frozen SAM2 ✏️ Supports any prompt: fast and scalable annotation 📦 No extra encoders 📎 github.com/ClaudiaCuttano…

English

171

8.9K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·10 Nis

Now on ArXiv our @CVPR #CVPR2025 paper Learning from Streaming Video with Orthogonal Gradients Instead of shuffling clips, can we learn from videos fed sequentially, where you see a clip once, in order? How to deal with the correlation of gradients over training? 1/3

Tengda Han@TengdaHan

Check out our CVPR 2025 paper: arxiv.org/abs/2504.01961. Work with Dilara Gokay, Joseph Heyward, @ChuhanZhang5 , @DanielZoran_ , Viorica Pătrăucean, @joaocarreira , @dimadamen and Andrew Zisserman, @GoogleDeepMind

English

6.4K

Gabriele Goletto retweetledi

Tommie Kerssies@tommiekerssies·31 Mar

Image segmentation doesn’t have to be rocket science. 🚀 Why build a rocket engine full of bolted-on subsystems when one elegant unit does the job? 💡 That’s what we did for segmentation. ✅ Meet the Encoder-only Mask Transformer (EoMT): tue-mps.github.io/eomt (CVPR 2025) (1/6)

English

433

34.5K

Gabriele Goletto retweetledi

Gabriele Trivigno@gabTrivv·24 Mar

🔥 Our paper SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation is accepted at #CVPR2025! 🎉 We make #SegmentAnything wiser, enabling it to understand text prompts—training only 4.9M parameters! 🧠 💻 Code, models & demo: github.com/ClaudiaCuttano… Why SAMWISE?👇

English

230

16.6K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·7 Şub

🛑📢 HD-EPIC: A Highly-Detailed Egocentric Video Dataset hd-epic.github.io arxiv.org/abs/2502.04144 New collected videos 263 annotations/min: recipe, nutrition, actions, sounds, 3D object movement &fixture associations, masks. 26K VQA benchmark to challenge current VLMs 1/N

English

122

20.8K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·16 Eki

📢 Our @ACCVConf Oral It’s Just Another Day: Unique Video Captioning by Discriminitive Prompting is now on ArXiv tobyperrett.github.io/its-just-anoth… For challenging ego &timeloop movies, uniquely caption ev clip, including those near identical ones, w/out re-training captioning model 1/N

GIF

English

18.6K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·3 Eki

To all our Amigos ⁦@eccvconf⁩… We’re presenting AMEGO this morning. Poster#193 ⁦@GGoletto⁩ is ready and so am I Directions: enter poster alley opposite hpc-ai booth, walk to the end, look left… you’ll find us there

English

Gabriele Goletto retweetledi

Dima Damen@dimadamen·27 Eyl

Heading to @eccvconf #ECCV2024? You're on the academic job market for permanent position (Ass Prof - L/SL)? Already Ass Prof but considering a move? We're hiring @Bristol University in Computer Vision (Advert out soon). Ping me (DM or Email) to chat in Milan - AMA about this post

English

2.8K

Gabriele Goletto retweetledi

Alex Stoken@AlexStoken·26 Eyl

Love timm, but need to do lower-level vision tasks? Meet IMM, a collection of Image Matching Models unified by a simple, easy to use API. You can use any of 30 models (LightGlue, LoFTR, RoMa, SIFT) just by cloning the repo and changing a single parameter! github.com/gmberton/image…

English

268

21.8K

Gabriele Goletto retweetledi

AI Bites | YouTube Channel@ai_bites·24 Eyl

This work aims to detect and identify activity-centric zones in real-world conditions and leverage their domain-agnostic representations to enhance the generalization of first-person action recognition models. Paper: Egocentric zone-aware action recognition across environments Link: arxiv.org/abs/2409.14205 Project: gabrielegoletto.github.io/EgoZAR #AI #AI美女 #LLMs #deeplearning #machinelearning #3D #actionrecognition

English

313

Gabriele Goletto retweetledi

AI Bites | YouTube Channel@ai_bites·18 Eyl

AMEGO - a representation of long videos. AMEGO breaks the video into Hand-Object Interaction (HOI) tracklets, and location segments. This forms a semantic-free memory of the video. AMEGO is built in an online fashion, eliminating the need to reprocess past frames. Paper: AMEGO: Active Memory from long EGOcentric videos Link: arxiv.org/abs/2409.10917 Project: gabrielegoletto.github.io/AMEGO #AI #AI美女 #LLMs #deeplearning #machinelearning #3D

English

1.3K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·18 Eyl

📢 [New Preprint]@eccvconf #ECCV2024 paper AMEGO: Active Memory from long EGOcentric videos gabrielegoletto.github.io/AMEGO/ Semantic-free representation of all interacting objects and locations in a long egocentric video. w/ 20K VQA benchmark that uses object crops to query interactions.

English

139

10K

Gabriele Goletto retweetledi

Gabriele Berton@gabriberton·17 Haz

Our retrieval + matching pipeline for astronaut photo localization is out! Manually localized photos taken by @Space_Station astronauts are used for disaster management and research - we automate this process with EarthLoc (CVPR24) and EarthMatch (CVPRW24) earthloc-and-earthmatch.github.io

GIF

English

159

13.4K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·9 May

Waiting @eccvconf reviews - need a distraction? Check our accepted #IJCV paper "An Outlook into the Future of Egocentric Vision". Read about Stanley, our EgoDesigner using #EgoAI to design the set of a movie in 2030, or Marco the factory worker w/ #EgoAI sid2697.github.io/futureofegovis…

English

9.9K

Gabriele Goletto retweetledi

Siddhant Bansal@Sid__Bansal·18 Nis

Enabling VLMs to understand hand-object interactions in egocentric vision! We use EPIC-Kitchens & @ego4_d to train VLM4HOI. Check the paper to know how we did it arxiv.org/abs/2404.09933 Data: github.com/Sid2697/HOI-Re… Code: github.com/Sid2697/HOI-Ref Webpage: sid2697.github.io

Dima Damen@dimadamen

🔔Can VLMs spatially refer objects in Ego? Can VLMs understand interactions? Which hand is holding an object and what's the object in the left hand? We show current VLMs struggle in interactions and release new data & models for HOI-Ref in Ego sid2697.github.io/hoi-ref/ On ArXiv🧵

English

2.6K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·9 Şub

A revised version of our paper "An Outlook into the Future of Egocentric Vision" is now available. This includes the latest peer-reviewed works after our initial submission. *New* sections on Ego-Language models added and novel insights included arxiv.org/abs/2308.07123

Dima Damen@dimadamen

📢 "An Outlook into the Future of Egocentric Vision" 44 pages + 385 references survey now available on @openreviewnet We invite comments/suggestions/corrections from researchers for 30days. Major contributions will be acknowledged [instructions in 🧵] openreview.net/forum?id=V3974… 1/4

English

14K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·24 Kas

Applications now open- 2nd Summer of Research @BristolUni #MachineLearning and #ComputerVision (MaVi) group. R u PhD student with overlapping interests to us? U can visit for 3mnths this summer! Apply - DL 19/1/24 @MaVi.html" target="_blank" rel="nofollow noopener">uob-mavi.github.io/Summer@MaVi.ht… Watch 2023 cohort: youtu.be/p12V1z2Te0Y

YouTube

English

16.4K

Gabriele Goletto retweetledi

Tatiana@tommasi_tatiana·15 Ağu

📢"An Outlook into the Future of Egocentric Vision" 44 pages + 385 references survey now available on @openreviewnet We invite comments/suggestions/corrections from researchers for 30days. Major contributions will be acknowledged [instructions in 🧵] openreview.net/forum?id=V3974… 1/4

English

1.5K

Gabriele Goletto retweetledi

Dima Damen@dimadamen·15 Ağu

English

32K

Keşfet

@UCBerkeley @Microsoft @CVPR @ACCVConf @eccvconf @Bristol @Space_Station @ego4_d