KAUST Vision CAIR group

128 posts

@KAUSTVisionCAIR

KAUST VISION CAIR (Computer Vision, Core AI Research) group led by Prof. @moElhoseiny. @KAUST_News https://t.co/wpPoM4iiYj…

Thuwal, Saudi Arabia · Joined August 2020
73 Following · 346 Followers
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
The C3DV workshop @CVPR is starting at 8:15. Please join us! Where: Room 110 B, Music City Center, Nashville. The Zoom link is accessible through CVPR for registered participants. Excellent lineup of speakers!
KAUST Vision CAIR group @KAUSTVisionCAIR

Join us for the Workshop on Compositional 3D Vision (C3DV) #CVPR2025! @CVPR 🏆 Challenges: bit.ly/4c8DfDS 📢 Call for Papers: bit.ly/4c2BsQD C3DV will feature two exciting challenges and a fantastic lineup of speakers! 🧵👇

KAUST Vision CAIR group retweeted
KAUST CEMSE @KAUST_CEMSE
200+ students from across Saudi Arabia explored #AI through art, music & design at the SAAI Factory Hackathon for Kids at #KAUST. Their projects tackled inclusion, climate change & immersive learning—blending creativity with tech. Learn more: cemse.kaust.edu.sa/articles/2025/…
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
#ICLR 2025 🚀 Excited to share that three papers have been accepted at ICLR 2025! 🎉 Huge thanks to my incredibly talented students and collaborators for their dedication and hard work—this wouldn't have been possible without you!
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
@ICLR25: @KAUSTVisionCAIR’s Wenxuan Zhang and @ben_nebulous are presenting BFPO and ToddlerDiffusion this morning at posters 277 and 280, Hall 2. Stop by to learn more about carefully modeling the dichotomy of safety and helpfulness, and about more interpretable and efficient diffusion.
Mohamed Elhoseiny @moElhoseiny

#ICLR 2025 🚀 Excited to share that three papers have been accepted at ICLR 2025! 🎉 Huge thanks to my incredibly talented students and collaborators for their dedication and hard work—this wouldn't have been possible without you!

KAUST Vision CAIR group @KAUSTVisionCAIR
@CVPR @HaoSuLabUCSD @tolga_birdal 🏆 We are also hosting two 3D vision challenges this year! This year's highlight: the 3DCoMPaT-200 Challenge and the Language-Based Part Grounding Challenge, focusing on 3D object composition and understanding. 📥 C3DV Challenges: bit.ly/4c8DfDS 🧵 3/4
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
📌 Paper 3: Query-based Knowledge Transfer for Heterogeneous Learning Environments by @Norah Alballa, Wenxuan Zhang, @Ziquan Liu, Mohamed Elhoseiny, Marco Canini. ✅ This paper introduces Query-based Knowledge Transfer (QKT), a novel framework for decentralized collaborative learning
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
📌 Paper 2: Bi-Factorial Preference Optimization (BFPO): Balancing Safety-Helpfulness in Language Models by Wenxuan Zhang, Philip Torr, Mohamed Elhoseiny*, Adel Bibi*. ✅ BFPO is a supervised learning framework that reformulates RLHF’s joint objective of safety and helpfulness..
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
...into interpretable stages (e.g., contours, palettes, textures) and leverages Schrödinger Bridge optimal transport for seamless transitions between modalities, enhancing control and flexibility in the generation process.
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
📌 Paper 1: ToddlerDiffusion: Interactive Structured Image Generation with Cascaded Schrödinger Bridge by Eslam Bakr, Liangbing Zhao, Tao Hu, Matthieu Cord, Patrick Pérez, Mohamed Elhoseiny. ✅ This paper introduces a new diffusion framework that decomposes RGB image generation
KAUST Vision CAIR group retweeted
Mohamed Elhoseiny @moElhoseiny
Excellent collaboration with an amazing team at Meta. Huge thanks to @YoungXiong1 for hosting @xiaoqian_shen's internship! @liuzhuang1234, @Hu_Hsu, @garvinchen2, @klightlm, @zechunliu, @balakrishnan_vr, @Fanyi_Xiao, @hyunwoojkim, @bilgeesra, @raghuraman, @vikasc
Yunyang Xiong @YoungXiong1

🚨VideoLLM from Meta!🚨
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

📝Paper: huggingface.co/papers/2410.17…
🧑🏻‍💻Code: github.com/Vision-CAIR/Lo…
🚀Project (Demo): vision-cair.github.io/LongVU

We propose LongVU, a video LLM with a spatiotemporal adaptive compression mechanism designed for real-world hour-long video understanding. LongVU adaptively reduces the number of video tokens by leveraging (1) DINOv2 feature similarity across frames, (2) cross-modal text-frame similarity, and (3) temporal frame similarity.

1. High quality on video-based QA: 67.6% on EgoSchema, 66.9% on MVBench, 65.4% on MLVU and 59.5% on VideoMME long
2. +5% accuracy boost on average across various video understanding benchmarks compared to LLaVA-OneVision and VideoChat2
3. Our edge model, LongVU-3B, also outperformed 4B counterparts such as VideoChat2(Phi-3) and Phi-3.5-vision-instruct by a large margin.

with: @xiaoqian_shen @liuzhuang1234 @Hu_Hsu @garvinchen2 @klightlm @zechunliu @balakrishnan_vr @Fanyi_Xiao @hyunwoojkim @bilgeesra @raghuraman @moElhoseiny @vikasc

KAUST Vision CAIR group retweeted
Sara Beery @sarameghanbeery
@hannah_kerner speaking now at the @CV4E_ECCV workshop on the future of geospatial foundation models for the earth 🌍🤩
Milan, Lombardy 🇮🇹
KAUST Vision CAIR group retweeted
Sara Beery @sarameghanbeery
It's a full house!
Milan, Lombardy 🇮🇹