MOAYED HAJi ALi

67 posts

MOAYED HAJi ALi

MOAYED HAJi ALi

@MoayedHaji

Katılım Ağustos 2021
143 Takip Edilen67 Takipçiler
Sabitlenmiş Tweet
MOAYED HAJi ALi
MOAYED HAJi ALi@MoayedHaji·
Great news from #CVPR2024 🎉🎉🎉 Happy to share that our paper ElasticDiffusion: Training-free Arbitrary Size Image Generation was accepted @CVPR. Big thanks to my collaborators @bluevincent and Guha Balakrishnan. Checkout more details from here: elasticdiffusion.github.io
MOAYED HAJi ALi tweet media
English
2
5
21
2.3K
The Telegraph
The Telegraph@Telegraph·
🔴 BBC removed references to ‘Jews’ and ‘jihad’ in Gaza documentary
English
510
2.4K
11K
3.7M
MOAYED HAJi ALi retweetledi
Ahmed Masry
Ahmed Masry@Ahmed_Masry97·
Happy to announce AlignVLM📏: a novel approach to bridging vision and language latent spaces for multimodal understanding in VLMs! 🌍📄🖼️ 🔗 Read the paper: arxiv.org/abs/2502.01341 🧵👇 Thread
Ahmed Masry tweet media
English
2
55
211
22.3K
MOAYED HAJi ALi retweetledi
Tsai-Shien Chen
Tsai-Shien Chen@tsaishien_chen·
Introducing ⚗️ Video Alchemist Our new video model supporting 👪 Multi-subject open-set personalization 🏞️ Foreground & background personalization 🚀 Without the need of inference-time tuning snap-research.github.io/open-set-video… [Results] 1. Sora girl rides a dinosaur on a savanna 🧵👇
English
2
39
230
31.4K
MOAYED HAJi ALi retweetledi
Ivan Skorokhodov
Ivan Skorokhodov@isskoro·
Diffusion models are very strong and robust feature extractors, but recent works were only using them for recognition tasks. In our recent work (led by @MoayedHaji), we harness them for video2audio generation: they by far outperform conventional video feature extractors for audio/video temporal alignment and allow to achieve SotA results in sound quality as well: snap-research.github.io/AVLink/
MOAYED HAJi ALi@MoayedHaji

1/6 Introducing AV-Link, an approach to connect video and audio diffusion models in a self-contained framework to enable video-to-audio and audio-to-video generation with superb audio-video synchronization. Project page: snap-research.github.io/AVLink

English
0
1
13
1.4K
MOAYED HAJi ALi retweetledi
Alper Canberk
Alper Canberk@alpercanbe·
Can we use the intermediate representations of pretrained video generation models to generate audio, and vice versa? 🤔🔀 It turns out that this approach can be powerful. Introducing AV-Link, an approach to connect video and audio diffusion models in a self-contained framework
English
1
3
15
1.9K
MOAYED HAJi ALi
MOAYED HAJi ALi@MoayedHaji·
5/6 For the first time, our approach shows audio-synchronized video generation for “in-the-wild” scenarios, surpassing existing methods in both generation quality and semantic and temporal alignment (#A2V_upscaled_w_prompt" target="_blank" rel="nofollow noopener">snap-research.github.io/AVLink/a2v_w_p…).
English
1
0
0
80
MOAYED HAJi ALi
MOAYED HAJi ALi@MoayedHaji·
1/6 Introducing AV-Link, an approach to connect video and audio diffusion models in a self-contained framework to enable video-to-audio and audio-to-video generation with superb audio-video synchronization. Project page: snap-research.github.io/AVLink
English
1
1
8
1.6K
Another_Daniel
Another_Daniel@anonperson09·
@xwang_lk The fact that 20% of academic discourse on twitter is about review procedures makes me really not want to go into academia. Is it like this irl too?
English
1
0
1
346
Xin Eric Wang
Xin Eric Wang@xwang_lk·
I remember one time while reviewing for an NLP conference, not only could the AC see the reviewer's identities, but the reviewers could also see each other's identities. It worked like a charm. Transparency and accountability are crucial to improve reviewing quality.
English
6
3
90
12K
MOAYED HAJi ALi retweetledi
Ahmed Masry
Ahmed Masry@Ahmed_Masry97·
🧵 1/4 Introducing ColFlor: An Efficient, OCR-Free Vision-Language Document Retrieval Model 🌟 Earlier this year, ColPali transformed document retrieval by removing error-prone OCR pipelines, creating embeddings directly from images. However, its 3 billion parameters make it computationally expensive. That’s where ColFlor comes in! ⚡👇 #AI #NLP huggingface.co/blog/ahmed-mas…
English
2
20
68
9.2K
Nuseir Yassin
Nuseir Yassin@nasdaily·
I am excited to announce that I had my first Free Palestine protest come to a Nas Daily meetup. I told them I agreed with them. I also want a Free Palestine from Hamas. Free Palestine from terrorism. Free Palestine from radical religion. They disagreed. They just wanted a Free Palestine from Jews. Oh well.
English
3.6K
2K
15.5K
4.7M
MOAYED HAJi ALi retweetledi
Zilin Xiao
Zilin Xiao@ZilinXiao2·
I am excited to share that two of our research works will be presented at ECCV 2024. #ECCV2024 They focus on augmenting language models with fine-grained visual recognition ability. AutoVER made successful attempts at generative visual recognition. It was accepted to the ECCV 2024 main conference and was invited to the ILR Workshop as an oral presentation. Collaboration w/ @pcascanteb @vislang #Microsoft Extractive Reranker was accepted to the ILR Workshop as a poster. We explored how the long-context sequence modeling ability of language models can benefit image retrieval, a fundamental computer vision problem.
Zilin Xiao tweet mediaZilin Xiao tweet mediaZilin Xiao tweet media
English
0
4
8
1.1K
إيران بالعربية
إيران بالعربية@iraninarabic_ir·
غادروا مدنكم! فجيش أبابيل قادم للانتقام. #اسماعيل_هنية
إيران بالعربية tweet media
العربية
3.5K
2.4K
22.2K
3.7M
MOAYED HAJi ALi retweetledi
MOAYED HAJi ALi
MOAYED HAJi ALi@MoayedHaji·
@ftm_guney Honestly it is so crowded that it might be not very safe for kids to be here
English
0
0
0
149
F. Güney
F. Güney@ftm_guney·
I’m not at CVPR but still following the news here. incredibly happy to see friends and mentors getting awards, congrats to Angjoo, Andrea, and Andreas’s team! sad to see kids are not allowed at CVPR, like why? looking forward to reading the papers bookmarked after the holiday.
English
2
0
14
2.7K