Hasan Hammoud

208 posts

@hammh0a

Ph.D. candidate in Computer Vision and Machine Learning @KaustVision; Former Intern at @samsungresearch; Former Intern at @UniofOxford

🇱🇧🇸🇦 Joined March 2023
627 Following · 829 Followers
Pinned Tweet
Hasan Hammoud (@hammh0a)
Just released "DiffCLIP", extending Differential Attention proposed by @ytz2024 to CLIP models - replacing both visual & text encoder attention with the differential attention mechanism! TL;DR: Consistent improvements across all tasks with only 0.003% extra parameters!
5
10
29
24K
Hasan Hammoud retweeted
DailyPapers (@HuggingPapers)
Unveiling AraLingBench: Deep Linguistic Evaluation for Arabic LLMs
A new human-annotated benchmark of 150 expert-designed questions. It stress-tests grammar, morphology, spelling, comprehension & syntax, revealing LLMs often rely on memorization over true linguistic understanding.
1
4
8
1K
Hasan Hammoud retweeted
AI Native Foundation (@AINativeF)
1. Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
🔑 Keywords: Arabic-centric, Hala, translate-and-tune pipeline, lightweight language model, NLP
💡 Category: Natural Language Processing
🌟 Research Objective:
- Develop Arabic-centric instruction and translation models that achieve state-of-the-art results.
🛠️ Research Methods:
- Used a translate-and-tune pipeline, compression to FP8, and slerp merging, alongside fine-tuning a lightweight language model on bilingual supervision.
💬 Research Conclusions:
- Hala models, ranging from 350M to 9B parameters, deliver state-of-the-art performance on Arabic-centric benchmarks. The resources are published to further Arabic NLP research.
👉 Paper link: huggingface.co/papers/2509.14…
1
1
2
129
Hasan Hammoud retweeted
DailyPapers (@HuggingPapers)
Hala: New Arabic-centric models released on Hugging Face
A family of state-of-the-art instruction and translation models, built with a novel translate-and-tune pipeline. Achieves SOTA performance in "nano" (≤2B) and "small" (7-9B) categories on Arabic benchmarks.
1
3
16
1.1K
Hasan Hammoud (@hammh0a)
We just released Hala: open, state-of-the-art Arabic instruction & translation models! ✨
Includes:
• 1.2B Translation model (very light-weight)
• 4.6M Arabic Instruction Tuning Dataset
• 4 models (350M–9B)
📄 Paper: huggingface.co/papers/2509.14… Don't forget to upvote :)!!
🤗 Models & Data: huggingface.co/collections/ha…
💻 GitHub: github.com/hammoudhasan/H…
🙏 Big thanks to my co-authors Mohammad Zbeeb and @BernardSGhanem!
0
1
5
722
Hasan Hammoud retweeted
KAUST (@KAUST_News)
AI, decoded in under a minute. Prof. Bernard Ghanem @BernardSGhanem from #KAUST, ranked #1 in the Middle East for producing #AItalent, breaks it into four pillars. The expertise driving Saudi Arabia’s bold #AI future.
0
6
41
3.5K
Hasan Hammoud (@hammh0a)
New paper out! Train Long, Think Less. We introduce Curriculum GRPO: start with long reasoning chains, then progressively tighten token budgets to train LLMs that think better with fewer tokens.
📈 +Accuracy, 🔻Token usage, across GSM8K, MATH500 & more.
Special thanks to all co-authors: @KumailAlhamoud, Abed Hammoud, Eli Bou-Zeid, @MarzyehGhassemi, @BernardSGhanem. Amazing collaboration between @KAUST, @MIT, and @Princeton.
Paper: arxiv.org/abs/2508.08940
Code: github.com/hammoudhasan/c…
0
2
11
560
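Note on the tweet above: the idea of "progressively tightening token budgets" can be pictured as a schedule that maps a training step to a maximum completion length. The sketch below is only an illustration under stated assumptions. The tweet does not say which schedule the paper uses, so the stepwise linear decay, the function name `token_budget`, and the default values (1024 down to 256 tokens over 4 stages) are all hypothetical.

```python
def token_budget(step: int, total_steps: int,
                 start: int = 1024, end: int = 256, stages: int = 4) -> int:
    """Return a hypothetical max completion length for a given training step.

    Assumption: training is split into `stages` equal phases and the budget
    shrinks linearly from `start` to `end` across them. This is NOT taken
    from the paper; it only illustrates a tightening budget.
    """
    if total_steps <= 0 or stages < 1:
        raise ValueError("total_steps must be positive and stages >= 1")
    if stages == 1:
        return start
    # Index of the current phase, clamped so the last step stays in range.
    stage = min(step * stages // total_steps, stages - 1)
    # Linear interpolation between the starting and final budgets.
    return round(start + (end - start) * stage / (stages - 1))


if __name__ == "__main__":
    for s in (0, 25, 50, 75, 99):
        print(s, token_budget(s, total_steps=100))
```

In a GRPO-style loop, a value like this would typically cap generation length (or feed a length-aware reward) at each step; how the paper actually applies the budget is not stated in the tweet.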
Hasan Hammoud retweeted
Thao Nguyen (@thao_nguyen26)
Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔 We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats! arxiv.org/abs/2506.04689
14
59
226
35.8K
Hasan Hammoud retweeted
Gordon (Guocheng) Qian (@guocheng_qian)
📢 I am attending #CVPR2025 (Jun 11 - 14). Come to our snap-research.github.io/Omni-ID/ poster to learn how we achieved the highest ID preservation in personalization and how our follow-ups further enable expression following. See you Fri 4 - 6 pm, ExHall D Poster #326.
1
5
43
4.8K
Hasan Hammoud retweeted
Tong Zhang (@TongZhang9801)
📢 Excited to share our new paper "Motion-Aware Concept Alignment for Consistent Video Editing". A training-free framework for video semantic mixing:
🔁 Blend new concepts into specific objects
🎯 Maintain spatial stability & temporal coherence
📊 Outperform baselines
A thread🧵
4
4
18
955
Hasan Hammoud retweeted
Aleks Petrov (@AleksPPetrov)
If you work on long-context compression for LLMs, you've seen the Gisting approach: add a few "gist tokens" and adjust the attention mask so all context flows into them. Elegant and simple… But we found that it COMPLETELY BREAKS when compressing more than just a few tokens 🤯
1
2
8
2.4K
Hasan Hammoud (@hammh0a)
Excited to share our new paper "Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think"! We show that digging into an LLM’s intermediate “subthoughts” and aggregating their answers can significantly boost math reasoning performance. A thread 🧵
4
10
23
4.6K
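Note on the tweet above: "aggregating their answers" can be illustrated with a simple majority vote over the answers obtained from each intermediate subthought. This is a minimal sketch, assuming the answers have already been extracted as strings. The tweet does not name the aggregation rule, so majority voting and the helper name `aggregate_answers` are assumptions for illustration, not the paper's confirmed method.

```python
from collections import Counter
from typing import Iterable, Optional


def aggregate_answers(answers: Iterable[str]) -> Optional[str]:
    """Return the most frequent non-empty answer, or None if there is none.

    Assumption: each item is the final answer produced by continuing the
    model from one intermediate subthought (hypothetical setup).
    """
    # Normalize whitespace and drop empty or missing answers.
    cleaned = [a.strip() for a in answers if a and a.strip()]
    if not cleaned:
        return None
    # most_common(1) returns the top (answer, count) pair; ties resolve
    # in favor of the answer seen first.
    return Counter(cleaned).most_common(1)[0][0]


if __name__ == "__main__":
    print(aggregate_answers(["42", "41", "42 ", "", "40"]))
```

A majority vote is only one possible aggregator; weighting answers by position in the trace or by model confidence would be alternative design choices.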