Ming Shan

45 posts

@mingshanhee

Ph.D under Prof @sroylee at SUTD (@sutdsg) | Trustworthy and Explainable Vision-Language Approaches | Hateful Memes & Multimodal Misinformation

Singapore · Joined June 2014
227 Following · 97 Followers
Ming Shan@mingshanhee·
Today, MBZUAI released K2-V2, a fully transparent and open-source 70B model built entirely from scratch. I am happy to be part of this incredible team that is committed to developing transparent and reproducible models.
MBZUAI@mbzuai

Today, we are releasing a new version of K2 (K2-V2), a 360-open LLM built from scratch as a superior base for reasoning adaptation, while still excelling at core LLM capabilities like conversation, knowledge retrieval, and long-context understanding.

K2 fills a major gap: highly capable models with no transparency. Instead of releasing only weights, we’re sharing the full training story: dataset recipes, mid-training checkpoints, logs, code, and evaluation tools. That’s 360-open.

What’s inside:
• 70B dense transformer engineered as a reasoning-enhanced base model
• Native 512K context (extendable via RoPE scaling)
• Mid-training reasoning phase
• Strong tool-use scaffolding

What we’re open-sourcing:
• 250M+ reasoning traces (math, planning, multi-step logic)
• Full pre- & mid-training data compositions
• All mid-training checkpoints
• Training logs, code, Eval360

Performance:
• GPQA-Diamond: 55.1% mid-training → 69.3% after SFT (strongest fully open 70B model)
• KK-8 Logic Puzzles: 83%, competitive with DeepSeek-R1 & OpenAI o3-mini-high
• ArenaHard V2: 62.1%, close to Qwen3 235B
• Outperforms Qwen2.5-72B and approaches Qwen3-235B despite being smaller and fully transparent.

🔗 The Model: bit.ly/3KIYwuo
🔗 Technical Report: bit.ly/49V8h2U
🔗 Blog: bit.ly/49V7gb6

Ming Shan@mingshanhee·
2) Contrastive Instruction Fine-Tuning Large Multimodal Model for Hateful Meme Classification (ojs.aaai.org/index.php/ICWS……)
- Post-trains LMMs for hateful meme classification tasks
- Applies contrastive learning for better classification boundaries
Ming Shan@mingshanhee·
So ICWSM Main Conference Day 1 has kicked off! Catch me at "Session 1: Multimodal Meme Analysis" from 10:30 AM to 11:00 AM, where I’ll be presenting two papers.
Ming Shan@mingshanhee·
1) Demystifying Hateful Content: Leveraging Large Multimodal Models for Hateful Meme Detection with Explainable Decisions (ojs.aaai.org/index.php/ICWS…)
- Reduces reliance on the model's implicit knowledge
- Provides possible explanations behind the model's decisions
Ming Shan@mingshanhee·
Flying to attend @icwsm 2025 in Copenhagen, Denmark now! Let's talk about memes, social problems, and MAYBE have a few Danish pastries 🇩🇰✨ #ICWSM2025
Ming Shan@mingshanhee·
And the nightmare happens... @overleaf went down... 🥲
Ming Shan@mingshanhee·
SUTD AI Day concludes with @hwaran_lee presenting on "Evaluation with socio-cultural awareness and multilingual LLMs" 🎉!
Ming Shan@mingshanhee·
Next at SUTD AI Day, we have @pliang279 delivering a keynote on "New Advances in Multimodal Reasoning"!
Ming Shan@mingshanhee·
SUTD AI Day, held at @sutdsg, has officially begun! Starting with @preslav_nakov presenting on "Towards Truly Open, Language-Specific, Safe, Factual, and Specialized Large Language Models."
Ming Shan@mingshanhee·
@mohitban47 @iclr_conf @TmlrOrg Welcome to Singapore, and thanks for delivering a keynote at SSNLP 2025. Would love to catch up with you about postdoctoral opportunities after your keynote session 🙂
Mohit Bansal@mohitban47·
In Singapore for #ICLR2025 this week to present the papers + keynotes below 👇, and looking forward to seeing everyone -- happy to chat about research, or faculty+postdoc+phd positions, or simply hanging out (feel free to ping to set up a meeting)! 🙂 Also meet several of our awesome students/postdocs/collaborators who will be presenting this exciting batch of papers (incl. oral+spotlight)!
Ming Shan@mingshanhee·
Now, Dr. Faeze Brahman @faeze_brh is delivering "Open Language Model Adaptation and Reliable Evaluation"! An exciting time for open-source and reproducible models! #SSNLP2025
Ming Shan@mingshanhee·
Prof. @VioletNPeng delivering a talk on "Controllable and Creative Natural Language Generation", lowering the barrier for the layman! #SSNLP2025
Ming Shan@mingshanhee·
Interactive poster session with buffet lunch 😋! And also, great to have Prof. @preslav_nakov joining us right after landing in Singapore 🇸🇬! #SSNLP2025
Ming Shan@mingshanhee·
#SSNLP2025 has started! We have Junyang Lin, Research Lead at Alibaba Qwen, sharing about "Qwen: Towards Generalist Models" right now!
Ming Shan@mingshanhee·
Can Vision-Language Models (VLMs) answer high-school exam questions across different languages and subjects? Join the #ImageCLEF2025 Multimodal Reasoning Challenge and put them to the test! 🗓 Registration Closes April 25, 2025 – Secure your spot now! 🌐 imageclef.org/2025/multimoda…
Ming Shan@mingshanhee·
Coming to Singapore for ICLR 2025? Join us on April 23 (Wednesday) for SSNLP 2025! Expect an exciting lineup of inspiring keynote talks and research poster sessions. FREE registration closes on April 4 — spots are limited! Website: ssnlp-website.github.io/ssnlp25/ #Singapore #SSNLP2025
Ming Shan retweeted
Boyang "Albert" Li@AlbertBoyangLi·
The Singapore Symposium on Natural Language Processing (ssnlp-website.github.io/ssnlp25/) will be held on 23 April (right before ICLR) on the SUTD campus. I'm serving as your humble general chair. We are now calling for student presentations of papers published and accepted in other academic conferences. The deadline is only 5 days away!
Ming Shan@mingshanhee·
@jbhuang0604 Watched the video last week. Your FlashAttention video is very helpful and educational 😀