Ming Shan

45 posts

Ming Shan

@mingshanhee

Ph.D under Prof @sroylee at SUTD (@sutdsg) | Trustworthy and Explainable Vision-Language Approaches | Hateful Memes & Multimodal Misinformation

Singapore Katılım Haziran 2014

227 Takip Edilen97 Takipçiler

Ming Shan@mingshanhee·5 Ara

Today, MBZUAI released K2-V2, a fully transparent and open-source 70B model built entirely from scratch. I am happy to be part of this incredible team that is committed to developing transparent and reproducible models.

MBZUAI@mbzuai

Today, we are releasing a new version of K2 (K2-V2), a 360-open LLM built from scratch as a superior base for reasoning adaptation, while still excelling at core LLM capabilities like conversation, knowledge retrieval, and long-context understanding. K2 fills a major gap: highly capable models with no transparency. Instead of releasing only weights, we’re sharing the full training story — dataset recipes, mid-training checkpoints, logs, code, and evaluation tools. That’s 360-open. What’s inside: • 70B dense transformer engineered as a reasoning-enhanced base model • Native 512K context (extendable via RoPE scaling) • Mid-training reasoning phase • Strong tool-use scaffolding What we’re open-sourcing: • 250M+ reasoning traces (math, planning, multi-step logic) • Full pre- & mid-training data compositions • All mid-training checkpoints • Training logs, code, Eval360 Performance: • GPQA-Diamond: 55.1% mid-training → 69.3% after SFT (strongest fully open 70B model) • KK-8 Logic Puzzles: 83% — competitive with DeepSeek-R1 & OpenAI o3-mini-high • ArenaHard V2: 62.1% — close to Qwen3 235B • Outperforms Qwen2.5-72B and approaches Qwen3-235B despite being smaller and fully transparent. 🔗 The Model: bit.ly/3KIYwuo 🔗Technical Report: bit.ly/49V8h2U 🔗Blog: bit.ly/49V7gb6

English

249

Ming Shan@mingshanhee·26 Haz

Now there is roughly 90GB worth of photos and footage to edit (total about 2 hours)... What have I gotten myself into...

Rajvardhan Oak@oak_raj

Find yourself a @mingshanhee who captures the world so you can truly live in the moment!

English

352

Ming Shan@mingshanhee·24 Haz

2) Contrastive Instruction Fine-Tuning Large Multimodal Model for Hateful Meme Classification (ojs.aaai.org/index.php/ICWS……) - Post-train LMMs for hateful meme classification tasks - Applies Contrastive Learning for better classification boundaries

English

106

Ming Shan@mingshanhee·24 Haz

So ICWSM Main Conference Day 1 has kicked off! Catch me at "Session 1: Multimodal Meme Analysis" from 10:30 AM to 11:00 AM, where I’ll be presenting two papers.

English

709

Ming Shan@mingshanhee·24 Haz

1) Demystifying Hateful Content: Leveraging Large Multimodal Models for Hateful Meme Detection with Explainable Decisions (ojs.aaai.org/index.php/ICWS…) - Reduce reliance on model's implicit knowledge - Provide possible explanations behind models' decisions

English

Ming Shan@mingshanhee·21 Haz

Flying to attend @icwsm 2025 in Copenhagen, Denmark now! Let's talk about memes, social problems, and MAYBE have a few Danish pastries 🇩🇰✨ #ICWSM2025

English

1.2K

Ming Shan@mingshanhee·14 May

And nightmare happens... @overleaf went down... 🥲

English

3.1K

Ming Shan@mingshanhee·25 Nis

SUTD AI Day concludes with @hwaran_lee presenting on "Evaluation with socio-cultural awareness and multilingual LLMs" 🎉!

English

358

Ming Shan@mingshanhee·25 Nis

Next at SUTD AI Day, we have @pliang279 delivering keynote on "New Advances in Multimodal Reasoning"!

English

123

Ming Shan@mingshanhee·25 Nis

SUTD AI Day, held at @sutdsg, has officially begun! Starting with @preslav_nakov presenting on "Towards Truly Open, Language-Specific, Safe, Factual, and Specialized Large Language Models."

English

321

Ming Shan@mingshanhee·24 Nis

Tomorrow (technically today), @sutdsg will be hosting SUTD AI Day where @preslav_nakov @pliang279 and @hwaran_lee will deliver three amazing keynotes! See sutdaiday.github.io/sutdaiday25/ for more information!

English

441

Ming Shan@mingshanhee·23 Nis

@mohitban47 @iclr_conf @TmlrOrg Welcome to Singapore, and thanks for delivering a keynote at SSNLP 2025. Would love to catch you on postdoctoral opportunity after your keynote session 🙂

English

157

Mohit Bansal@mohitban47·22 Nis

Keynotes & workshops+symposiums' details/websites: ▪️Large Model Safety Workshop 2025 (Apr23) --> lmxsafety.com/2025/ ▪️Singapore Symposium on Natural Language Processing (SSNLP) 2025 (Apr23) --> ssnlp-website.github.io/ssnlp25/ ▪️Multimodal Gathering Workshop (NUS) 2025 (Apr27) --> TBD ▪️Co-Organizer -- Workshop on Foundation Models in the Wild (Apr27) --> fm-wild-community.github.io

English

884

Mohit Bansal@mohitban47·21 Nis

In Singapore for #ICLR2025 this week to present the papers + keynotes below 👇, and looking forward to seeing everyone -- happy to chat about research, or faculty+postdoc+phd positions, or simply hanging out (feel free to ping to set up a meeting)! 🙂 Also meet several of our awesome students/postdocs/collaborators who will be presenting this exciting batch of papers (incl. oral+spotlight)!

English

178

16.6K

Ming Shan@mingshanhee·23 Nis

Now, Dr. Faeze Brahman @faeze_brh is delivering "Open Language Model Adaptation and Reliable Evaluation"! An exciting time for open-source and reproducible models! #SSNLP2025

English

110

Ming Shan@mingshanhee·23 Nis

Prof. @VioletNPeng delivering a talk on "Controllable and Creative Natural Language Generation", lowering the barrier for the layman! #SSNLP2025

Singapore 🇸🇬 English

513

Ming Shan@mingshanhee·23 Nis

Interactive poster session with buffet lunch 😋! And also, great to have Prof. @preslav_nakov joining us right after landing in Singapore 🇸🇬! #SSNLP2025

English

128

Ming Shan@mingshanhee·23 Nis

#SSNLP2025 has started! We have Junyang Lin, Research Lead at Alibaba Qwen, sharing about "Qwen: Towards Generalist Models" right now!

Singapore 🇸🇬 English

838

Ming Shan@mingshanhee·24 Mar

Can Vision-Language Models (VLMs) answer high-school exam questions across different languages and subjects? Join the #ImageCLEF2025 Multimodal Reasoning Challenge and put them to the test! 🗓 Registration Closes April 25, 2025 – Secure your spot now! 🌐 imageclef.org/2025/multimoda…

English

277

Ming Shan@mingshanhee·22 Mar

Coming to Singapore for ICLR 2025? Join us on April 23 (Wednesday) for SSNLP 2025! Expect an exciting lineup of inspiring keynote talks and research poster sessions. FREE registration closes on April 4 — spots are limited! Website: ssnlp-website.github.io/ssnlp25/ #Singapore #SSNLP2025

English

225

Ming Shan retweetledi

Boyang "Albert" Li@AlbertBoyangLi·2 Mar

The Singapore Symposium on Natural Language Processing (ssnlp-website.github.io/ssnlp25/) will be held on 23 April (right before ICLR) on the SUTD campus. I'm serving as your humble general chair. We are now calling for student presentations of papers published and accepted in other academic conferences. The deadline is only 5 days away!

English

1.3K

Ming Shan@mingshanhee·3 Mar

@jbhuang0604 Watched the video last week. Your FlashAttention video is very helpful and educational 😀

English

131

Jia-Bin Huang@jbhuang0604·3 Mar

Yup, the SAFE softmax trick! It's so simple and elegantly solves the numerical instability! See my video to learn how we can build upon safe softmax to online softmax, and to FlashAttention. youtu.be/gBMO1JZav44?si…

YouTube

ℏεsam@Hesamation

the Softmax trick:

English

359

45K

Keşfet

@icwsm @overleaf @hwaran_lee @pliang279 @sutdsg @preslav_nakov @mohitban47 @iclr_conf