Kaiyang Zhou

230 posts

Kaiyang Zhou banner
Kaiyang Zhou

Kaiyang Zhou

@kaiyangzhou

Assistant Professor at HKBU. Interested in machine learning & computer vision.

Katılım Mart 2015
462 Takip Edilen2K Takipçiler
Kaiyang Zhou retweetledi
Ziwei Liu
Ziwei Liu@liuziwei7·
🤩The World's Largest Human-Centric Dataset for Physical AI🤩 We @Ropedia are excited to announce 🙆Xperience-10M🙆‍♀️, *10M real human 4D interaction dataset* with - Visual Observations - Spatial Structure - Human Motion - Interaction Dynamics - Task Semantics 🌏A new data foundation for physical and spatial AI @huggingface: huggingface.co/datasets/roped…
Ropedia@ropedia_ai

Today Ropedia releases Xperience-10M at #GTC day 1 — World largest real human 4D interaction dataset at 10M scale. Each trajectory aligns: • visual observations • spatial structure • human motion • interaction dynamics • task semantics A new foundation for physical and spatial AI, try it out @huggingface huggingface.co/datasets/roped…

English
2
29
148
23.2K
Kaiyang Zhou
Kaiyang Zhou@kaiyangzhou·
🚀 Introducing Streamo: Real-time streaming video LLM mastering narration, event grounding & time-sensitive QA! Powered by Streamo-Instruct-465K dataset. Bridges offline video models to interactive assistants. #AI #ComputerVision #VideoLLM #CVPR2026 jiaerxia.github.io/Streamo/
Xia Jiaer@JiaerXia

Build a real-time AI assistant! 🎥 Streamo (to appear in #CVPR2026) enables turning any offline LLM into a live streaming video model. Key features: 💫 Real-time interaction ✨ Diverse instructions 🌟 Multi-task capable Code:github.com/maifoundations… Paper:arxiv.org/pdf/2512.21334

English
0
1
0
290
Kaiyang Zhou retweetledi
Ziqi Huang
Ziqi Huang@ziqi_huang_·
𝗧𝗵𝗲 𝗔𝗜 𝗧𝗮𝗹𝗸𝘀 will be hosting SAM 3D (@weiyaow1 ) and SAM 3D Body (@XitongYang1) from @MetaAI. 🕐 Feb 3 (Tue) - 13:00 SGT | Feb 2 (Mon) - 21:00 PST 📩 PM me for the Zoom link 🔔 Get notified of future talks @TheAITalksOrg: theaitalks.org/subscribe/
Ziqi Huang tweet media
The AI Talks@TheAITalksOrg

🎙️The AI Talks | S5E6 Introducing SAM 3D: Powerful 3D Reconstruction for Physical World Images — by Weiyao Wang @weiyaow1 & Xitong Yang. Can we "3D-fy" everything? 🌍🏃‍♂️ From universal objects to human bodies, explore the new SAM 3D family. Join us👇

English
2
6
18
4.6K
Kaiyang Zhou
Kaiyang Zhou@kaiyangzhou·
QZO can also finetune **Stable Diffusion 3.5 Large** in a single 4090 (which we believe has never been done before) The result is not perfect but encouraging (worth further investigating) excellent work by @SifengS78693
Kaiyang Zhou tweet media
English
0
0
0
74
Kaiyang Zhou
Kaiyang Zhou@kaiyangzhou·
QZO can finetune a **13B LLM** in a 4090 under an extreme quantization setting (2bits)
Kaiyang Zhou tweet media
English
1
0
0
91
Kaiyang Zhou retweetledi
SHANG Sifeng
SHANG Sifeng@SifengS78693·
Excited to share QZO, accepted at #ICLR2026🎉 QZO: 💡combines quantization + zeroth-order optimization via perturbing scales only, simple yet effective ✂️stabilizes training by directional derivative clipping with theoretical guarantees 🌟minimizes VRAM cost by up to 18x
SHANG Sifeng tweet mediaSHANG Sifeng tweet media
English
1
1
0
129
Kaiyang Zhou retweetledi
Ziqi Huang
Ziqi Huang@ziqi_huang_·
We’re honored to have Prof. Qi Dou (@QiDou_) with us next Tuesday! Join us if you’re interested in Agentic AI, Embodied AI, and Medical AI. 📍 Hybrid session: online via @TheAITalksOrg (subscribe or PM me for the Zoom link) 🏫 Onsite at @NTUsg, Arc, LHN-TR+20 (Level 1)
Ziqi Huang tweet media
English
0
4
13
1.4K
Kaiyang Zhou
Kaiyang Zhou@kaiyangzhou·
just went through the work very nice paper highly recommend
Brian Li@Brian_Bo_Li

Just read an excellent survey paper on AI agents. Measuring Agents in Production 📄 Paper: arxiv.org/pdf/2512.04123 🎬 Explained Video (Chinese): bilibili.com/video/BV1rkBNB… It gave me an epiphany about the real status of agents in production and prompted me to reflect on what makes a good survey paper. Here's my checklist for writing a good survey paper in 2026: ⭐ Novel insights – not just a literature dump ⭐ Unified understanding – synthesize, don't just summarize ⭐ Predictive vision – point to future directions that aren't yet consensus ⭐ Final acid test – if your paper could be written by frontier models with deep research, drop it.

English
1
1
3
522
Vishal Patel
Vishal Patel@vishalm_patel·
Honored to be named an IEEE Fellow for contributions to image processing, computer vision & biometrics. Also grateful to be an AAAI Senior Member and a 2025 Clarivate Highly Cited Researcher. Huge thanks to my students, mentors & collaborators! @jhuclsp @HopkinsEngineer
Vishal Patel tweet media
English
32
12
65
8.3K
Kaiyang Zhou retweetledi
The AI Talks
The AI Talks@TheAITalksOrg·
Welcome to Season 5! From Tolman’s cognitive maps to Sherrington’s neural circuits, @DengHokin will re-examine the emergence of latent representations in biological minds. 🕘 Oct 23 2025 🇸🇬SGT – 21:00 (UTC+8) 🇺🇸EST – 09:00 (UTC-4) 🔗theaitalks.org/subscribe/ for talk details!
The AI Talks tweet media
English
1
12
13
5.1K
Kaiyang Zhou
Kaiyang Zhou@kaiyangzhou·
"Measuring Epistemic Humility in Multimodal Large Language Models" A new hallucination benchmark that tests not only recognition accuracy but also humility act: whether the model can identify when no provided answer is correct maifoundations.com/blog/humbleben…
English
0
0
3
423
Kaiyang Zhou
Kaiyang Zhou@kaiyangzhou·
🔥Grounded Chain-of-Thought Makes Multimodal LLMs More Data-Efficient🔥 #ICCV2025 👀GCoT👀 injects visual grounding information into CoT, which enables MLLMs to adapt to new tasks under data-limited regimes! Blog: maifoundations.com/blog/gcot/
English
1
5
46
3K