Wei Lin @ CVPR 2025
73 posts

Wei Lin @ CVPR 2025
@WeiLinCV
Research associate @ ELLIS Unit, LIT AI Lab, Institute for Machine Learning, JKU Linz. Collab with MIT-IBM Watson AI Lab. PhD@TU Graz


Is basic image understanding solved in today’s SOTA VLMs? Not quite. We present VisualOverload, a VQA benchmark testing simple vision skills (like counting & OCR) in dense scenes. Even the best model (o3) only scores 19.8% on our hardest split.

IPLOC accepted to ICCV25 ☺️ Thanks to all the people that were part of it 🩷 The idea for this paper came by a lake during a visit to Graz for a talk. It has traveled with me through too many countries and too many wars, and it’s now a complete piece of work.

Ever wondered how linear RNNs like #mLSTM (#xLSTM) or #Mamba can be extended to multiple dimensions? Check out "pLSTM: parallelizable Linear Source Transition Mark networks". #pLSTM works on sequences, images, (directed acyclic) graphs. Paper link: arxiv.org/abs/2506.11997



🤩🤩🤩



🤩🤩🤩








