

Igor Susmelj
497 posts

@ISusmelj
Co-founder @lightlyAI | Speaker | @ETH | Blogging about ML and data














Vision models have been smaller than language models; what if we scale them up? Introducing Web-SSL: A family of billion-scale SSL vision models (up to 7B parameters) trained on billions of images without language supervision, using VQA to evaluate the learned representation. It finally answers the question we saw in Cambrian-1: why do SSL models lag behind CLIP models in VQA? [1/8]







At this point it's fair to say that we made Zürich a multimodal hotspot! I'm super excited for all the major labs to get established and grow their Zürich presence :) This paper has a special place in my heart, it marks the start of my all time dream team and put us on the map:







Discussing an epic survey/position paper tonight: `Towards System 2 Reasoning in LLMs: Learning How to Think With Meta CoT` arxiv.org/abs/2501.04682 Sat, 11 Jan 2025 @ 7:00 pm UTC Join in on @ykilcher's discord: ykilcher.com/discord