
Introducing Gemma 3 270M 🔥 🤏 A tiny model! Just 270 million parameters. 🧠 Very strong instruction following. 🤖 Fine-tune in just a few minutes, with a large vocabulary to serve as a high-quality foundation. developers.googleblog.com/en/introducing…
Thomas Mesnard
166 posts

@Mesnard_Thomas
Research Scientist @Meta Superintelligence Labs. ex-@GoogleDeepMind - Gemma. PhD @IP_Paris_ | @Mila_Quebec | MSc MVA | @ENS_ULM





VaultGemma is an open model trained from scratch with differential privacy. The blog post below and the full tech report linked from it include some nice analyses that present a scaling law for differentially private language models. Blog: research.google/blog/vaultgemm… Paper: arxiv.org/abs/2501.18914






PaliGemma 2 mix is an upgraded vision-language model that supports image captioning, OCR, image Q&A, object detection, and segmentation. With sizes from 3B-28B parameters, there's a model for everyone. Get started. → goo.gle/430HnDe


🌟 Calling all MSc students passionate about computer vision and ML! We’re offering research internships about diffusion models, multi-modal transformers, continual learning, & more. 4 exciting openings await! 🔗 Learn more: valeoai.github.io/interns/ RT to spread the word! 🙌





