Thomas Mesnard

166 posts

Thomas Mesnard

@Mesnard_Thomas

Research Scientist @Meta Superintelligence Labs. ex-@GoogleDeepMind - Gemma. PhD @IP_Paris_ | @Mila_Quebec | MSc MVA | @ENS_ULM

Paris انضم Ekim 2015

310 يتبع528 المتابعون

تغريدة مثبتة

Thomas Mesnard@Mesnard_Thomas·15 Ağu

🚀 After Gemma 3 1B, we’re going tiny: Gemma 270M — fast, on-device, low-power & privacy-first. A new vision: smaller models with strong instruction-following & finetuning, ideal for low-latency edge apps & automation. Congrats to everyone involved! huggingface.co/google/gemma-3…

Omar Sanseviero@osanseviero

Introducing Gemma 3 270M 🔥 🤏A tiny model! Just 270 million parameters 🧠 Very strong instruction following 🤖 Fine-tune in just a few minutes, with a large vocabulary to serve as a high-quality foundation developers.googleblog.com/en/introducing…

English

1.6K

Thomas Mesnard أُعيد تغريده

Sundar Pichai@sundarpichai·15 Eki

An exciting milestone for AI in science: Our C2S-Scale 27B foundation model, built with @Yale and based on Gemma, generated a novel hypothesis about cancer cellular behavior, which scientists experimentally validated in living cells. With more preclinical and clinical tests, this discovery may reveal a promising new pathway for developing therapies to fight cancer.

English

545

3.3K

21.8K

6.9M

Thomas Mesnard@Mesnard_Thomas·14 Eki

@giffmana 🔥

QME

Lucas Beyer (bl16)@giffmana·14 Eki

@Mesnard_Thomas right with gemma3 it's all but two of this screen lol

English

1.2K

Thomas Mesnard@Mesnard_Thomas·14 Eki

Nice! Also Gemma and in particular Gemma 3 1B 👨‍🍳🤌😍

Lucas Beyer (bl16)@giffmana

English

2.5K

Thomas Mesnard@Mesnard_Thomas·13 Eyl

Thrilled to announce the world’s most capable differentially private LLM! Huge congratulations to the entire team — and special kudos to Amer Sinha for his outstanding contributions. Take a look! services.google.com/fh/files/blogs…

Jeff Dean@JeffDean

VaultGemma is a release of an open model trained from scratch with differential privacy. The blog post below and the full tech report linked from the tech report have some nice analyses to present a scaling law for differentially private language models: Blog: research.google/blog/vaultgemm… Paper: arxiv.org/abs/2501.18914

English

1.1K

Thomas Mesnard أُعيد تغريده

Sundar Pichai@sundarpichai·4 Eyl

Introducing EmbeddingGemma, our newest open model that can run completely on-device. It's the top model under 500M parameters on the MTEB benchmark and comparable to models nearly 2x its size – enabling state-of-the-art embeddings for search, retrieval + more.

English

199

520

7.5K

534.8K

Thomas Mesnard أُعيد تغريده

Omar Sanseviero@osanseviero·4 Eyl

Introducing EmbeddingGemma🎉 🔥With only 308M params, this is the top open model under 500M 🌏Trained on 100+ languages 🪆Flexible embeddings (768 to 128 dims) with Matryoshka 🤗Works with your favorite open tools 🤏Runs with as little as 200MB developers.googleblog.com/en/introducing…

English

153

1.2K

83.7K

Thomas Mesnard أُعيد تغريده

Jeff Dean@JeffDean·2 Eyl

We've had a busy August!

Philipp Schmid@_philschmid

August at Google DeepMind was like 🧞‍♂️ 🖼️ 🍌 🚀 🔍 🤏🏻 - Nano Banana (Gemini 2.5 Flash Image) - Gemini Embedding - Veo 3 Fast - Genie 3 - Imagen 4 Fast - Gemma 3 270M - Perch 2 - Kaggle Game Arena - Gemini API Url Context - AI Studio Builder (UI Rework, Prompt Suggestions, GitHub integration …) - AI studio UI (Model Picker, Chat, Scrolling…) and much more!

English

792

80.8K

Thomas Mesnard أُعيد تغريده

Andreas Steiner@AndreasPSteiner·19 Şub

Looking for a small or medium sized VLM? PaliGemma 2 spans more than 150x of compute! Not sure yet if you want to invest the time 🪄finetuning🪄 on your data? Give it a try with our ready-to-use "mix" checkpoints: 🤗 huggingface.co/blog/paligemma… 🎤 developers.googleblog.com/en/introducing…

English

8.7K

Thomas Mesnard أُعيد تغريده

Tris Warkentin@triswarkentin·19 Şub

Thrilled to launch the PaliGemma 2 Mix models today! Try out class-leading vision-language models from 3B to 28B =). #gemmaverse

Google AI Developers@googleaidevs

PaliGemma 2 mix is an upgraded vision-language model that supports image captioning, OCR, image Q&A, object detection, and segmentation. With sizes from 3B-28B parameters, there's a model for everyone. Get started. → goo.gle/430HnDe

English

421

Thomas Mesnard@Mesnard_Thomas·20 Şub

🥳🥳🥳

Google AI Developers@googleaidevs

ART

111

Thomas Mesnard@Mesnard_Thomas·25 Oca

Leaving X for Bluesky ! bsky.app/profile/tmesna…

English

Thomas Mesnard أُعيد تغريده

Andreas Steiner@AndreasPSteiner·5 Ara

🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes. 1/7

English

258

61.8K

Thomas Mesnard أُعيد تغريده

Tris Warkentin@triswarkentin·5 Ara

Thrilled to announce the launch of PaliGemma 2, a state-of-the-art update to our PaliGemma vision-language models! These models are based on Gemma 2, and are available today in 2B, 9B, and 27B sizes. developers.googleblog.com/en/introducing… huggingface.co/blog/paligemma2

English

Thomas Mesnard أُعيد تغريده

Éloi Zablocki@EloiZablocki·23 Kas

📢 Exciting opportunity alert! We (valeo.ai) just posted our annual research internship openings in computer vision & ML. Check out the openings and the great achievements by our past interns here: valeoai.github.io/interns/

valeo.ai@valeoai

🌟 Calling all MSc students passionate about computer vision and ML! We’re offering research internships about diffusion models, multi-modal transformers, continual learning, & more. 4 exciting openings await! 🔗 Learn more: valeoai.github.io/interns/ RT to spread the word! 🙌

English

2.8K

Thomas Mesnard أُعيد تغريده

Tris Warkentin@triswarkentin·3 Eki

Gemma 2 just got even better! 🚀 New Japanese-tuned 2B model AND a $150K Kaggle competition to build Gemma models for every language. Great to have @sundarpichai here to share the excitement! Read more: goo.gle/Gemma4Japan #GemmaDeveloperDay

English

471

141.6K

Thomas Mesnard@Mesnard_Thomas·18 Eyl

❤️

Clément Farabet@clmt

Gemma builds with style

ART

208

Thomas Mesnard@Mesnard_Thomas·17 Eyl

Bravo!

Glenn Cameron Jr@GlennCameronjr

Princeton's Gemma 2 9B SimPO fine-tuned model is doing amazing on @lmsysorg 🤯📈 You can find it on @huggingface : huggingface.co/princeton-nlp/… Nice work @yumeng0818 @xiamengzhou & @danqi_chen

Português

187

Thomas Mesnard أُعيد تغريده

Glenn Cameron Jr@GlennCameronjr·17 Eyl

Princeton's Gemma 2 9B SimPO fine-tuned model is doing amazing on @lmsysorg 🤯📈 You can find it on @huggingface : huggingface.co/princeton-nlp/… Nice work @yumeng0818 @xiamengzhou & @danqi_chen

English

168

53K

Thomas Mesnard أُعيد تغريده

Markus Zimmermann@zimmskal·20 Ağu

Imagine a 27B LLM can beat a 405B model in writing quality code by investing a few milliseconds in static code repair. Now stop imagining and take a look at this chart 🌈 Just for Go, we have the following stats: - Increases score +22.9% across 45 applicable models - +26.2% response files compiled (avg. 17 files, 150 tasks total) - mistral-tiny has +71% in score: beats mistral-small and mistral-medium - Gemma 2 27B has +16% in score: beats GPT4o and Llama 3.1 405B Proof that the approach of doing code fixes over a static analysis should be the default for every code response and coding assistant. Makes code instantly more useful.

English

11.4K

اكتشف

@Yale @giffmana @sundarpichai @lmsysorg @huggingface @yumeng0818 @xiamengzhou @danqi_chen