Yoan Di Cosmo
@yoandicosmo · @HackIterate
Joined June 2018
628 Following · 229 Followers
40 posts
Yoan Di Cosmo @yoandicosmo
artists, researchers and entrepreneurs, at the highest level, are the same kind of people
Yoan Di Cosmo @yoandicosmo
Greatness often starts with an act of treason, cf. Fairchild's "traitorous eight"
Yoan Di Cosmo @yoandicosmo
Lisbon is a beautiful city
Paco Villetard @pacovilletard
We're coming out of stealth to announce our cyber defense research lab. We are exploring data and post-training techniques to build superhuman cyber defenders. Our mission is to make sure the West always wins.

Over the last 3 months we've built an automated data pipeline to create training data from 80k CVEs (aka public vulnerabilities).

Our next topic? Post-training a model that's better at fixing all the vulnerabilities in your codebase. Like really fixing them. Not saying it's secure when there are still ways to exploit them.

Here are the questions that keep us awake at night:
- How do you train a model to defend without improving its capabilities to attack?
- What's the right reward? How do you measure defense capabilities?
- How do you create synthetic training data that reproduces real systems?
- What kind of access do you give an AI cyber defender? How far can you trust it?

If you know insanely good cyber experts (red team, blue team, CTF aficionados) or ML engineers (synthetic data generation and post-training models), send them my way. We need to make models far better at defending.
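The pipeline itself isn't public, so this is only a minimal sketch of the general idea: turning a CVE record and its upstream patch into a (vulnerable code, fixed code) training pair. All names here (`CVERecord`, `diff_to_pair`, the placeholder CVE) are hypothetical, not part of the lab's actual system.

```python
from dataclasses import dataclass


@dataclass
class CVERecord:
    cve_id: str
    description: str
    patch_diff: str  # unified diff of the upstream fix


def diff_to_pair(diff: str) -> tuple[str, str]:
    """Split a unified diff into (vulnerable, fixed) code snippets."""
    before, after = [], []
    for line in diff.splitlines():
        if line.startswith("-") and not line.startswith("---"):
            before.append(line[1:])  # removed line: only in the vulnerable version
        elif line.startswith("+") and not line.startswith("+++"):
            after.append(line[1:])   # added line: only in the fixed version
        elif not line.startswith(("@@", "---", "+++")):
            before.append(line)      # context line: shared by both versions
            after.append(line)
    return "\n".join(before), "\n".join(after)


def to_training_example(rec: CVERecord) -> dict:
    """Frame the pair as a fix-the-vulnerability prompt/completion example."""
    vulnerable, fixed = diff_to_pair(rec.patch_diff)
    return {
        "prompt": f"Fix the vulnerability ({rec.cve_id}): {rec.description}\n\n{vulnerable}",
        "completion": fixed,
    }


record = CVERecord(
    cve_id="CVE-0000-0000",  # placeholder, not a real CVE
    description="SQL injection via unsanitized user input",
    patch_diff=(
        '-query = "SELECT * FROM users WHERE id = " + uid\n'
        '+query = "SELECT * FROM users WHERE id = ?"'
    ),
)
example = to_training_example(record)
```

A real pipeline over 80k CVEs would also have to locate the fix commit, slice out the relevant function, and verify the patch actually closes the exploit, which is where most of the hard work presumably lives.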
Yoan Di Cosmo @yoandicosmo
Super grateful to have had the opportunity to work alongside the HF team on this project! Thanks again @akseljoonas ;)
Quoting Aksel @akseljoonas:
Introducing ml-intern, the agent that just automated the post-training team @huggingface

It's an open-source implementation of the real research loop that our ML researchers do every day. You give it a prompt, it researches papers, goes through citations, implements ideas in GPU sandboxes, iterates, and builds deeply research-backed models for any use case. All built on the Hugging Face ecosystem.

It can pull off crazy things:

We made it train the best model for scientific reasoning. It went through citations from the official benchmark paper, found OpenScience and NemoTron-CrossThink, added 7 difficulty-filtered dataset variants from ARC/SciQ/MMLU, and ran 12 SFT runs on Qwen3-1.7B. This pushed the score 10% → 32% on GPQA in under 10h. Claude Code's best: 22.99%.

In healthcare settings it inspected the available datasets, concluded they were too low quality, and wrote a script to generate 1100 synthetic data points from scratch for emergencies, hedging, multilingual cases, etc., then upsampled 50x for training. Beat Codex on HealthBench by 60%.

For competitive mathematics, it wrote a full GRPO script, launched training with A100 GPUs on hf.co/spaces, watched rewards climb and then collapse, and ran ablations until it succeeded. All fully backed by papers, autonomously.

How does it work? ml-intern makes full use of the HF ecosystem:
- finds papers on arxiv and hf.co/papers, reads them fully, walks citation graphs, and pulls datasets referenced in methodology sections and on hf.co/datasets
- browses the Hub, reads recent docs, inspects datasets, and reformats them before training so it doesn't waste GPU hours on bad data
- launches training jobs on HF Jobs if no local GPUs are available, monitors runs, reads its own eval outputs, diagnoses failures, and retrains

ml-intern deeply embodies how researchers work and think. It knows what data should look like and what good models feel like.

Releasing it today as a CLI and a web app you can use from your phone/desktop.

CLI: github.com/huggingface/ml…
Web + mobile: huggingface.co/spaces/smolage…

And the best part? We also provisioned $1k of GPU resources and Anthropic credits for the quickest among you to use.

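The real implementation lives in the linked repo; as a rough sketch only, the propose-train-evaluate-iterate loop the tweet describes can be reduced to a generic search over training configs, where in ml-intern's case "propose" is paper- and citation-driven and "train_and_eval" is an actual SFT run plus a benchmark. The toy stand-ins below (a learning-rate sweep with a fake score) are purely illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RunResult:
    config: dict
    score: float


def research_loop(
    propose: Callable[[list[RunResult]], dict],  # ideas mined from papers/past runs
    train_and_eval: Callable[[dict], float],     # a training run + benchmark eval
    budget: int,
) -> RunResult:
    """Propose a config from history, train, evaluate, repeat; return the best run."""
    history: list[RunResult] = []
    for _ in range(budget):
        cfg = propose(history)
        score = train_and_eval(cfg)
        history.append(RunResult(cfg, score))
    return max(history, key=lambda r: r.score)


# Toy stand-ins: sweep learning rates instead of launching real GPU jobs.
def propose(history: list[RunResult]) -> dict:
    tried = {r.config["lr"] for r in history}
    for lr in (1e-3, 5e-4, 1e-4):
        if lr not in tried:
            return {"lr": lr}
    return {"lr": 1e-4}


def train_and_eval(cfg: dict) -> float:
    # Fake benchmark: pretend 5e-4 is the optimal learning rate.
    return 1.0 - abs(cfg["lr"] - 5e-4) * 1000


best = research_loop(propose, train_and_eval, budget=3)
```

The interesting engineering is inside the two callables: grounding `propose` in citation graphs and making `train_and_eval` robust to failed or collapsed runs, as in the GRPO example above.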
Yoan Di Cosmo reposted
Axel @ax_pey
Time to build general-purpose robots, on hardware made in SF. We are bringing @NASA @GoogleDeepMind @scale_AI and more to @ycombinator for a general-purpose robotics hackathon. Each team will have a robot and compete across all AI modalities to make the coolest AI project ⬇️
Karim @Karim_RC
We brought in a writer of Toy Story to build a robot that feels alive. Introducing Ongo, a desk lamp that will light up your life. Pre-order one of the first 100 units (link below).
Yoan Di Cosmo reposted
Zoe Qin @zqwq333
It’s the last weekend before December, so I guess we’ll squeeze in a last hack 💻🎄 🇬🇧🇫🇷🇵🇱🇺🇸🇪🇸🇨🇳 losing count of how many others are in the room. This time on RL with @cognition @originator, organised by @yoandicosmo. Who else should we collaborate with next year 👀
Axel Darmouni @ADarmouni
Had a really fun time at the @unaitefr hackathon, on the Open Track, where I made a LoL replay analyzer!

Not gonna lie, I was quite inspired by the products (most notably @dpmlol) I’ve seen over Twitter. The goal was to make a tool that automates game reviews: it analyzes complete logs you can gather through the RIOT API to get game feedback using GenAI (Claude 4.5 Sonnet from @AnthropicAI here).

As a hardstuck E4 Lillia player, the tool was useful to me even in this very prototype-like state and gave me tips I hadn’t considered, like dying too much early game or repeating the same mistake.

Thanks @yoandicosmo for the organization! Leaving the video I’ve submitted below, hope you enjoy the watch! 🤗
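The hackathon project itself isn't shown, but the replay-review idea can be sketched: reduce a match timeline to stats (e.g. early-game deaths) and hand an LLM a coaching prompt. The timeline shape below loosely mirrors a frames-of-events log such as Riot's match-v5 timeline, but every field name here is an assumption, and the data is fabricated for illustration.

```python
EARLY_GAME_MS = 14 * 60 * 1000  # first 14 minutes of the game


def early_deaths(timeline: dict, participant_id: int) -> int:
    """Count how often a player died before the early-game cutoff."""
    count = 0
    for frame in timeline["frames"]:
        for event in frame["events"]:
            if (
                event["type"] == "CHAMPION_KILL"
                and event["victimId"] == participant_id
                and event["timestamp"] < EARLY_GAME_MS
            ):
                count += 1
    return count


def review_prompt(timeline: dict, participant_id: int) -> str:
    """Build the coaching prompt that would be sent to the LLM."""
    deaths = early_deaths(timeline, participant_id)
    return (
        f"Player {participant_id} died {deaths} times before 14:00. "
        "As a coach, point out the biggest early-game mistake."
    )


# Tiny fabricated timeline; the third kill is after 14:00 so it is filtered out.
timeline = {
    "frames": [
        {"events": [
            {"type": "CHAMPION_KILL", "victimId": 3, "timestamp": 300_000},
            {"type": "CHAMPION_KILL", "victimId": 3, "timestamp": 600_000},
            {"type": "CHAMPION_KILL", "victimId": 3, "timestamp": 900_000},
        ]},
    ],
}
prompt = review_prompt(timeline, participant_id=3)
```

A full tool would fetch real timelines with an API key, compute more stats (CS, vision, objective timings), and pass the summary plus raw events to the model for the actual review.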