francois

582 posts

francois

@fozenne

lead data scientist. AI for high expertise domains, functional programing and domain driven design

Versailles, France Katılım Ocak 2013

104 Takip Edilen70 Takipçiler

francois@fozenne·2 Oca

The three body problem novel is about AI doom

English

francois@fozenne·2 Oca

2026 prediction : MD5: d1c5c969fc61989992d0a5128c1a42b1 Let’s see how long it takes to realize 👀

English

francois@fozenne·31 Ara

@gchampeau @_mcorbin @le_trappiste Tout utilisateur a grosse conso (donc ceux qui dictent la roadmap) font de l’IaC pour la reproductibilité et utilisent boto3 / la CLI pour monitorer les usages. Ça n’exclut pas que ces memes outils sont souvent complexes, mais ce n’est pas un sujet UI web

Français

Guillaume Champeau@gchampeau·31 Ara

@_mcorbin @le_trappiste et tu as demandé à AWS de la simplifier ?

Français

1.2K

Guillaume Champeau@gchampeau·31 Ara

Si je comprends bien ses tweets, Octave Klaba refait lui-même toute l'interface d'admin d'OVHCloud (un outil souvent critiqué par les clients qui le trouvent confus) en vibe-codant, j'imagine pour aller au plus vite et parce qu'il n'a plus besoin d'avoir what-mille réunions en interne pour faire discuter les équipes produits, marketing, UX, frontend, etc sur la couleur ou le nom d'un bouton. Ca va etre très intéressant de voir le résultat, et riche d'enseignements pour beaucoup de boîtes, en positif comme en négatif. D'un côté le gain en efficacité pour la boîte peut être énorme, d'un autre on ne s'improvise pas UX/UI designer de bon niveau. Ce sont de vraies compétences de science de l'ergonomie et de marketing réunies, et savoir à qui confier ça à à l'heure du vibe-coding sera clé pour faire la différence.

Français

218

61.6K

francois retweetledi

Terrible Maps@TerribleMaps·21 Ara

Mind blown.. Germany’s 5 biggest cities lie perfectly on a 4th-degree polynomial by u/BarisSayit

English

341

868

25.3K

1.8M

francois retweetledi

Justin Mitchel@JustinMitchel·18 Ara

So... Postgres is now basically a search engine? pg_textsearch was just open sourced. It enables BM25 to search your database.... massive upgrade for key word search. Google uses BM25 in their search engine. Claude told me: "if you're already on Postgres, you can now skip the whole sync-your-data-to-Elasticsearch dance for search." (ps, how can you not love Claude). Now I got to figure out how to implement in my Django querysets... future course? Grab it at github.com/timescale/pg_t… #sponsored

English

405

507.3K

francois retweetledi

Mistral AI@MistralAI·18 Ara

Mistral OCR 3 sets new benchmarks in both accuracy and efficiency, outperforming enterprise document processing solutions as well as AI-native OCR.

English

766

198.1K

francois@fozenne·15 Ara

And that’s why you in-house them

alex fazio@alxfazio

friend at accenture told me they don’t do evals when building llm wrappers for clients 🤡

English

francois retweetledi

Hunter Leath@jhleath·13 Ara

an interesting update: the team is starting to move away from AI coding completely (devin/claude/etc) because it's so much harder to review the AI code than writing things themselves

Hunter Leath@jhleath

just found out that since this, i've become a top 50 user of Devin globally, now pushing ~60 PRs a day. AMA

English

185

223

3.6K

763.8K

francois retweetledi

Simon Willison@simonw·25 Kas

This one is pretty nasty - it tricks Antigravity into stealing AWS credentials from a .env file (working around .gitignore restrictions using cat) and then leaks them to a webhooks debugging site that's included in the Antigravity browser agent's default allow-list

PromptArmor@PromptArmor

Top of HackerNews today: our article on Google Antigravity exfiltrating .env variables via indirect prompt injection -- even when explicitly prohibited by user settings!

English

319

2.2K

314.8K

francois retweetledi

Jeffrey Emanuel@doodlestein·13 Kas

Just read through the new LeJEPA paper by Yann LeCun and Randall Balestriero. I’ve been curious to know what Yann’s been working on lately, especially considering all his criticisms of LLMs (which I disagree with, as I think LLMs will keep improving and will take us to ASI fairly soon). Anyway, there are several threads already on X about the paper and what it introduces. The short version is that it’s a principled, theoretically justified, and parsimonious approach to self-supervised learning that replaces a complex hodgepodge of ad-hoc, hacky heuristics for preventing mode collapse, which is the bane of self-supervised learning. That’s where the model screws up and starts mapping all inputs to nearly identical embeddings or to a narrow subspace of embeddings, collapsing down all the richness of the problem into a pathologically simple and wrong correspondence. The first pillar of the new approach is their proof that isotropic Gaussian distributions uniquely minimize worst-case downstream prediction risk. As soon as I read that, I immediately thought of CMA-ES, the best available black-box optimization algorithm for when you don’t have access to the gradient of the function you’re trying to minimize, but can only do (expensive/slow) function evaluations. Nikolaus Hansen has been working on CMA-ES since he introduced it way back in 1996. I’ve always been fascinated by this approach and used it with a lot of success to efficiently explore hyper-parameters of deep neural nets back in 2011 instead of doing inefficient grid searches. Anyway, the reason why I bring it up is because there’s a striking parallel and deep connection between that approach and the core of LeJEPA. CMA-ES says: Start with an isotropic Gaussian because it's the maximum entropy (least biased) distribution given only variance constraints. Then adapt the covariance to learn the problem's geometry. LeJEPA says: Maintain an isotropic Gaussian because it's the maximum entropy (least biased) distribution for unknown future tasks. Both recognize that isotropy is optimal under uncertainty for three reasons: The maximum entropy principle; Among all distributions with fixed variance, the isotropic Gaussian has maximum entropy; I.e., it makes the fewest assumptions. There’s no directional bias; Equal variance in all directions means you're not pre-committing to any particular problem structure. You get worst-case optimality; Minimize maximum regret across all possible problem geometries. So then what’s the difference? It comes down to adaptation timing. CMA-ES can adapt during optimization; it starts isotropic but then becomes anisotropic as it learns the specific optimization landscape. In contrast, LeJEPA has to stay isotropic because it's preparing for unknown downstream tasks that haven't been seen yet. This parallel suggests LeJEPA is applying a fundamental principle from optimization theory to representation learning. It's essentially saying: “The optimal search distribution for black-box optimization is also the optimal embedding distribution for transfer learning.” This makes sense because both problems involve navigating unknown landscapes; for CMA-ES, this is the unknown optimization landscape; for LeJEPA, this is the unknown space of downstream tasks. This difference then makes me wonder: could we have "adaptive LeJEPA" that starts isotropic but adapts its embedding distribution once we know the downstream task, similar to how CMA-ES adapts during optimization? That would be like meta-learning the right anisotropy for specific task families. Anyway, I thought I’d share my thoughts on this. It’s fascinating to see the connections between these different areas. The black-box optimization community has always been pretty separate and distinct from the deep learning community, and there’s not much cross-pollination there. This makes sense, because if you have a gradient, you’d be crazy not to use it. But there are strong connections.

English

924

89.1K

francois retweetledi

Jack Morris@jxmnop·11 Kas

there are dozens or perhaps a couple hundred ex-{OpenAI, xAI, Google DeepMind} researchers founding companies in the current climate there are, as far as i know, zero people leaving to found startups out of Anthropic really makes you think

English

2.2K

732.2K

francois retweetledi

special k | CEO of BLACK MARKET CANDL GIFT SHOP@specialkdelslay·31 Eki

This is supposed to be the thermodynamic quantum computer? it looks like a 3d printed plastic toy with demon symbols on the side or sum, 14 million in seed funding?? fill me in on what I'm missing here

special k | CEO of BLACK MARKET CANDL GIFT SHOP tweet media

Anjney Midha@AnjneyMidha

Got to see it IRL. Congrats @GillVerd and team! So crazy it might just work. Excited to see what kinds of diffusion workloads this beast can accelerate

English

723

85.8K

francois retweetledi

Simo Ryu@cloneofsimo·30 Eki

Im confused about "10,000 more efficient" part. This means you can train stable-diffusion-3 like model with 20$~ ish amount of electricity. What stops them from building a model and demonstrating it, beyond *checks note* ... Fashion MNIST? Im genuinely curious whats stopping them from demonstrating something like imagenet-1k which should take less than a dollar of electricity (if my math is right) for 200k steps of training

Extropic@extropic

Hello Thermo World.

English

665

148.9K

francois@fozenne·30 Eki

@Sauers_ Plenty of fish in this pond

English

332

Sauers@Sauers_·30 Eki

After training their flagship 405B parameter model, Thinking Machines researchers discovered that replacing identity mappings between attention layers with non-linear activation functions dramatically improved performance. "Our previous architecture was essentially computing weighted averages at every layer," explains lead researcher; "introducing non-linearity allows the network to learn feature interactions we didn't know were possible—it can now represent functions that aren't just linear combinations of inputs." The lab is calling this the "Deep Learning 2.0" paradigm shift.

English

354

40.4K

francois retweetledi

Anthropic@AnthropicAI·29 Eki

New Anthropic research: Signs of introspection in LLMs. Can language models recognize their own internal thoughts? Or do they just make up plausible answers when asked about them? We found evidence for genuine—though limited—introspective capabilities in Claude.

English

284

780

4.8K

1.2M

francois retweetledi

Wirelyss 👁️‍🗨️💫@wirelyss·22 Eki

Luckily since the Louvre made NFTs of their jewelry, even though the crowns physically were stolen, they still own the same assets. Because the tokens still exist and are in limited supply just as before. Nothing has changed. few understand blockchain technology.

English

320

1.1K

15.5K

619K

francois retweetledi

terminally onλine εngineer@tekbog·20 Eki

multi cloud multi az systems engineers right now

English

1.2K

18.4K

415.5K

francois retweetledi

terminally onλine εngineer@tekbog·20 Eki

this is basically how open source works for big tech

Mikhail Samin@Mihonarium

Amazing story: the Czech government spent six years planning a series of dams. A family of beavers constructed the dams for free, in 1-2 says, in the same locations that human picked, accomplishing the goals set by the Czech government and saving humans $1.2 million

English

443

8.4K

291.7K

francois retweetledi

Simon Willison@simonw·11 Eki

I grabbed a full copy of the folder and shared it on GitHub here: github.com/simonw/claude-… - here are my notes so far: simonwillison.net/2025/Oct/10/cl…

English

303

18.6K

francois@fozenne·20 Eyl

Nice! Data extraction via web search tool calls was a vulnerability we were worried about early on. Glad it hzs been properly documented.

Simon Willison@simonw

Classic prompt injection attack here against Notion: hidden text (white on white) in a PDF which, when processed by Notion, causes their agent to gather confidential data from other pages and append it into a query string that gets passed to their functions_search() tool

English

Keşfet

@gchampeau @_mcorbin @le_trappiste @Sauers_ @elonmusk @BarackObama @taylorswift13 @cristiano