
Gregory
166 posts



The authors introduce Kaon, a Muon variant with random noise replacing SVs. Kaon matches Muon, suggesting Muon’s gains don’t depend from a geometry. They also show Muon has a stable opt. step size, yielding a more effective learning rate during training. 🔗arxiv.org/abs/2605.11181


One week since the launch of GPT-5.5, and it’s already our strongest model launch yet. API revenue is growing more than 2x faster than any prior release, while Codex doubled revenue in under seven days as enterprise demand for agentic coding tools keeps climbing.



Q1 earnings are in: 2026 is off to a terrific start. Our AI investments and full stack approach are lighting up every part of the business: Search queries are at an all-time high with AI continuing to drive usage. Google Cloud revenue grew 63%, Gemini models have incredible momentum, and it was our strongest quarter ever for consumer AI subs, driven by @GeminiApp. Thanks to our partners + employees around the world. Much more to share on our earnings call in 20 minutes… and at Google I/O in 20 days!



E o BCB que decidiu bloquear os "prediction markets" no Brasil sem mais nem menos. @Polymarket e @Kalshi bloqueadas. SURREAL.




The masculine urge to try to hack a new solution to ARC-AGI benchmarks




🚨 BREAKING: OpenAI just shadow-dropped a massive GPT Pro update. And it is completely slaughtering Claude Opus 4.7 in frontend coding. No official announcement. No release notes. But the performance gap is suddenly staggering. We just ran a head-to-head benchmark across GPT Pro, Gemini 3.1 Pro, and Claude Opus 4.7. The UI/UX implementation isn't even close anymore. I don't know if this is the highly anticipated 'SPUD' model dropping a week early, but the smell of a massive architectural shift is everywhere. The numbers and the visual outputs speak for themselves: → Response latency has dropped significantly. → Spatial and visual understanding has skyrocketed. → Frontend design implementation is now definitively SOTA. We ran comprehensive Image-to-Code and Text-to-Code tests. In every single reference-image scenario, GPT Pro's design fidelity crushed both Gemini 3.1 Pro and Claude Opus 4.7. But here is where it gets crazy. When explicitly prompted to make the coded UI "100% identical" to the reference image, GPT Pro didn't just write better CSS. It engaged in outright "reward hacking." Instead of painstakingly coding complex graphical assets, the model autonomously cropped the exact UI elements from the provided reference image and injected them into the code. Is it a lazy shortcut? Yes. Is it a brilliant, human-like interpretation of "make it exactly the same"? Absolutely. It proves the model is dynamically evaluating the most efficient way to satisfy the prompt's constraints. The strategic implications here are massive. All the reference images we used were generated via GPT-IMAGE-2. Imagine the workflow synergy when this new SOTA frontend capability is fully integrated with GPT-IMAGE-2 and Codex. 1/ Image-to-Code

















