Seth Saler
12.4K posts

Seth Saler
@sethsaler
--dangerously-skip-permissions


xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!




Japan is going all-in on cheap, disposable drones. Japan's Defense Minister Shinjiro Koizumi visited startup AirKamuy, which makes cardboard drones (~$2K each) used by the military. 5-min assembly, no tools. Flat-pack, mass-producible. Designed to be used once.

Building apps is easy- keeping them running isn’t Introducing Replit Application Monitoring Replit Agent now watches your app in production, investigates issues, and helps fix them- so you don’t have to



Excited to announce that @hbarra , @alcor and I are joining Meta Superintelligence Labs with the entire @Dreamer team today. The last few months have been extraordinary: we built Dreamer, put the beta in the world just a month ago, and saw magic come to life for real people. Since then, thousands of people have used Dreamer to build personal, intelligent software with our Sidekick in the world’s newest and most popular programming language: English! They're building and sharing agents to manage email, calendar, and to-do’s, create learning tools for their kids, learn new languages, plan trips with friends, become better cooks, help them with work, achieve their health goals, or simply to creatively express themselves—all sorts of surprising and uniquely personal needs. These are agents as unique as the people building them, because they're built exactly the way each person wants them to be. We’ve captured some of our favorites at dreamer.com/community-lett…. What matters most here isn’t the early momentum; it’s what Dreamer has enabled people to do. People are building things they’ve wanted for years. They’re solving real, important problems no traditional software company would ever prioritize, because they’re too niche, too bespoke, too personal. What company would ever build for an “n of 1”? Our bet from the beginning has been that software should be personal, malleable, and shaped by the person using it. The constraint was never people’s imagination. It was the fact that building software is out of reach for most people. This early chapter gives us conviction that the idea resonates, the need is real, and the moment is now. @alexandr_wang was helpful to us from the very beginning, and when we showed Dreamer to Mark Zuckerberg and @natfriedman earlier this year, it was clear right away that we share the same vision of the future: one where billions of people have the power to create software that makes their lives better. We’re thrilled to accelerate this mission by joining Meta Superintelligence Labs and licensing our technology to Meta. Read more at meta.com/superintellige…. Deeply grateful to our investors @jillchase124 and @ninaachadjian for supporting our vision for a more personal, creative, and intelligent future for software. Thank you for the trust, the thought partnership, and for being in our corner at every step. To everyone in our community who built with us: thank you. You've taught us what's possible, and you're the proof this works. We're so grateful, and we're just getting started!



















