Glitch Truth
1.6K posts

@glitchtruth
I work inside tech. I see what the press releases hide. Follow for the unfiltered version nobody else says.




Never thought I'd say this, but Copilot Excel is actually good now






@garrytan R.I.P. Blinkist. Seriously though, I foresee an uptick in people finding ways to make Kindle (+ Unlimited) more agent-friendly (against Amazon's TOS)




@replit building in replit right now feels like this








At @Replit they’re empowering a new wave of million-dollar founders. Cofounders @amasad and @HayaOdeh joined us at @southpkcommons to discuss:
– The rise of AI-native founders
– New AI models and their capabilities
– And why most founders quit too early

Full Minus One episode out now.

(00:00) Coming to America broke (and building anyway)
(03:30) Early Replit proof points kept the mission alive
(07:00) Cloud vs. local: why security tips the scales
(00:30) Execute daily, predict quarterly
(13:30) The 2023 roadmap Replit just finished executing
(17:00) Agent 4 and the end of context amnesia
(21:30) The death of the ICP
(25:30) What actually changed in AI models December 2024
(30:30) "Seek Pain": Replit's most counterintuitive cultural value
(33:30) Why consultants are the most mispriced AI-era hire
(37:30) Hunger over credentials: how Replit finds elite talent
(40:30) Co-founding with your partner: the honest answer
(45:30) Make micro-predictions or get left behind by AI
(48:00) Raising kids in a world you can't predict





xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20.

The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing the cost to run the benchmark suite.

Key Takeaways:

➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level.

➤ Large increase in real-world agentic task performance: the largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an Elo of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179. Grok 4.3 surpasses Gemini 3.1 Pro Preview, Muse Spark, GPT-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula.

➤ Grok 4.3 performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2.

➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of an 8-point drop in AA-Omniscience Non-Hallucination Rate, so Grok 4.20 0309 v2 still leads on AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3.

Congratulations to @xAI and @elonmusk on the impressive release!
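For anyone wanting to check the ~17% figure: a minimal sketch of the standard Elo expected-score formula, applied to the 276-point GDPval-AA gap quoted in the post. The function name is made up for illustration, and the leader's rating of 1776 is derived here from 1500 + 276 rather than quoted in the post. It also assumes the usual 400-point scale and ignores draws.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo formula (400-point scale)."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


grok_elo = 1500               # Grok 4.3 on GDPval-AA, as stated in the post
gap = 276                     # points by which it trails GPT-5.5 (xhigh), as stated in the post
leader_elo = grok_elo + gap   # derived value (assumption), not quoted in the post

win_rate = elo_expected_score(grok_elo, leader_elo)
print(f"{win_rate:.1%}")  # prints 17.0%
```

Only the rating difference matters in this formula, so any pair of ratings 276 points apart gives the same result.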



22 cool things to do at SaaStrAIAnnual.com 2026 (May 12-14, SF Bay):

1/ Deploy '26: the new half-day AI GTM Agent Summit on May 12. Built entirely around shipping agents in production, not theorizing about them. The opening session is the single most-registered session at the entire event.
2/ Watch Amelia build an AI VP of Marketing live on stage. Tue May 12, 4:15 PM. 10K is real. 14,230+ lines of code. Runs Monday standups, sends campaigns, integrates with Salesforce.
3/ Build your own QBee (our AI VP of Customer Success) in 45 minutes. Wed May 13, 5:00 PM. CS leaders walk out with their own production agent.
4/ The Vibe Lab. Dedicated build zone, @Replit engineers on-site all 3 days. Bring a real problem from your company. Leave with a working version. No sign-up.
5/ Stop Waiting on Engineering: Vibe Coding Workshop for Founders. Wed May 13, 3:30 PM. Bring a problem. Ship the fix.
6/ RevOps Who Ship: Build Your Own Pipeline Tools with Replit. Thu May 14, 9:00 AM.
7/ The CMO Summit with Denise Persson, CMO of Snowflake. 4th year. 150 of the best B2B + AI marketing leaders. Half-day, invite-only, no vendor pitches, no fluff panels. May 12.
8/ The CRO + CEO Summit. 250+ CROs and VPs of Sales with 100+ CEOs at $20M+ ARR.
9/ The FDE Summit. The CS function is being rebuilt faster than any other GTM role right now.
10/ Andrew Bialecki, Klaviyo CEO: candid walkthrough of how a single prompt became Composer, the AI agent now serving 200K customers.
11/ See AI agents running live across the entire conference floor. QBee, 10K, Agentforce, Monaco, Momentum, Founderscape: all production tools driving real workloads. Every speaker with an agent in production was asked to bring it. Don't marvel. Take notes.
12/ Vibe Coding for Designers: Wed May 13, 11:30 AM. Designers ship their own internal tools instead of waiting on the next engineering sprint.
13/ Vibe Coding for PMMs: Wed 9:00 AM if solo, Thu 11:30 AM if bringing your team. The function quietly using AI hardest right now.
14/ Daniel Vassilev, Relevance AI: "From Copilots to Coworkers: What Breaks When You Deploy Your First AI Workforce." The unglamorous part nobody talks about.
15/ The CPO panel: Anneka Gupta (Rubrik), Rachel Wolan (Webflow), Emrecan Doga (Glean), Anique Drumright (Harvey). 4 companies, same hard problem: turning agentic demos into products enterprises actually trust.
16/ Tom Occhino, Vercel CPO: Deploy kickoff. What's actually working with agents in production vs. what's possible.
17/ Jeanne DeWitt Grosser, Vercel COO: operational realities of running a hyper-growth AI infrastructure company.
18/ Eleanor Dorfman, Head of Industries at Anthropic: "No Legacy, No Playbook: Building Anthropic's AI-Native Sales Team." Building a revenue org from zero in 2026. No legacy systems. No legacy comp plans. Doing it in real time. You're looking at where the rest of the industry is heading in 24 months.
19/ Gamma's CEO on GTM: the breakout AI-native scaling profitably with no traditional sales team. While every VC tells founders to hire 50 AEs, they went the other way. The new capital-efficient growth.
20/ Amjad Masad / Replit deep dive with me. What's working with AI agents today. What's still to come. What lands in the next 6-12 months. We both have strong opinions.
21/ Meet-a-VC. 250+ VCs. 1,000+ scheduled 1-on-1 meetings via Who Do You Want to Meet matchmaking. Better than any cold email you'll ever send.
22/ The side events. 3 days of happy hours, dinners, brunches, and after parties across SF Bay. Where the real deals get done.

10,500+ founders, execs, and VCs. 250+ speakers. 300+ sessions. 40+ acres at San Mateo County Events Center. Tracking 140%+ of last year. May 12-14. Come ship.

👾 SaaStrAIAnnual.com 2026. May 12-14 in SF Bay


