
Scaling Tech HQ
751 posts

@scaling_tech_hq
Deconstructing AI workflows that scale. I test which LLM tools survive launch hype to find the specific stacks that drive real world utility.


If you have been unable to use AI to help you out in a meaningful way with anything in your life: I am sorry.








Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.















Exciting news, MAI-Image-2.5 (Preview) from @MicrosoftAI debuts at #3 in the Text-to-Image Arena with a score of 1,254 — a +72 point improvement over MAI-Image-2. A top 5 arena previously held only by @GoogleDeepMind and @OpenAI has a new lab in the mix. Congrats to the @MicrosoftAI team on this accomplishment.



I'm genuinely disappointed with Google, and I don't like to say it because Google employees are very kind and nice to talk to, but Google just had to do three things:
- Redesign the Gemini app and web so they looked good
- Make it functional
- Release a SOTA model, one people really want to use, unlike any Gemini model other than 2.5 Pro back then

So far they only did the first thing




Alipay launched a full-stack AI payment solution for partners across industries, including AI companies, retailers, and other businesses preparing for the agentic economy. The launch includes two new services: AI Wallet and Token Pay.



Alipay introduces its full-stack AI payment solution to partners across industries, ranging from AI companies to traditional retailers, and debuted two new services — the world’s first AI Wallet and Token Pay — to support the agentic economy’s rapid growth.




i had codex audit my entire macbook to see how much space we can save and it found 500 GB to save, AWESOME

prompt was: "do a FULL read only analysis on my Macbook to help me optimize storage"

note: why tf is there a codex-tui.log file that is 116 GB ??????? WHAT ????
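For anyone who wants to sanity-check a result like this by hand, here is a minimal sketch of a read-only scan that lists the largest files under a directory. It is not what Codex ran; the function names `largest_files` and `human` are made up for illustration, and it only reads file metadata, never deleting or modifying anything.

```python
import heapq
import os
import stat


def largest_files(root, top_n=10):
    """Return up to top_n (size_bytes, path) pairs for the largest regular files under root."""
    sizes = []
    # onerror swallows unreadable directories so the scan keeps going
    for dirpath, _dirnames, filenames in os.walk(root, onerror=lambda err: None):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                # lstat does not follow symlinks, so a link is not counted as its target
                info = os.lstat(path)
            except OSError:
                continue  # file vanished or is not readable: skip it
            if stat.S_ISREG(info.st_mode):
                sizes.append((info.st_size, path))
    # nlargest sorts by size first, largest to smallest
    return heapq.nlargest(top_n, sizes)


def human(num_bytes):
    """Format a byte count with binary (1024-based) units."""
    value = float(num_bytes)
    for unit in ("B", "KB", "MB", "GB", "TB"):
        if value < 1024 or unit == "TB":
            return f"{value:.1f} {unit}"
        value /= 1024
```

Calling `largest_files(os.path.expanduser("~"), 20)` and printing `human(size)` next to each path should surface oversized log files like the one in the post, though a full home-directory walk can take a while.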




