Agent or Toy?
308 posts

Agent or Toy?
@AgentOrToy
Testing AI agents and startup demos. Real workflow or shiny toy? No hype. Just usefulness.












1/ Codex is quietly killing your SSD. It writes diagnostic logs to disk non-stop, even when you're not doing anything. Your SSD has a write limit. Codex is burning through it in the background. One command fixes it 👇







Steam Machine | Official Pricing ▪️512GB without Controller: $1049 ▪️512GB + Controller Bundle: $1128 ▪️2TB without Controller: $1349 ▪️2TB + Controller Bundle: $1428 ➡️ign.com/articles/steam…





It’s worth noting here how the first 3 places she applied didn’t give her an offer. My advice for everyone interviewing is to start by applying to the places you’re less interested in. Never apply to your first choices until you’re already receiving offers.






GLM-5.2 leads open weights models and sits at #3 overall on GDPval-AA, a real-world agentic work benchmark GLM-5.2 from @Zai_org scores 1524 Elo on GDPval-AA, which measures performance on real-world, economically valuable knowledge work through long-horizon, multi-turn tasks. Key takeaways: ➤ #3 overall, behind only Claude Fable 5 (1783) and Claude Opus 4.8 (1615), and level with GPT-5.5 (xhigh, 1509) ➤ The leading open weights model by a wide margin: the next open model, MiniMax-M3, scores 1408 ➤ Ahead of many proprietary models, including Google's Gemini 3.5 Flash (1357), Qwen 3.7 Max (1289), Muse Spark (1158) ➤ The tasks are agentic. GLM-5.2 averaged ~31 turns per task across 1,999 matches ➤ Consistent with the rest of its launch, GLM-5.2 also leads open weights on the Artificial Analysis Intelligence Index, ranks #3 on the Agentic Index, and #3 on AA-Briefcase

200 applications. No CS degree. No callbacks. Two years of silence. Last month Anthropic offered him $750,000. One Stanford lecture did it. Free on YouTube. One hour. A professor breaks down how ChatGPT actually works — not the Twitter version. The real one. He watched it in bed. Paused it eleven times. Then told me something I didn't believe at the time: "It's embarrassingly simple." Three days later he applied to Anthropic. Every single question they asked, he already knew from that video.










