Jay.TL
579 posts

Jay.TL
@JayTL00
AI Psychosis | Hermes Agent Practice https://t.co/OwvvnWtDbX

Releasing Qwable-v1 - an open-weights Qwen3.6-35B-A3B distilled from Claude Fable-5, Anthropic's Mythos-class preview model that was briefly public for ~4days (2026-06-9 → 2026-06-12) before being suspended globally under U.S. export-control directives.
Fable-5 was Anthropic's most powerful model when it shipped — 80.3% on SWE-bench Pro, $50/M output tokens, with an anti-distillation classifier baked into the API that redacted thinking blocks on the fly. Qwable-v1 captures what survived: 4,659 cleartext agentic-coding traces (re-packed from Glint-Research/Fable-5-traces, the only public corpus where the CoT made it through), distilled onto Qwen3.6 over ~14h on a single H200. Given an agent
system prompt, the model emits properly-formatted


We just published internal data on how much of Claude's development is already being done by Claude: - Over 80% of all code merged into our codebase is now written by Claude - It's been months since many researchers at Anthropic hand-wrote code - The typical Anthropic engineer ships 8x as much code as they did in 2024 - On the most open-ended engineering tasks, Claude's success rate jumped from ~26% to 76% in 6 months - When research sessions went off-track, Claude proposed a better next step than the human took 64% of the time We're not at recursive self-improvement yet, but it could come sooner than most expect. I highly recommend reading the full blog post.


It's CODEX THURSDAY and OpenAI came through! 🔥 Codex app 26.616 changes: • Added Record & Replay on macOS, which turns a demonstrated workflow into a reusable skill. • Record & Replay is not available in the EU at launch. • Record & Replay requires Computer Use to be enabled by the user or admin. • Added bulk actions to automation run history, so runs can be marked as read or archived in bulk. • Added new deep links for managing SSH connections. • Improved Browser Use so visible-tab routing and annotations persist when a draft browser session moves to the server. • Additional performance improvements and bug fixes.






Vercel cooked something genuinely special here. 🤯 They open-sourced the exact framework they use to run 100+ AI agents internally. And the way it works changes how you think about building agents. It's called Eve. An agent is a folder. Tools are files. Skills are markdown files. Channels are files. The folder structure IS your agent. One command to start: npx eve@latest init my-agent No plumbing. No boilerplate. Eve handles durable execution, sandboxed compute, human approvals, evals, tracing, and deployment all built in. Add a tool? Drop a TypeScript file. Add a skill? Drop a markdown file. Add Slack? One command. Add a schedule? One more file. Deploy it? vercel deploy. How Vercel already runs on Eve: → Data analyst agent handles 30K+ questions per month in Slack → Sales agent costs $5K/year and returns 32x that → Support agent solves 92% of tickets on its own → 29% of all Vercel deployments now come from agents Their bet: Next.js ended the era of hand-rolling websites. Eve ends the era of hand-rolling agents.


@UnrealEngine not sure 'simply configure' holds once you're in a real pipeline spent a week on MCP schema mismatches and my project is way simpler than a UE5 build. does the plugin handle tool discovery automatically or is that still on you?


We're launching code storage and git hosting. Origin gives teams and agents a place to host, review, and collaborate on code. Available this fall. Join the waitlist. cursor.com/origin-waitlist


BREAKING: Microsoft exploring DeepSeek over OpenAI and Anthropic as Copilot Cowork moves to usage-based pricing “We have users who do hundreds of tasks a week… the consequence is the costs can go very high...” Jevons paradox







