reid
53 posts

@reidschryer
🌍 experiment more! data x design personality hire @nucleus_talent



As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.

SOTA on SWE-Bench Pro (58.4): GLM-5.1 delivers significant leaps in coding and agentic performance.




Won't be announcing the names of the NEXT fellows but since Arfur already blew the lid off the raise, hyped that Adhi and Noah are both in Cohort 001. They were already dangerous. Hoping this kind of exposure and mentor group just throws gasoline on the fire for 5cc. Lfg @eightyhi @Nostroah

Introducing MCP for arXiv. Let your research agents stand on the shoulders of giants. Fast multi-turn retrieval, keyword search, and embedding search tools across millions of arXiv papers 🚀

Imagine your AI doing design research before generating UI, studying real screens and user flows instead of guessing. That’s Refero MCP: refero.design/mcp


Thank you for your attention to this matter. cc: @AnthropicAI @DarioAmodei


Opus 4.6 is state-of-the-art on several evaluations including agentic coding, multi-discipline reasoning, knowledge work, and agentic search. We're also shipping new features across Claude in Excel, Claude in PowerPoint, Claude Code, and our API to let Opus 4.6 do even more.


In the coming weeks, we plan to start testing ads in ChatGPT free and Go tiers. We’re sharing our principles early on how we’ll approach ads, guided by putting user trust and transparency first as we work to make AI accessible to everyone.
What matters most:
- Responses in ChatGPT will not be influenced by ads.
- Ads are always separate and clearly labeled.
- Your conversations are private from advertisers.
- Plus, Pro, Business, and Enterprise tiers will not have ads.



