
Jack
833 posts





GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality: - Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4 - Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6 - Text Arena: #7, Math #3, Instruction Following: #8 - Expert Arena: #5 - Search Arena: #2 - Vision Arena: #5 Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.

Lane Thomas walkoff and I’ll post my fully erect shaft and balls




Everyone in the world has to take a private vote by pressing a red or blue button. If more than 50% of people press the blue button, everyone survives. If less than 50% of people press the blue button, only people who pressed the red button survive. Which button would you press?


We asked this to a large sample of nationally representative Americans - blue wins by a 3:1 margin!


two facts - Opus 4.7 is a decent upgrade. if it's worse for you it's a skill issue - GPT-5.5 will absolutely IQ mog Opus 4.7






I’m muting people on X who don’t understand how far ahead Anthropic is. It’s hurting my brain so much that I had to do this. If you think OpenAI is in anyway better than Anthropic, please just block me to save both us some trouble.







