Rookie
50 posts






GPT-5.5 by @OpenAI is now live in the Arena, landing across multiple leaderboards. Here’s how it ranks by modality: - Code Arena (agentic web dev): #9, a strong +50pt jump over GPT-5.4 - Document Arena (analysis & long-content reasoning): #6, on par with Sonnet 4.6 - Text Arena: #7, Math #3, Instruction Following: #8 - Expert Arena: #5 - Search Arena: #2 - Vision Arena: #5 Strong, well-rounded performance, especially in Code (+50 pts vs GPT-5.4). Congrats to @OpenAI on the release. Full category breakdowns by modality in the thread.



You must try this. GPT Image-2 can do PALM reading and I’m so here for it. Full prompt below ⤵️





be DeepSeek >need to achieve batch invariance so bad >split-k is the only optimal solution but is batch variant >Thinking Machines ($50b valuation) could barely recover the performance for their solution, gets 1.6x slower >hold_my_beer.jpg > dual-kernel strategy >"match or even surpass the perf of standard split-k in most major scenarios" >DeepSeek strikes again >$20b valuation btw






