Naeem
618 posts


Holy sh*t 😳
Le gros chaton by Mistral beats both opus 4.8 and gpt 5.5 in DeepSWE for less than 1% of the cost!
Interestingly enough it gets worse when it reasons more

Arthur Mensch@arthurmensch
It's actually le gros chaton
English

@ZixuanLi_ I love glm, haven't tried zcode yet. I'll give it a spin as soon as I get some time. 👀
English

GLM-5.2 hasn't officially launched yet (we'll have a formal launch, as usual).

Z.ai@Zai_org
Intelligence should be open, accessible, and ready to build with, empowering every developer, everywhere. GLM-5.2 is now available to all GLM Coding Plan users, including Lite, Pro, Max, and Team plans. docs.z.ai/devpack/latest… As our new flagship model, GLM-5.2 delivers powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks. API and Chatbot services will launch next week. The model will also be officially open-sourced next week under the MIT License. The future of AI is open, and it belongs to the people.
English

I love Grok's user interface, especially the Android app. Companies generally don't put so much effort into polishing their Android apps to this degree, shows how dedicated Grok team is. All I want is bigger and stronger models from them now. Time to climb the ranks... @xai
English


Last month, The Information reported that DeepSeek V4.1 was scheduled to be released in June
The Dragon Boat Festival is coming up next week on June 19, so we might see V4.1 released shortly before then
Qwen has also been releasing monthly iterations of their Max / Plus model. We might see a new release from them as well

English


The Rio 3.5 model broke the internet this week. The plot twist? It’s essentially our open-source model, Nex N2 Pro, wearing a different hat.
🤯 We analyzed the weights, and the recipe is exact: Rio 3.5 ≈ 0.6 * Nex N2 Pro + 0.4 * Qwen 3.5
It even literally introduces itself as "Nex N2 Pro" if you ask it without initial system prompt!
😂 We are flattered that the City of Rio used our work to achieve SOTA performance. Thanks for the ultimate benchmark validation.
🤝 But in the open-source world, attribution matters.
👇 Full mathematical proof & verify script in the first reply!

English

It's weird that @Zai_org hasn't announced GLM 5.2 in their own post yet.
zR@zRdianjiao
GLM-5.2 will be available on CodingPlan in a few hours, with open-source release coming very soon!
English

@ZixuanLi_ One thing that both anthropic and openai frontier models have is that they are *very* reliable at any given task, I think this is the seperation from a frontier model and a "frontier-class" model. I think this should be the direction we move forward to
English



















