Pinned Tweet
Wraient
169 posts


@toastedmel0n @macbethAI You should try Qwen 3.6 35B; it's probably the smallest acceptable model
Kimi K 2.6 is maybe the best "cheap" model
Opus 4.6/7 is probably the best overall model (if cost is no concern lol)


@_guillecasaus How is this tool supposed to prevent you from hitting the limit?

We’re unveiling a new look for Artificial Analysis!
We’ve come a long way since launching Artificial Analysis over 2 years ago. Today, we benchmark 400+ models, 50+ inference providers, and benchmark not only language models but also image, video, speech, music, hardware, and agents.
Our mission to support the AI ecosystem with independent benchmarking remains the same, but our brand and website refresh is designed to better reflect how much we’ve grown and how much further we plan to go.
A huge thank you to everyone who has been part of the Artificial Analysis community along the way: from developers choosing models and building agents, to labs, inference and hardware providers, and fellow independent researchers.
Wraient retweeted

Introducing GLM-5.1: The Next Level of Open Source
- Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo.
- Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations.
Blog: z.ai/blog/glm-5.1
Weights: huggingface.co/zai-org/GLM-5.1
API: docs.z.ai/guides/llm/glm…
Coding Plan: z.ai/subscribe
Coming to chat.z.ai in the next few days.

Wraient retweeted

@Alibaba_Qwen @ArtificialAnlys How long before you publish results for Qwen3.6?

(1/8)🚀 Introducing Qwen3.6-Plus: Towards Real-World Agents! 🤖
Today, we’re thrilled to drop a major milestone in our journey toward native multimodal agents.
Here is what makes Qwen3.6-Plus a game-changer:
💻 Next-level Agentic Coding: Smarter, faster execution.
👁️ Enhanced Multimodal Vision: Sharper perception & reasoning.
🏆 Top-tier Performance: Maintaining leading general capabilities.
📚 1M Context Window: Available by default via our API.
Built on your invaluable feedback from the Qwen3.5 era, we’re laying a rock-solid foundation for real-world devs. Get ready to experience truly transformative ✨ Vibe Coding ✨.
Huge thanks to our community! Go try it out and show us what you can build. 👇
Chat: chat.qwen.ai
API: modelstudio.console.alibabacloud.com/ap-southeast-1…
Blog: qwen.ai/blog?id=qwen3.6
🔔 Note: More Qwen3.6 models to come and be open-sourced! Stay tuned~ 👀 #Qwen #AI #AgenticCoding #VibeCoding #Agents


Giving away 5 Codex Pro plans
Each person will get 3 months of free Codex Pro (highest tier).
Winners will be selected from the comments in 48 hours. Comment below with why you want it.
OpenAI (@OpenAI)
Today, we closed our latest funding round with $122 billion in committed capital at an $852B post-money valuation. The fastest way to expand AI’s benefits is to put useful intelligence in people’s hands early and let access compound globally. This funding gives us resources to lead at scale. openai.com/index/accelera…

🚨BREAKING FRONTIER MODEL NEWS
claude mythos set for release april 16th
dario has more leaks than the titanic, here’s some info from anthropic staff.
>95 or higher on every single benchmark. except arc agi 3, yet to be tested on.
>dramatically outperforms opus 4.6 on coding, reasoning, and cyber
>anthropic privately warning government officials about its capabilities
>so powerful they’re calling it
“unprecedented cybersecurity risk”
>already being tested with early access customers
>priced at $120/$600 per million tokens
>10 million token context window
>enterprise use only
capybara is here.
capygpt is agi.
Wraient retweeted

Claude code source code has been leaked via a map file in their npm registry!
Code: …a8527898604c1bbb12468b1581d95e.r2.dev/src.zip
