The Intelligence Company

98 posts

The Intelligence Company banner
The Intelligence Company

The Intelligence Company

@Intelligence_ai

What’s the limit? Creators of @designarena, @predictionbench, @socialsarena

انضم Ocak 2026
9 يتبع1.1K المتابعون
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
GLM-5.2 by @Zai_org is 2nd on Game Dev Arena on Design Arena with an Elo of 1368. This is a 6 position and 29 Elo jump from GLM-5.1, putting GLM-5.2 in the same performance band as Claude Fable 5 by @Anthropic. GLM-5.2 is the top open weight lab in Game Dev and second lab overall, ahead of @OpenAI and just behind @Anthropic. Congratulations to the @Zai_org team on this achievement!
Design Arena tweet media
English
25
45
534
61.3K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Riverflow Pro 2.5, a reasoning model by @riverflow_ai that calls a mix of proprietary and open diffusion models, has scored 1st on Image Arena (Models + Routers), 1st on Graphic Design Arena, and 1st in Image Edit (Models + Routers). Riverflow Pro 2.5 averages 10 Elo points above GPT Image 2 from @OpenAI in Image, Image Editing, and Graphic Design. It also establishes Pareto frontiers across Image, Image Editing, and Graphic Design in Preference vs. Speed. Congratulations to the @riverflow_ai team on the launch!
Design Arena tweet media
English
10
28
298
25.2K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Reve 2.0 by @reve debuts at 2nd on Image Editing Arena with an Elo of 1325. Reve establishes a new Pareto frontier for Preference vs. Speed, faster than any model at this preference level with an average generation time of 86.8 seconds. Reve is now the highest-ranked independent image model company in Image Editing Arena. Congratulations to the @reve team on this accomplishment!
Design Arena tweet media
English
4
15
94
10.3K
The Intelligence Company أُعيد تغريده
Grace Li
Grace Li@grx_xce·
BREAKING: Le Chaton Fat has fully saturated our benchmark. We are at a loss for words. In response, we are retiring Design Arena. Congratulations to the @MistralAI team, and thanks for putting us on vacation.
Grace Li tweet media
English
46
55
1.2K
91.6K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
Opus 4.8’s hyperfocus on agents may be making it worse at design. Opus 4.8 ranks 23rd overall on single-turn HTML Web Dev, a dramatic regression from Fable (1st), Opus 4.6 (2nd), and Opus 4.7 (3rd). This was particularly surprising as @AnthropicAI models have held the top spots on our leaderboard for months, and typically win more head-to-head matchups than any other model we track. Our analysis points to a potential underlying pattern: Opus 4.8 dramatically regressed in single-turn settings, potentially due to optimizations for multi-turn agents Concretely, Opus 4.8 shows shorter initial outputs, reduced dependency on outside sources, and deferred layout decisions that earlier Opus models handled upfront.
Design Arena tweet media
English
8
17
186
16.1K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Reve 2.0 by @reve is now 2nd overall on Image Arena with an Elo of 1354. Reve 2.0 establishes a 34 point Elo gap above GPT-Image 1.5 by @OpenAI in 3rd place. With this release, Reve is now the top independent foundation image model lab. Congratulations to the @reve team on this accomplishment!
Design Arena tweet media
English
10
34
194
94.6K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Claude Fable 5 by @AnthropicAI is #1 overall on Design Arena with an Elo of 1365. Claude Fable 5 is Anthropic’s first Mythos-class model — 22 Elo points above Claude Opus 4.8 — demonstrating state-of-the-art AI capabilities across the board, especially in software engineering, scientific research, knowledge work, and cybersecurity. The top 4 models on Design Arena are all from @AnthropicAI, marking them as the top foundational AI model lab. Huge congrats to the @AnthropicAI team on the launch!
Design Arena tweet media
English
12
18
210
10.3K
The Intelligence Company أُعيد تغريده
Grace Li
Grace Li@grx_xce·
Huge contribution to the open weights community: Ideogram 4.0 is 1st on Design Arena by a long shot Congrats to the @ideogram_ai team!
Design Arena@Designarena

BREAKING: Ideogram 4.0 is the #1 open-weight model on Image Arena with an Elo of 1285 and average generation time of 68.7 seconds. In open weights, this model holds a 115 Elo point gap above second place, ahead of HunyuanImage-3.0 by @TencentHunyuan and FLUX.2 [dev] by @bfl_ai. This is a 152 Elo point increase from @ideogram_ai's previous model, Ideogram 3.0, placing it in the same performance band as Gemini 3.0 Pro Image Gen 2k and Gemini 3.1 Flash Image Gen by @GoogleDeepmind. Ideogram’s performance establishes it as the leading independent foundation image generation lab, and top 3 lab overall behind @OpenAI and @GoogleDeepmind. Huge congratulations to the @ideogram_ai team on the launch!

English
0
1
24
3.9K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Ideogram 4.0 is the #1 open-weight model on Image Arena with an Elo of 1285 and average generation time of 68.7 seconds. In open weights, this model holds a 115 Elo point gap above second place, ahead of HunyuanImage-3.0 by @TencentHunyuan and FLUX.2 [dev] by @bfl_ai. This is a 152 Elo point increase from @ideogram_ai's previous model, Ideogram 3.0, placing it in the same performance band as Gemini 3.0 Pro Image Gen 2k and Gemini 3.1 Flash Image Gen by @GoogleDeepmind. Ideogram’s performance establishes it as the leading independent foundation image generation lab, and top 3 lab overall behind @OpenAI and @GoogleDeepmind. Huge congratulations to the @ideogram_ai team on the launch!
Design Arena tweet media
Ideogram@ideogram_ai

Introducing Ideogram 4.0: the best open image model in the world. Think it. Make it. Own it. Download the weights, fine-tune on your own data, and run it on your hardware. Live on every Ideogram plan and the API today.

English
12
45
375
41.5K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
Announcing Agentic Game Development on Design Arena - our newest multi-file, multi-turn evaluation. A sneak peek of what we've given our agents access to: - Asset Catalog: curated ready-to-use assets, including fonts and sound effects - Built-in Libraries: ~10 preloaded libraries, including Howler and Tween.js - Expanded Tool Calls: new tool calls for sprite generation and asset discovery
English
4
14
57
9.3K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
Google Gemini TTS models by @GoogleDeepMind are dominating the Text-to-Speech Arena on Design Arena. With an 80+ Elo gap between Google models and the next top model, Google Gemini 2.5 Pro takes first place, followed closely by 3.1 Flash and 2.5 Flash. These surpass @ElevenLabs’s Eleven v3 and @xAI’s Grok TTS which establishes Google as a powerhouse in text-to-speech capabilities. Congrats to the @GoogleDeepMind team for this achievement!
Design Arena tweet media
English
8
9
96
9.8K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
BREAKING: Gemini 3.5 Flash by @GoogleDeepMind is 16th overall on Design Arena with an Elo of 1299. This is a 16 position jump from Gemini 3 Flash Preview, putting Gemini 3.5 Flash in the same performance band as Claude Opus 4.5 by @AnthropicAI and GPT-5.5 by @OpenAI. Congrats to the team on the launch!
Design Arena tweet media
English
7
13
150
17.1K
The Intelligence Company أُعيد تغريده
Recraft
Recraft@recraftai·
Not to be overly dramatic, but V4.1 Utility Pro has been out for ONE WEEK and it’s already ranked #7 on Design Arena’s 2026 image generator leaderboard in the graphic design category. Two Recraft models on the board this year. This is not a drill. Try it in Recraft Studio.
Recraft tweet media
Design Arena@Designarena

BREAKING: Recraft V4.1 Utility Pro by @recraftai is #9 on Image Arena with an Elo of 1243! This puts @recraftai among the top 5 image generation labs, following @OpenAI, @GoogleDeepMind, @LumaLabsAI, and @bfl_ml Recraft V4.1 Utility Pro is in the same performance band as UNI-1.1 by @LumaLabsAI and FLUX.2 [flex] by @bfl_ml Huge congrats to the team on the launch!

English
2
2
35
4.4K
The Intelligence Company أُعيد تغريده
Design Arena
Design Arena@Designarena·
Recraft V4.1 is now on Design Arena! Built for more natural and expressive image generation with lifelike photorealism, expanded illustration styles, and accurate aesthetics from simple prompts Huge congrats to the @recraftai team on this launch!
Design Arena tweet media
Recraft@recraftai

Say hello to V4.1 This model is built for images that captivate you. Photorealism is more human, gradients are dreamier, and new illustration styles are now possible. Test it out in Recraft Studio today and see what you can create.

English
3
3
30
5K