Arena.ai

2.9K posts

Arena.ai banner
Arena.ai

Arena.ai

@arena

Where AI meets the real world. Formerly LMArena. We measure and advance the frontier of AI through community-driven evaluation. We’re hiring → https://t.co/XBZCrseaWF

US Katılım Mart 2023
211 Takip Edilen136.4K Takipçiler
Sabitlenmiş Tweet
Arena.ai
Arena.ai@arena·
LMArena is now Arena. A name that takes us back to our roots with a powerful mission: to measure and advance the frontier of AI for real-world use. We have grown from a small PhD research project to a platform powered by a global community of millions. This rebrand has been shaped by the people who use it. 👇 Take a look inside the rebrand.
English
57
74
845
137.2K
Arena.ai retweetledi
Xudong Lin
Xudong Lin@Xudong_Lin_AI·
Proud of our team that makes the huge leap happen compared to last version but this is just the start. Better models are lined up and we keep improving every week. Join us towards Superhuman Multimodal Intelligence job-boards.greenhouse.io/xai/jobs/50826… !!
Arena.ai@arena

Grok 4.20 Beta Reasoning makes @xAI a top 5 lab in Vision Arena. Scoring 1240, this model ranks #11 across all Vision models today. Congrats to the @xAI team for this milestone!

English
9
21
148
17.4K
Arena.ai retweetledi
Alibaba Group
Alibaba Group@AlibabaGroup·
Proud moment! 😎 Qwen 3.5 Max Preview is bringing the heat! ✅ #3 Math ✅ Top 10 Arena Expert ✅ Top 15 Overall Big thanks to the team and everyone who tested it🚀
Arena.ai@arena

Qwen 3.5 Max Preview has landed in top 10 for Arena Expert and top 15 for Text Arena. It shows particular strength in Math. Highlights: - #3 Math - #10 Expert - #15 Text Arena - Top 20 for Writing, Literature & Language, Life, Physical, & Social Science, Entertainment, Sports, & Media, and Medicine & Healthcare Congrats to the @Alibaba_Qwen team for this new milestone!

English
2
8
47
7K
Arena.ai
Arena.ai@arena·
Battle GPT-5.4 Mini High vs. all the best frontier models in the Code Arena - and don't forget to vote! arena.ai/code
English
0
0
8
3K
Arena.ai
Arena.ai@arena·
Check out the Vision Arena leaderboard details to filter and customize your view in a variety of ways like: price, context and license. arena.ai/leaderboard/vi…
English
0
0
8
2.9K
Arena.ai
Arena.ai@arena·
Grok 4.20 Beta Reasoning makes @xAI a top 5 lab in Vision Arena. Scoring 1240, this model ranks #11 across all Vision models today. Congrats to the @xAI team for this milestone!
Arena.ai tweet media
English
8
7
159
19.2K
Arena.ai retweetledi
Microsoft AI
Microsoft AI@MicrosoftAI·
Meet MAI‑Image‑2. Built with creatives, for real creative work. Ranked #5 on @arena’s text‑to‑image leaderboard. Available now: msft.it/6014QUCBe
English
45
105
745
87.4K
Arena.ai retweetledi
Mustafa Suleyman
Mustafa Suleyman@mustafasuleyman·
Our new image generator MAI-Image-2 is out! Available now on MAI Playground for everything from lifelike realism to detailed infographics. Our team has been pushing immensely hard for this release, and we are now among the top models out there: #3 family on @arena. Check out the details in our blog: microsoft.ai/news/introduci… It's shipping soon in Copilot and Bing Image Creator, as well as Microsoft Foundry. Really proud of our progress on models and products - stay tuned for new releases and come join us on our Superintelligence mission!
Mustafa Suleyman tweet mediaMustafa Suleyman tweet mediaMustafa Suleyman tweet mediaMustafa Suleyman tweet media
English
66
84
472
147.8K
Arena.ai
Arena.ai@arena·
Let’s dive deeper into the massive improvements between MAI-Image-2 vs. MAI-Image-1 by @MicrosoftAI. MAI-Image-2 shows significant gains across all sub-categories for Text-to-Image: Gains across all 7 sub-categories in order of magnitude: - Text Rendering (+115 pts) - Portraits (+105 pts) - Product, Branding & Commercial Design (+102 pts) - Photorealistic & Cinematic Imagery (+97 pts) - 3D Imaging & Modeling (+92 pts) - Art (+87 pts) - Cartoon, Anime & Fantasy (+81 pts)
Arena.ai tweet media
Arena.ai@arena

MAI-Image-2 debuts at #5 in the Image Arena! Highlights: - #5 in Text-to-Image overall - #5 for 3D Imaging & Modeling, Cartoon, Anime & Fantasy, Photorealistic & Cinematic Imagery, Art and Portraits - #6 for Product, Branding & Commercial Design Congrats to the @MicrosoftAI team on this milestone!

English
8
12
117
14.2K
Arena.ai
Arena.ai@arena·
MAI-Image-2 debuts at #5 in the Image Arena! Highlights: - #5 in Text-to-Image overall - #5 for 3D Imaging & Modeling, Cartoon, Anime & Fantasy, Photorealistic & Cinematic Imagery, Art and Portraits - #6 for Product, Branding & Commercial Design Congrats to the @MicrosoftAI team on this milestone!
Arena.ai tweet media
English
4
13
110
20.6K
Arena.ai retweetledi
Qwen
Qwen@Alibaba_Qwen·
Pretty proud of this one! 😎 Qwen 3.5 Max Preview just hit #3 in Math, Top 10 in Arena Expert, and Top 15 overall! We're already back in the lab optimizing the preview experience. Even sharper performance coming soon—stay tuned! 🚀
Arena.ai@arena

Qwen 3.5 Max Preview has landed in top 10 for Arena Expert and top 15 for Text Arena. It shows particular strength in Math. Highlights: - #3 Math - #10 Expert - #15 Text Arena - Top 20 for Writing, Literature & Language, Life, Physical, & Social Science, Entertainment, Sports, & Media, and Medicine & Healthcare Congrats to the @Alibaba_Qwen team for this new milestone!

English
36
46
676
59.8K
Arena.ai
Arena.ai@arena·
With the preview of Qwen 3.5 Max Preview by @Alibaba_Qwen, we’re looking back at past Qwen Max variants to see how far it has progressed. Where Qwen 3.5 Max sees the largest gains vs. Qwen 3 Max: - Text Overall (+45pts) - Creative Writing (+57pts) - Math (+49pts) - Entertainment, Sports & Media (+48pts) - Writing, Literature & Language (+45pts) This a vast improvement overall across all categories since Qwen 2.5 Max.
Arena.ai tweet media
Arena.ai@arena

Qwen 3.5 Max Preview has landed in top 10 for Arena Expert and top 15 for Text Arena. It shows particular strength in Math. Highlights: - #3 Math - #10 Expert - #15 Text Arena - Top 20 for Writing, Literature & Language, Life, Physical, & Social Science, Entertainment, Sports, & Media, and Medicine & Healthcare Congrats to the @Alibaba_Qwen team for this new milestone!

English
4
23
205
20.1K
Arena.ai
Arena.ai@arena·
Qwen 3.5 Max Preview lands in top 15 for Text Arena, showing strength in the Math category: - #5 Math category - #15 for Text overall
Arena.ai tweet media
English
2
0
22
4.4K
Arena.ai
Arena.ai@arena·
Qwen 3.5 Max Preview has landed in top 10 for Arena Expert and top 15 for Text Arena. It shows particular strength in Math. Highlights: - #3 Math - #10 Expert - #15 Text Arena - Top 20 for Writing, Literature & Language, Life, Physical, & Social Science, Entertainment, Sports, & Media, and Medicine & Healthcare Congrats to the @Alibaba_Qwen team for this new milestone!
Arena.ai tweet media
English
8
11
242
118.6K