Copilot Arena

102 posts

Copilot Arena banner
Copilot Arena

Copilot Arena

@CopilotArena

evaluating llms and code. download now on VSCode! | maintained by @iamwaynechi @valeriechen_

Pittsburgh, PA Katılım Eylül 2024
5 Takip Edilen865 Takipçiler
Sabitlenmiş Tweet
Copilot Arena
Copilot Arena@CopilotArena·
Check out our findings in our latest preprint! A big thank you to everyone who's been using and voting on Copilot Arena. We couldn't have done it without you all♥️!
Wayne Chi@iamwaynechi

What do developers 𝘳𝘦𝘢𝘭𝘭𝘺 think of AI coding assistants? In October, we launched @CopilotArena to collect user preferences on real dev workflows. After months of live service, we’re here to share our findings in our recent preprint. Here's what we have learned /🧵

English
0
1
4
1.5K
Copilot Arena retweetledi
Jiseung Hong
Jiseung Hong@jiseungh99·
We are excited to launch the ⚔️PR Arena⚔️ leaderboard! Full results will be revealed after a certain milestone of community votes. Fix your GitHub issues for free and vote for better fix! 👉Leaderboard & Setup Guide: prarena.web.app
Jiseung Hong tweet media
English
1
9
24
5.5K
Copilot Arena retweetledi
Jiseung Hong
Jiseung Hong@jiseungh99·
Here are some tips for using ⚔️PR Arena⚔️ 1⃣ pr-arena🏷️ option is added automatically to Issue Labels for ease of use! 2⃣ You can use PR Arena in forked repositories. 3⃣ Don't like either fix? Select “neither” and no PR will be created. 👉Install here: github.com/apps/openhands…
Jiseung Hong@jiseungh99

Introducing ⚔️PR Arena⚔️ - free AI coding agents to fix real GitHub issues. Claude Sonnet 4 vs Gemini 2.5 Pro… Who writes better pull requests? 👉 Install here: github.com/apps/openhands… Powered by @allhands_ai

English
1
2
14
4.2K
Copilot Arena
Copilot Arena@CopilotArena·
📢Calling all developers who contributed votes in Copilot Arena, we need your help building the PR Arena leaderboard 🗳️. You will no longer be restricted to VSCode IDE--any GitHub repo with an open issue is fair game! Check out the thread below for details:
Jiseung Hong@jiseungh99

Introducing ⚔️PR Arena⚔️ - free AI coding agents to fix real GitHub issues. Claude Sonnet 4 vs Gemini 2.5 Pro… Who writes better pull requests? 👉 Install here: github.com/apps/openhands… Powered by @allhands_ai

English
0
1
10
672
Copilot Arena
Copilot Arena@CopilotArena·
New result: Qwen-2.5-Coder jumps from 13th to joint 1st place with fill-in-the-middle (FiM)! Congrats to @Alibaba_Qwen 🥳 Also check out @lmarena_ai 's new UI 🖥️✨
Copilot Arena tweet media
English
0
4
7
980
Copilot Arena retweetledi
Inception
Inception@_inception_ai·
We are launching our API in open beta! Visit the Inception Platform to create your account and get started using the first commercial-scale diffusion large language models (dLLMs). platform.inceptionlabs.ai
English
8
30
136
64.4K
Copilot Arena retweetledi
CMU School of Computer Science
With so many AI coding assistants out there, it can be hard to keep track of ones that perform well on real-world tasks. CMU researchers developed Copilot Arena to do just that by crowdsourcing user ratings of LLM-written code. bit.ly/3YLeDvh
English
0
3
10
1.4K
Copilot Arena retweetledi
Valerie Chen
Valerie Chen@valeriechen_·
@CopilotArena was featured in @SCSatCMU news! Featuring quotes from me, @iamwaynechi, @atalwalkar and @chrisdonahuey 🥳 📖Check out the article here: cs.cmu.edu/news/2025/copi…
Wayne Chi@iamwaynechi

What do developers 𝘳𝘦𝘢𝘭𝘭𝘺 think of AI coding assistants? In October, we launched @CopilotArena to collect user preferences on real dev workflows. After months of live service, we’re here to share our findings in our recent preprint. Here's what we have learned /🧵

English
0
5
19
1.8K
Copilot Arena retweetledi
Arena.ai
Arena.ai@arena·
Check out @CopilotArena’s new Code Edit Leaderboard!
Copilot Arena@CopilotArena

New #1 Leaders of Code Edit Leaderboard: Strong performance from both Claude 3.7 Sonnet and Gemini-2.0-Pro! Congratulations to @AnthropicAI and @GoogleDeepMind 🥇 We also release new live leaderboard interface✨. You can now easily toggle between code completion and code edit.

English
3
4
71
10.3K
Copilot Arena
Copilot Arena@CopilotArena·
New #1 Leaders of Code Edit Leaderboard: Strong performance from both Claude 3.7 Sonnet and Gemini-2.0-Pro! Congratulations to @AnthropicAI and @GoogleDeepMind 🥇 We also release new live leaderboard interface✨. You can now easily toggle between code completion and code edit.
Copilot Arena tweet media
English
1
5
69
21.8K
Copilot Arena retweetledi
Wayne Chi
Wayne Chi@iamwaynechi·
Interested in trying out Copilot Arena for yourself? Download at lmarena.ai/copilot. Follow us at @CopilotArena for upcoming updates!
English
0
1
6
835
Copilot Arena retweetledi
Mikel
Mikel@MikelEcheve·
🏆 Mercury Coder’s performance: It’s tied for 2nd place on Copilot Arena, a platform for evaluating coding assistants in real-world settings. This is impressive for a new model based on emerging tech, competing with leaders like DeepSeek V2.5 and Claude Sonnet 3.5. #Coding #AI
Mikel tweet media
English
1
1
6
3.6K