Cursor@cursor_ai·12 Mar

We're sharing a new method for scoring models on agentic coding tasks. Here's how models in Cursor compare on intelligence and efficiency:

Cursor@cursor_ai·12 Mar

We use a combination of offline benchmarks and online evals to measure model quality. This makes results more useful, especially as public benchmarks are increasingly saturated.

Cursor@cursor_ai·12 Mar

Learn more: cursor.com/blog/cursorben…

Mariano@Mariandipietra·6d

@cursor_ai Please share the repo where the code for the tests resides. I want to reproduce it.

chi@chilir_·12 Mar

@cursor_ai i’m confused, why does it say lower score is better in the final figure for online evals?

Useless Devs@uselessdevsHQ·4d

@cursor_ai Hope you get bankrupt at some point with all your practice screwing your userbase with lies. Sad that it has to come that far! At least be honest in your communication lol. OH VEY
