Cursor (@cursor_ai):
We're sharing a new method for scoring models on agentic coding tasks. Here's how models in Cursor compare on intelligence and efficiency:
[Attached image]
Cursor (@cursor_ai):
We use a combination of offline benchmarks and online evals to measure model quality. This makes results more useful, especially as public benchmarks are increasingly saturated.
[Attached image]
Mariano (@Mariandipietra):
@cursor_ai Please share the repo where the code for the tests resides. I want to reproduce it.
chi (@chilir_):
@cursor_ai i’m confused, why does it say lower score is better in the final figure for online evals?
Useless Devs (@uselessdevsHQ):
@cursor_ai Hope you go bankrupt at some point, given your practice of screwing over your userbase with lies. Sad that it has to come to that! At least be honest in your communication lol. OH VEY