Datacurve

11 posts

@datacurve

Research and data to advance frontier models.

San Francisco · Joined February 2024
11 Following · 4.2K Followers
Pinned Tweet
Datacurve @datacurve
Opus 4.8 is now on DeepSWE. On the default high thinking effort, it scores 6% higher than Opus 4.7 xhigh, while also lowering average cost per task.
60
99
1.4K
648.4K
Datacurve @datacurve
Opus 4.8 delivers efficiency gains by solving tasks in fewer steps, directly reducing the total number of input tokens required per task.
8
5
158
94.8K
Datacurve retweeted
Serena Ge (Datacurve) @serenaa_ge
Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
504
742
6K
1.9M
Datacurve retweeted
Serena Ge (Datacurve) @serenaa_ge
I presented today at Demo day Day 2 and @TechCrunch featured us @datacurve! Just been reading TC and listening to TC Daily Crunch since high school mornings... a surreal feeling to see us on it. Also, post-demo sadness cuz now YC is coming to an end
9
5
150
29.7K