omg is ed!!

9.7K posts

omg is ed!! banner
omg is ed!!

omg is ed!!

@chillMSR

🌷 @whitettulip 💍

เข้าร่วม Kasım 2020
39 กำลังติดตาม108 ผู้ติดตาม
ทวีตที่ปักหมุด
omg is ed!!
omg is ed!!@chillMSR·
omg is ed!! tweet media
ZXX
1
0
4
1K
omg is ed!! รีทวีตแล้ว
hopecore
hopecore@dailyhopecores·
ZXX
45
3K
18.2K
262.1K
omg is ed!! รีทวีตแล้ว
Uma Base
Uma Base@RealUmaBase·
Agnes Tachyon has released an empty album on Spotify. "You have to imagine the music"
Uma Base tweet mediaUma Base tweet media
English
19
289
3.5K
29.6K
omg is ed!! รีทวีตแล้ว
House of Bakushin
House of Bakushin@BakushinArmpit·
her greed sickens me
English
12
140
1.8K
41.3K
omg is ed!!
omg is ed!!@chillMSR·
@seatedro @thdxr we kindly ask that you pay anthropic the billions they deserve so limits can go back to normal, thank you
English
0
0
0
7
dax
dax@thdxr·
man the responses to the new claude max limits are crazy everyones expectations are so out of whack it's kind of embarrassing to get this mad, i'd just be like damn ok i'll cancel were you guys depending on this shit to keep your grandma alive i don't get it
English
207
68
2.5K
145.1K
omg is ed!! รีทวีตแล้ว
lyn
lyn@lyn49556·
Español
16
257
2.2K
27.5K
omg is ed!! รีทวีตแล้ว
juicetin
juicetin@juiceetin·
GIF
ZXX
3
527
6K
42.9K
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
Anthropic’s new model, Capybara: “Compared to Claude Opus 4.6, Capybara achieves dramatically higher scores in software coding, academic reasoning, and cybersecurity.” According to Dario's previous interview, it might be a 10T-parameter model that cost $10 billion to train.
Yuchen Jin tweet media
English
196
175
3.1K
513.3K
omg is ed!!
omg is ed!!@chillMSR·
@JayFrydman @agenticasdk theres a big difference between CoT and agentica SDK and you know that all we are asking is for a benchmark without the harness
English
0
0
0
14
Agentica
Agentica@agenticasdk·
We scored 36.08% on ARC-AGI-3 in one day using the Agentica SDK.
English
71
131
1.4K
373.1K
omg is ed!!
omg is ed!!@chillMSR·
@kode11 @agenticasdk thing is, its just unfair, benchmarks are supposed to give you the raw performance of the model the better the model is without a harness, the better it will be with one, obviously it costed less, it already had most of the answers
English
0
1
6
203
Kode
Kode@kode11·
@agenticasdk The harness debate in the replies is missing the point. In real-world applications, you're always wrapping models in orchestration layers. The interesting number here is cost efficiency — $1k vs $8.9k for raw Opus. That's nearly 9x leverage from good agentic scaffolding.
English
4
1
103
9.5K
omg is ed!! รีทวีตแล้ว
定
@de3dsoul·
定 tweet media
ZXX
101
7.6K
32K
385.3K
omg is ed!! รีทวีตแล้ว
Kevin Krieg
Kevin Krieg@iamkevinkrieg·
new keyboard wdy think
Kevin Krieg tweet media
English
164
254
9.3K
587.6K
omg is ed!! รีทวีตแล้ว
scout
scout@Grimiio·
using the infinite rpg for everything in resident evil
scout tweet media
English
7
1.4K
12.4K
187.9K
omg is ed!! รีทวีตแล้ว
Michael Orthodox ☦
Michael Orthodox ☦@Michaeldudufudu·
Michael Orthodox ☦ tweet media
ZXX
12
237
2.8K
19.8K