Cheap FLOPs

42 posts

@cheapflops

Joined July 2025
39 Following · 1 Follower
Elliot Arledge
Elliot Arledge@elliotarledge·
kernelbench.com is live. can frontier models write fast triton/cuda/cutlass/cute-dsl/ptx without cheating? i add kernelbench-hard to the existing kernelbench-v3 (which was built on top of KernelBench from @anneouyang et al.) hard mode has: fp8 gemm, topk, sonic MoE fwd, KimiDeltaAttention, paged attention decode, kahan softmax, w4a16 gemm. all of these require deep understanding of the sm120 architecture (benchmarking happens on my local rtx pro 6000 blackwell setup). gpt 5.5 xhigh and claude opus 4.7 max clear took the W here, but i was honestly surprised with kimi-k2.6 and deepseek v4 pro. this is just the first public iteration. im open to constructive criticism (dm works)
English
16
18
171
9.9K
Cheap FLOPs
Cheap FLOPs@cheapflops·
@iScienceLuvr Cloudflare tunnel to a web app running on your computer with xterm.js
English
1
0
0
777
Tanishq Mathew Abraham, Ph.D.
Claude Code has remote control which is amazing but has low rate limits. Codex has high rate limits but no remote control. How do I get the best of both worlds???
English
160
1
289
39.1K
Cheap FLOPs
Cheap FLOPs@cheapflops·
@scaling01 They always do this: deny, say it's a docs bug and then it happens like a week later
English
0
0
1
160
Ranju
Ranju@whatRanjuSaid·
@icanvardar most software engineers love anthropic(claude)
English
1
0
1
106
Can Vardar
Can Vardar@icanvardar·
anthropic hates software engineers
English
81
19
523
14.9K
Cheap FLOPs
Cheap FLOPs@cheapflops·
@Hesamation Yes it's garbage, ChatGPT's is really annoying as well.. it used to be unlimited now it cuts you off after a few seconds and says you've gone over your limit but they'll still do it for you if you click again, then it works
English
0
0
0
33
ℏεsam
ℏεsam@Hesamation·
let’s take a moment and appreciate how Claude’s built-in speech to text is ABSOLUTELY CRAP.
> you speak 2 minutes
> writes the first 20 words
> cuts off without telling you
> has a ton of typos, misses spaces
English
9
2
35
3.8K
Cheap FLOPs
Cheap FLOPs@cheapflops·
@koylanai They are just pretending to have made a mistake so people think it's fixed.. and that they aren't purposefully screwing over the users. 'Whoops- we changed the system prompt to make a bunch more money!'
English
1
0
20
1K
Muratcan Koylan
Muratcan Koylan@koylanai·
The frustrating part is that the Claude Code team, along with people deep in AI psychosis, have been gaslighting anyone who raises concerns about Claude Code's recent issues.
"your reasoning setting is wrong"
"oh that benchmark is wrong"
"we checked our code, nothing is wrong"
"skill issues"
"Hate" is a strong word, but when you're paying a lot of money for a product and it actually makes your job harder, to the point where people make you start questioning the quality of your own work, it really becomes a problem.
I'm glad they identified the issue, and I genuinely want them to succeed, along with Cursor, Codex, OpenCode, and others. We need more alternatives, but we also absolutely need open benchmarks. anthropic.com/engineering/ap…
Muratcan Koylan@koylanai

WE DON'T HATE CLAUDE CODE ENOUGH WHY ARE WE PAYING THOUSANDS OF DOLLARS IF YOUR EVERY RELEASE IS MAKING THE HARNESS LESS USABLE?

English
23
35
475
31K
Cheap FLOPs
Cheap FLOPs@cheapflops·
@elliotarledge Tldr: they're claiming to be morons so people don't suspect malice
English
1
0
1
46
Cheap FLOPs
Cheap FLOPs@cheapflops·
@dedene @AnthropicAI I'm not convinced they really fixed the root issue still.. given all their recent pricing shenanigans I think they're engaging in subterfuge on multiple levels
English
0
0
1
206
Peter Dedene
Peter Dedene@dedene·
@cheapflops @AnthropicAI Yeah, my weekly limit was due for tonight 21h anyway. This single reset doesn't give me 6 weeks of poor usage back.
English
1
0
5
2.1K
John Titor III
John Titor III@johntitorIII·
@bcherny Boris, I love Claude Code. But man.. both 4.7 and 4.6 feel 100% lobotomized to me. 😭
Português
1
0
0
150
Cheap FLOPs
Cheap FLOPs@cheapflops·
@bcherny In his next tweet in the thread he goes on to say the main thing people complained about is still an issue
English
0
0
0
662
Boris Cherny
Boris Cherny@bcherny·
We take these reports incredibly seriously. In my time on the team, this has probably been the most complex investigation we’ve had. The root causes were not obvious, and there were many confounders.
English
87
9
920
196.3K
Boris Cherny
Boris Cherny@bcherny·
Separately, we’ve also heard reports of issues with Opus 4.7 in Claude Code. The team is working on those and we’ll share more as we roll out improvements over the coming days.
English
69
16
614
87.6K
Boris Cherny
Boris Cherny@bcherny·
We’re resetting usage limits for subscribers. Thank you so much for your feedback and patience!
English
124
21
931
99.2K
Cheap FLOPs
Cheap FLOPs@cheapflops·
@bcherny You really don't lol.. you just gaslight people and then when the heat is on you retweet a bunch of BS to hide from folks
English
0
0
1
1.5K
D.T.F Skinner
D.T.F Skinner@dtfskinner·
@ClaudeDevs You could’ve just said: “We got caught throttling you all”
English
1
0
5
5.3K
ClaudeDevs
ClaudeDevs@ClaudeDevs·
Over the past month, some of you reported Claude Code's quality had slipped. We investigated, and published a post-mortem on the three issues we found. All are fixed in v2.1.116+ and we’ve reset usage limits for all subscribers.
English
2K
2.6K
40K
6.4M
ClaudeDevs
ClaudeDevs@ClaudeDevs·
We’re making changes to catch these types of issues earlier, including more internal dogfooding with configs that exactly match those of our users and creating a broader set of evals and running them against isolated system prompt changes
English
38
26
2K
584.5K