
cudnn_cu12
1.9K posts

cudnn_cu12
@_proteuss_
Interests: Machine learning research, learning in neural networks, applied AI, startups, etc






We've released the QR problem, a more robust qr_v2 with a fresh leaderboard so please resubmit! Thank you to @blelbach, @myainotez and @nikhilbarhate99 for sharing feedback. Sorry if I missed anyone! I considered automatically backfilling all submissions but the rankings do change quite a bit so I figured a refresh would be better. Changelog * Fail submissions if they fail when we change random seeds * Add nasty correctness cases with more degenerate inputs in mixed batches * Recheck correctness when doing perf testing to avoid Volkswagen cheat * Reject Nan/Inf residuals * Validate each matrix factorization residual, since averaging was hiding bad matrices * Old QR is still open so folks can't see submissions but you can't submit anything to it Wontfix * Stream hacking is still banned via very blunt ban of the word "stream" we don't have a good solution for this * CUDA graphs are allowed but not particularly interesting to us Best submissions so far if I resubmit their solutions are

one intel b70 ($950), first day setup Qwen3.6-27B W4A16 (autoround), *no* MTP 128k context, kv cache fp16 1 session: 28.1 tok/s 2 concurrent sessions: 52.0 tok/s cumulative 4 concurrent sessions: 87.8 tok/s cumulative 64 concurrent sessions: 234.7 tok/s cumulative





Gemma 4 12B Coder is here and it's a game changer for local code generation. This GGUF model packs Google's latest gemma-4 architecture into a compact 12B size, perfect for running on consumer hardware. It's optimized for reasoning and thinking, making it ideal for developers who want fast, private coding assistance without the cloud.


This is a watershed moment. GLM-5.2 solidly beat Opus 4.8 and human participants in our backend take-home, making the whole thing obsolete. It also pushed forward the state-of-the-art for multi-stage media-to-transcript, with a new release: offmute-v2. I come with receipts.

first submission - im not last








