DeepReinforce

2

201

DeepReinforce retweetledi

Qwen@Alibaba_Qwen·2h

Congratulations to the GrandCode team on this remarkable achievement！👏 The last stronghold of coding has been conquered, and we are incredibly proud that Qwen is the engine behind it! 💻🔥 This is a real milestone moment for coding intelligence and a fascinating example of how far agentic RL systems have come. The pace of AI progress is truly mind-blowing. Stay tuned .🌍✨

The last stronghold of coding has just been conquered by AI. In the most recent three Codeforces live competitions, i.e., Round 1087, Round 1088, and Round 1089, GrandCode, our agentic AI system, ranked first in all of them, beating all human participants, including legendary grandmasters. GrandCode is a multi-agent reinforcement learning system designed for competitive programming. It orchestrates a variety of agentic modules (hypothesis proposal, solver, test generator, summarization, etc) and jointly improves them through post-training and online test-time RL. GrandCode is developed based on Qwen. Huge respect to the Qwen @Alibaba_Qwen team for their contributions to the community. It is hard to imagine how quickly AI has advanced in just one year: 1st — GrandCode (March 2026) 8th — Gemini 3.1 Pro (February 2026) 175th — OpenAI o3 (April 2025) We can’t wait to see what happens over the next year.

English

9

12

124

13.3K

DeepReinforce@deep_reinforce·2h

👀Mode details at: deep-reinforce.com/grandcode.pdf

The last stronghold of coding has just been conquered by AI. In the most recent three Codeforces live competitions, i.e., Round 1087, Round 1088, and Round 1089, GrandCode, our agentic AI system, ranked first in all of them, beating all human participants, including legendary grandmasters. GrandCode is a multi-agent reinforcement learning system designed for competitive programming. It orchestrates a variety of agentic modules (hypothesis proposal, solver, test generator, summarization, etc) and jointly improves them through post-training and online test-time RL. GrandCode is developed based on Qwen. Huge respect to the Qwen @Alibaba_Qwen team for their contributions to the community. It is hard to imagine how quickly AI has advanced in just one year: 1st — GrandCode (March 2026) 8th — Gemini 3.1 Pro (February 2026) 175th — OpenAI o3 (April 2025) We can’t wait to see what happens over the next year.

Français

0

11

620

DeepReinforce@deep_reinforce·3h

2/ 🔹The original submission code for the three competitions: github.com/deepreinforce-… 🔹Tech report link: github.com/deepreinforce-… 🔹Blog: deep-reinforce.com/cp.html

English

8

35

2.5K

DeepReinforce@deep_reinforce·3h

1/ Screenshots of our standings in the three competitions.

English

7

40

2.2K

DeepReinforce@deep_reinforce·3h

The last stronghold of coding has just been conquered by AI. In the most recent three Codeforces live competitions, i.e., Round 1087, Round 1088, and Round 1089, GrandCode, our agentic AI system, ranked first in all of them, beating all human participants, including legendary grandmasters. GrandCode is a multi-agent reinforcement learning system designed for competitive programming. It orchestrates a variety of agentic modules (hypothesis proposal, solver, test generator, summarization, etc) and jointly improves them through post-training and online test-time RL. GrandCode is developed based on Qwen. Huge respect to the Qwen @Alibaba_Qwen team for their contributions to the community. It is hard to imagine how quickly AI has advanced in just one year: 1st — GrandCode (March 2026) 8th — Gemini 3.1 Pro (February 2026) 175th — OpenAI o3 (April 2025) We can’t wait to see what happens over the next year.

English

80

141

246

198K

DeepReinforce@deep_reinforce·1d

🥳details at : github.com/deepreinforce-…

English

4

108

DeepReinforce@deep_reinforce·1d

🧑‍🍳CUDA-L2 now supports H100 and RTX 3090. 🔹On H100 under server mode, CUDA-L2 achieves +41.7%, +40.5%, +42.1%, and +22.1% over torch.matmul, cuBLAS, cuBLASLt-heuristic, and cuBLASLt-AutoTuning. 🔹On RTX 3090 under server mode, CUDA-L2 achieves +28.7%, +35.3%, +28.1%, and +19.8% over torch.matmul, cuBLAS, cuBLASLt-heuristic, and cuBLASLt-AutoTuning. 🥳More updates will come. Stay tuned🫡 #CUDA #AI

English

0

10

321

DeepReinforce@deep_reinforce·17 Mar

☺️ Stay tuned!!

English

87

DeepReinforce@deep_reinforce·17 Mar

🤗 We also open-source the optimization code and fix a division-by-zero issue in null-scattering PDF computations for participating media. > Fix PR: github.com/mmp/pbrt-v4/pu… > Optim PR: github.com/mmp/pbrt-v4/pu…

English

0

2

132

DeepReinforce@deep_reinforce·17 Mar

🧑‍🍳New Use Case Drop! 🧐We used IterX to optimize pbrt-v4’s hottest CPU paths, tackling a core bottleneck in physically based rendering: the cost of doing more work per ray than necessary during traversal, intersection, spectrum math, and volumetric sampling. 🥳On an AMD EPYC 7402 24-Core (8 threads), at 16 spp, across 8 scenes, our optimization delivered an average speedup of 11.8%, with volumetric scenes improving by about 14%. 🎁We also offer unlimited credits for all users! 🫡Thanks for this amazing community @GPUOpen @seanbax @KostasAAA @stigatle @marcosalvi @adyaman #AMD #IterX #DeepReinforce

🥳Introducing IterX: an automated system for deep code optimization using reinforcement learning. 🧐Simply define a reward function, and IterX automatically iterates toward the optimal solution through thousands of trials and explorations using RL. 🎁Every new user receives 30M free tokens. We can’t wait to see what you build with IterX. 🧵

English

2

0

6

725

DeepReinforce@deep_reinforce·17 Mar

☺️ Stay tuned!!

English

75

DeepReinforce@deep_reinforce·26 Şub

🧵2/2: 🎁We offer unlimited credits for all participants. > Please mention “MLSys 2026 FlashInfer” in the form. You will receive unlimited credits. iterx.deep-reinforce.com/earn-rewards

English

3

195

DeepReinforce@deep_reinforce·26 Şub

🧵1/2: Recipe of our solution: github.com/deepreinforce-…

English

0

3

205

DeepReinforce@deep_reinforce·26 Şub

🥳IterX for MLSys 2026 NVIDIA Track Fused MoE 🧑‍🍳IterX achieves a 15.62× speedup on H100 and 14.84× on B200, significantly surpassing GPT-5.2 Pro and Claude 4.6 Opus on the Fused MoE setting of FlashInfer AI Kernel generation contest. 🎁We offer unlimited credits for all participants. Come and Join! 🤗We’ve also open-sourced the full recipe to reproduce our results. All MLSys 2026 challenge participants are welcome to build on top of it. #NVIDIAGTC

English

3

13

856

DeepReinforce@deep_reinforce·10 Şub

Feedback is appreciated !!

English

0

5

461

DeepReinforce@deep_reinforce·10 Şub

🚀Major IterX upgrade: Agent integration is now supported! 🔹Optimizing Hardcore Code: SOTA in Infra, CUDA, Smart Contracts, DBs, AI/ML Ops and beyond. 🔹No Manual Code: Agents (claude code, cursor) handle the integration. Effortless Onboarding! #CUDA #AI

🥳Introducing IterX: an automated system for deep code optimization using reinforcement learning. 🧐Simply define a reward function, and IterX automatically iterates toward the optimal solution through thousands of trials and explorations using RL. 🎁Every new user receives 30M free tokens. We can’t wait to see what you build with IterX. 🧵

English

38

29

149

1.1M

DeepReinforce@deep_reinforce·10 Şub

🔗iterx.deep-reinforce.com/run

QME

12

11.4K

DeepReinforce@deep_reinforce·3 Şub

@SohomScalesX thanks !!

English