⬣PulseChain LIVE⬣ 💥

2.6K posts


@PulseChainLIVE

#AI #Cybersecurity #Linux #privacy

Middle of the GPU
Joined January 2023
864 Following · 1.9K Followers
⬣PulseChain LIVE⬣ 💥 reposted
David Hendrickson@TeksEdge·
🚀 Nemotron-Cascade-2 was built from the raw underperforming Nemotron-3-Nano-30B-A3B-Base (zero tuning) & Benchmaxxed Hard to beat Qwen3.5-35B-A3B. ⁉️ Is this bad if open sourced? Here’s what happened 👇
✅ They loaded it with 4.4M competition math samples + 816K proofs
✅ Heavy SFT + Cascade RL (GRPO + MOPD distillation)
✅ Teachers: DeepSeek-V3.2, GPT-OSS-120B & Qwen3-235B
✅ Forced step-by-step thinking + Python tool calling
🎯 Goal: Turn a weak base into a tiny math olympiad monster (30B total / 3B active MoE)
🏆 Results (👀 verified):
• IMO 2025: 35/42 (Gold medal performance)
• AIME 2025: 92.4% (98.6% with tools) — beats Qwen 91.9%
• HMMT Feb 2025: 94.6%
• LiveCodeBench v6: 87.2%, which crushes Qwen by 12+ pts
❓Is that so bad? Hell no. It’s brilliant specialization! They engineered a 30B math genius instead of another generic chatbot. @NVIDIA just showed smart post-training > raw scale. 🔥
Wei Ping@_weiping

🚀 Introducing Nemotron-Cascade 2 🚀
Just 3 months after Nemotron-Cascade 1, we’re releasing Nemotron-Cascade 2: an open 30B MoE with 3B active parameters, delivering best-in-class reasoning and strong agentic capabilities.
🥇 Gold Medal-level performance on IMO 2025, IOI 2025, and ICPC World Finals 2025:
• Capabilities once thought achievable only by frontier proprietary models (e.g. Gemini Deep Think) or frontier-scale open models (e.g. DeepSeek-V3.2-Speciale-671B-A37B).
• Remarkably high intelligence density with 20× fewer parameters.
🏆 Best-in-class across math, code reasoning, alignment, and instruction following:
• Outperforms the latest Qwen3.5-35B-A3B (2026-02-24) and the even larger Qwen3.5-122B-A10B (2026-03-11).
🧠 Powered by Cascade RL + multi-domain on-policy distillation:
• Significantly expands Cascade RL across a much broader range of reasoning and agentic domains than Nemotron-Cascade 1, while distilling from the strongest intermediate teacher models throughout training to recover regressions and sustain gains.
🤗 Model + SFT + RL data: 👉 huggingface.co/collections/nv…
📄 Technical report: 👉 research.nvidia.com/labs/nemotron/…

3 replies · 1 repost · 29 likes · 2.5K views
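A quick sanity check (my arithmetic, not from the thread) on the “20× fewer parameters” claim above, comparing Nemotron-Cascade 2 (30B total / 3B active) against DeepSeek-V3.2-Speciale-671B-A37B (671B total / 37B active):

```python
# Back-of-envelope check of the "20x fewer parameters" claim.
# Parameter counts are taken from the model names in the thread.
nemotron_total, nemotron_active = 30e9, 3e9
deepseek_total, deepseek_active = 671e9, 37e9

total_ratio = deepseek_total / nemotron_total    # ratio of total params
active_ratio = deepseek_active / nemotron_active  # ratio of active params

print(f"total:  {total_ratio:.1f}x fewer")   # ~22.4x
print(f"active: {active_ratio:.1f}x fewer")  # ~12.3x
```

So “20×” roughly matches the total-parameter ratio; the active-parameter gap is closer to 12×.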
⬣PulseChain LIVE⬣ 💥 reposted
David Hendrickson@TeksEdge·
🚨 It’s Actually Happening! Jensen’s Vision of an Open Source Future is Here with @NVIDIA Nemotron-Cascade-2! Heavy testing is commencing.
@NVIDIA Nemotron-Cascade-2 vs Qwen3.5-35B-A3B
30B total / 3B active MoE vs 35B total / 3B active MoE
Same “intelligence density” size. Totally different results. ⚡
🏆 CONTEST MEDALS
• IMO 2025 → Gold (35/42 pts)
• IOI 2025 → 439.3 vs 348.6
• ICPC World Finals → Gold (10/12 problems)
📊 BENCHMARK BEATDOWN
• LiveCodeBench v6 → 87.2% (88.4% TIR) vs 74.6% 🔥
• ArenaHard v2 → 83.5% vs 65.4% (+18 pts)
• AIME 2025 → 92.4% vs 91.9%
• IFBench → 82.9% vs 70.2%
Where Qwen fights back
• SWE Verified → 60.5% vs 50.2%
• Knowledge (MMLU-Pro / GPQA) → Qwen edge
Bottom line: Nemotron-Cascade-2 delivers higher reasoning density on math, coding & agentic tasks while being fully open-weight + 1M context. NVIDIA’s Cascade RL + Multi-Domain Distillation is the cheat code. Open model that actually wins gold medals in 2025 competitions.
Which are you loading first? 👀👇
1 reply · 4 reposts · 33 likes · 3K views
⬣PulseChain LIVE⬣ 💥 reposted
Dr. Clown, PhD@DrClownPhD·
We are so fvcked...
608 replies · 1.3K reposts · 9.8K likes · 923.6K views
OpenCode@opencode·
MiniMax M2.7 available in Go
- Better at complex tasks than M2.5
- Fast - give it a plan and it runs with it
- Self-evolution - do a task > check results > fix mistakes > try again
57 replies · 46 reposts · 1.8K likes · 127.6K views
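The “do a task > check results > fix mistakes > try again” loop above is a generic agent self-correction pattern. A minimal sketch, with all function names hypothetical (the actual OpenCode/MiniMax implementation is not shown in this thread):

```python
# Generic self-correction loop: attempt a task, verify the result,
# feed any failure back into the next attempt, and retry.
def self_evolve(task, attempt_fn, check_fn, max_rounds=3):
    feedback = None
    for _ in range(max_rounds):
        result = attempt_fn(task, feedback)  # model attempts the task
        ok, feedback = check_fn(result)      # verify and produce feedback
        if ok:
            return result                    # success: stop iterating
    return result                            # best effort after retries

# Toy usage: "repair" a number until it is even.
result = self_evolve(
    task=3,
    attempt_fn=lambda t, fb: t if fb is None else fb,
    check_fn=lambda r: (r % 2 == 0, r + 1),
)
print(result)  # 4
```

The key design point is that the checker returns both a verdict and actionable feedback, so each retry starts from the corrected state instead of the original input.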
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@steeve First time I’ve read about your project. When you release sm121 I will test it on DGX Spark and post benchmark results here. Thanks.
0 replies · 0 reposts · 0 likes · 28 views
Steeve Morin@steeve·
Yesterday I built 33,000 flash attention kernels in about 4 minutes, from sm80 to sm120, for x86_64 and arm64. From my Mac. Bazel is wild, man.
3 replies · 3 reposts · 74 likes · 8.2K views
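A kernel count in the tens of thousands typically comes from a cross-product build matrix: each compiled kernel is one combination of compile-time parameters. A purely illustrative sketch (the axes and sizes here are hypothetical, not Steeve’s actual matrix) of how ~33,000 variants arise:

```python
# Hypothetical build matrix: every combination of these compile-time
# parameters is one kernel to compile. Numbers chosen only to show
# how the cross-product reaches ~33,000.
from itertools import product

cuda_archs = ["sm80", "sm86", "sm89", "sm90", "sm100", "sm120"]  # 6 GPU archs
host_platforms = ["x86_64", "arm64"]                             # 2 host CPUs
head_dims = [64, 96, 128, 192, 256]                              # 5 head sizes
dtypes = ["fp16", "bf16", "fp8"]                                 # 3 dtypes
causal = [True, False]                                           # 2 mask modes
tile_configs = range(92)  # stand-in for tile/stage/layout variants

variants = list(product(cuda_archs, host_platforms, head_dims,
                        dtypes, causal, tile_configs))
print(len(variants))  # 6 * 2 * 5 * 3 * 2 * 92 = 33,120 kernels
```

A build system like Bazel can cache and parallelize this matrix, which is how such a count compiles in minutes rather than hours.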
⬣PulseChain LIVE⬣ 💥
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
🤬🤬🤬
klöss@kloss_xyz

let me explain the ramifications of this…
→ 150,000 people just got locked out of their own cars… across 46 states… for 6 days straight and counting
→ not a software bug. not a glitch. not AI permissions gone wrong.
→ hackers flooded Intoxalock’s servers and all these vehicles just stopped starting…
→ these are court-ordered breathalyzer devices… people who messed up in the past but have been doing everything right since (hopefully)… and now they can’t drive to work because someone else’s security system failed
wild. connect the dots…
your electric car talks to a server to start. one breach and it’s a 50,000 dollar paperweight
your insulin pump syncs to a server. your pacemaker data lives on a server. one breach and it’s not a car that stops working… it’s a body
your smart home lock runs through a server. one breach and your front door either won’t open or won’t close
now zoom out…
Gartner projects $2.5 trillion going into AI this year… only $240 billion into securing the systems it runs on. that’s a 10 to 1 bet that nothing goes wrong
the four biggest tech companies (Alphabet, Microsoft, Meta, and Amazon) are rumored to spend $700 billion on AI infrastructure this year alone… while cybercrime is projected to cost the world $10.5 trillion
now imagine this happens to Tesla. to a hospital network. to the power grid…
every new AI integration is a new attack surface. every API is a new door. every device that “talks to the cloud” is one more thing that can be turned off by someone you’ll never meet
and I’m not saying every one of these systems will experience something. who really knows what’s secure or isn’t. but if you’re building right now… security isn’t the last layer you add. it’s the first one.
→ 150,000 people have just found out what happens when nobody prioritizes that… archaic government systems and legacy businesses are likely first on the chopping block
I hope the rest of us continuously learn from it instead of living it. the weakest link in every system is the one nobody bothered to secure
like what wild system vulnerability will we see next? does someone hack Area 51?

0 replies · 0 reposts · 0 likes · 47 views
David Hendrickson@TeksEdge·
The Ultimate 128GB Local AI Hardware Battle 🥊💻 Judging Qwen3.5-27B (Bartowski IQ4_NL) on top unified-memory machines:
1️⃣ @AMD Strix Halo (Ryzen AI Max+ 395) 💰 ~$2,500 | 🚀 9–12 tps (decode) | 🎮 Full Windows AAA gaming 🏆 Speed + value king. 🖕
2️⃣ @Apple Mac Studio M3 Ultra 💰 ~$5,000 | 🚀 8–12 tps | 🍎 Apple macOS & ecosystem w/ solid speeds; limited AAA gaming
3️⃣ @NVIDIA DGX Spark (GB10 Blackwell) 💰 $4,699 | 🚀 ~10 tps (~20 tps with a 2-node pair) | 🐧 Linux/AI research only, w/ strong prefill but bandwidth-limited decode. Difficult pooling (new cables may fix). AAA gaming not optimized on Grace
Verdict: AMD wins for most power users with the best speed/price/gaming combo. (Community benchmarks; YMMV with setup/context)
Which are you buying? 👇
51 replies · 30 reposts · 380 likes · 73.6K views
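The “bandwidth-limited decode” point above follows from roofline arithmetic: single-stream decode must read roughly the full active weights once per token, so tokens/s is capped near memory bandwidth divided by weight bytes. A rough sketch, assuming ~15 GB of weights for a 27B model at ~4.5 bits/param (IQ4_NL) and commonly cited bandwidth figures; measured speeds land well below these ceilings:

```python
# Roofline upper bound on decode speed: one full pass over the
# weights per generated token. All numbers are assumptions; real
# throughput depends on runtime, context length, and KV cache.
def est_decode_tps(mem_bw_gbs: float, weights_read_gb: float) -> float:
    """Ceiling on tokens/s when decode is memory-bandwidth bound."""
    return mem_bw_gbs / weights_read_gb

weights_gb = 15.0  # ~27B params at ~4.5 bits/param (assumed)

for name, bw in [("DGX Spark (~273 GB/s)", 273),
                 ("Strix Halo (~256 GB/s)", 256),
                 ("M3 Ultra (~819 GB/s)", 819)]:
    print(f"{name}: <= {est_decode_tps(bw, weights_gb):.0f} tok/s ceiling")
```

This is why the ~10 tps figures cluster together on the ~256–273 GB/s machines regardless of compute: decode is reading weights, not crunching FLOPs.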
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@spark_arena @TeksEdge @AMD @Apple BTW, Asus released an updated DGX OS recovery .iso a week ago that includes NVIDIA AI Workbench and seems to have better driver integration than before. NVIDIA still has the 6-month-old .iso on its website.
0 replies · 0 reposts · 0 likes · 22 views
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@spark_arena @TeksEdge @AMD @Apple Look at the data, not the conclusions, in that video and the StorageReview article. The Asus GX10 consumes 10% less power at the same speeds, has a full copper heat sink, and the unit is 20% heavier. I ran the GX10 GPU at 80W for inference with no thermal issues, and it’s much cheaper. For me the GX10 wins.
0 replies · 0 reposts · 1 like · 57 views
⬣PulseChain LIVE⬣ 💥@PulseChainLIVE·
@briancaffey @Teknium I tested Nemotron a few days ago, so I’m not 100% sure that I got 10 t/s or more, but I tried a REAP version of MiniMax M2.5 yesterday and got 10 t/s, and that’s about the same size but not MoE, so you are right.
0 replies · 0 reposts · 2 likes · 106 views
Brian Caffey@briancaffey·
@PulseChainLIVE @Teknium I’m getting about double that on my Spark, unless you aren’t counting <thinking> tokens, but I agree that smaller optimized models with higher concurrency are more fun :D
1 reply · 0 reposts · 1 like · 40 views
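The counting question in the exchange above matters more than it looks: for a reasoning model, whether you include hidden <thinking> tokens can roughly halve (or double) a reported t/s figure. A sketch with made-up token counts:

```python
# Illustrative only: how token accounting changes a reported t/s.
# If half the emitted tokens are hidden reasoning, counting only the
# visible answer halves the apparent decode speed.
wall_seconds = 60.0
thinking_tokens = 600  # assumed hidden <thinking> tokens
answer_tokens = 600    # assumed visible answer tokens

tps_all = (thinking_tokens + answer_tokens) / wall_seconds  # 20.0
tps_visible = answer_tokens / wall_seconds                  # 10.0
print(tps_all, tps_visible)
```

So two people can benchmark the same box honestly and report a 2× gap purely from accounting, which is consistent with the “about double” discrepancy here.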
Teknium (e/λ)@Teknium·
Just got an Nvidia Spark setup. Hermes Agent installed without any issues. Now let’s see what model it should be powered by 😉
39 replies · 7 reposts · 291 likes · 13.3K views