Pascal2_22./
@Pascal2_22
Accelerating @Gradient_HQ. Advancing the OIS stack: ./ Open AI for a Sovereign Future. Make it you against the matrix


The right chart shows exactly how constrained labs redesign attention to need less HBM. DeepSeek didn't solve long context by throwing more memory at it. They redesigned how attention accumulates memory so the KV cache stays flat instead of growing linearly with context. That's architectural innovation under resource constraint, not the hardware brute force frontier labs default to.

The left chart shows performance: DeepSeek V4 Pro beating or matching Claude Opus 4.6, GPT-5.4 and Gemini 3.1 Pro across nearly every benchmark: knowledge, reasoning, agentic tasks. The gap between V4 and frontier closed-source models is marginal or nonexistent on most tasks.

The right chart shows efficiency: DeepSeek V4 Pro runs at 3.7x lower FLOPs than V3.2 at long context. V4 Flash runs at 9.8x lower. The KV cache, the memory that explodes as context grows, is 9.5x to 13.7x smaller. Same benchmark performance, a fraction of the compute and memory cost.

Frontier labs scale infrastructure to match model demands. DeepSeek scales architecture to outrun the hardware bill.

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: huggingface.co/deepseek-ai/De…
🤗 Open Weights: huggingface.co/collections/de…
1/n

Awesome to see @tryParallax’s distributed framework for heterogeneous machines being implemented and serving inference! Build and customize your own AI clusters like never before 🤖 ./ LFG @Gradient_HQ