DeepSeek

157 posts

@deepseek_ai

Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.

Joined October 2023
0 Following · 979.3K Followers
Pinned Tweet
DeepSeek @deepseek_ai
⚠️ Heads-up to anyone using the DeepSeek-V3.2-Exp inference demo: earlier versions had a RoPE implementation mismatch in the indexer module that could degrade performance. Indexer RoPE expects non-interleaved input; MLA RoPE expects interleaved input. Fixed in github.com/deepseek-ai/De….
213
169
2.2K
635.4K
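The layout mismatch described in the pinned tweet can be illustrated with a minimal sketch. This is generic RoPE in pure Python, not the code from the DeepSeek repository: the function names, the `base=10000.0` default, and the toy 4-channel vector are all assumptions made for illustration. It shows that the two conventions pair channels differently, so feeding data in one layout to a routine that expects the other silently rotates the wrong pairs.

```python
import math

def rope_angles(pos, dim, base=10000.0):
    # One rotation angle per channel pair: theta_i = pos * base^(-2i/dim).
    # base=10000.0 is the commonly used default, assumed here.
    return [pos * base ** (-2.0 * i / dim) for i in range(dim // 2)]

def rope_interleaved(x, pos, base=10000.0):
    # Interleaved layout: pairs are adjacent channels (x[0], x[1]), (x[2], x[3]), ...
    out = list(x)
    for i, theta in enumerate(rope_angles(pos, len(x), base)):
        a, b = x[2 * i], x[2 * i + 1]
        out[2 * i] = a * math.cos(theta) - b * math.sin(theta)
        out[2 * i + 1] = a * math.sin(theta) + b * math.cos(theta)
    return out

def rope_non_interleaved(x, pos, base=10000.0):
    # Non-interleaved layout: pairs span the two halves (x[0], x[d/2]), (x[1], x[d/2 + 1]), ...
    half = len(x) // 2
    out = list(x)
    for i, theta in enumerate(rope_angles(pos, len(x), base)):
        a, b = x[i], x[i + half]
        out[i] = a * math.cos(theta) - b * math.sin(theta)
        out[i + half] = a * math.sin(theta) + b * math.cos(theta)
    return out

def to_non_interleaved(x):
    # Reorder [a0, b0, a1, b1, ...] into [a0, a1, ..., b0, b1, ...].
    return x[0::2] + x[1::2]

x = [1.0, 2.0, 3.0, 4.0]   # toy vector, stored in interleaved layout
pos = 3

correct = to_non_interleaved(rope_interleaved(x, pos))
converted_first = rope_non_interleaved(to_non_interleaved(x), pos)
mismatched = rope_non_interleaved(x, pos)  # wrong: layout not converted first
```

Here `converted_first` matches `correct`, while `mismatched` does not, because the non-interleaved routine pairs channel 0 with channel 2 instead of channel 1. No error is raised in the mismatched case, which is consistent with the tweet describing degraded performance and not a crash.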
DeepSeek @deepseek_ai
🚀 Launching DeepSeek-V3.2 & DeepSeek-V3.2-Speciale: reasoning-first models built for agents!
🔹 DeepSeek-V3.2: Official successor to V3.2-Exp. Now live on App, Web & API.
🔹 DeepSeek-V3.2-Speciale: Pushing the boundaries of reasoning capabilities. API-only for now.
📄 Tech report: huggingface.co/deepseek-ai/De…
1/n
888
2.5K
16.1K
5.2M
DeepSeek @deepseek_ai
💻 API Update
🎉 Lower costs, same access!
💰 DeepSeek API prices drop 50%+, effective immediately.
🔹 For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: api-docs.deepseek.com/guides/compari…
🔹 Feedback welcome: feedback.deepseek.com/dsa
3/n
32
97
1K
357.6K
DeepSeek @deepseek_ai
🚀 Introducing DeepSeek-V3.2-Exp, our latest experimental model!
✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention (DSA) for faster, more efficient training & inference on long context.
👉 Now live on App, Web, and API.
💰 API prices cut by 50%+!
1/n
322
909
7.1K
1.4M
DeepSeek @deepseek_ai
📊 DeepSeek-V3.1-Terminus delivers more stable & reliable outputs across benchmarks compared to the previous version.
👉 Available now on: App / Web / API
🔗 Open-source weights here: huggingface.co/deepseek-ai/De…
Thanks to everyone for your feedback. It drives us to keep improving and refining the experience! 🚀
2/2
22
65
872
142.6K
DeepSeek @deepseek_ai
🚀 DeepSeek-V3.1 → DeepSeek-V3.1-Terminus
The latest update builds on V3.1’s strengths while addressing key user feedback.
✨ What’s improved?
🌐 Language consistency: fewer CN/EN mix-ups & no more random chars.
🤖 Agent upgrades: stronger Code Agent & Search Agent performance.
1/2
188
540
4.6K
822.3K
DeepSeek @deepseek_ai
Pricing Changes 💳
🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time)
🔹 Until then, APIs follow current pricing
📝 Pricing page: api-docs.deepseek.com/quick_start/pr…
5/5
33
57
945
175.6K
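Because the cutover in the pricing tweet above is stated in UTC, callers in other time zones may find it useful to compare timezone-aware instants instead of local wall-clock times. The sketch below is illustrative only: `PRICING_CUTOVER` and `pricing_regime` are hypothetical names, and treating the exact cutover instant as already belonging to the new price list is an assumption, since the tweet does not specify the boundary.

```python
from datetime import datetime, timedelta, timezone

# Cutover instant quoted in the tweet: Sep 5th, 2025, 16:00 (UTC Time).
PRICING_CUTOVER = datetime(2025, 9, 5, 16, 0, tzinfo=timezone.utc)

def pricing_regime(at):
    """Return 'current' before the cutover and 'new' from the cutover onward.

    `at` must be timezone-aware so that it can be compared with the UTC cutover.
    """
    if at.tzinfo is None:
        raise ValueError("expected a timezone-aware datetime")
    return "new" if at >= PRICING_CUTOVER else "current"

# Example with a UTC+8 clock: 16:00 UTC falls on midnight of Sep 6th locally.
utc_plus_8 = timezone(timedelta(hours=8))
before = pricing_regime(datetime(2025, 9, 5, 23, 59, tzinfo=utc_plus_8))
after = pricing_regime(datetime(2025, 9, 6, 0, 0, tzinfo=utc_plus_8))
```

In this example `before` is `"current"` and `after` is `"new"`, even though the first timestamp falls on the same calendar date as the announced cutover.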
DeepSeek @deepseek_ai
Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀
🧠 Hybrid inference: Think & Non-Think (one model, two modes)
⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528
🛠️ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks
Try it now by toggling Think/Non-Think via the "DeepThink" button: chat.deepseek.com
1/5
510
1.8K
15K
2.1M
DeepSeek @deepseek_ai
🚀 DeepSeek-V3-0324 is out now!
🔹 Major boost in reasoning performance
🔹 Stronger front-end development skills
🔹 Smarter tool-use capabilities
✅ For non-complex reasoning tasks, we recommend using V3: just turn off “DeepThink”
🔌 API usage remains unchanged
📜 Models are now released under the MIT License, just like DeepSeek-R1!
🔗 Open-source weights: huggingface.co/deepseek-ai/De…
681
1.9K
11.8K
1.6M
DeepSeek @deepseek_ai
🚀 Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview
Optimized throughput and latency via:
🔧 Cross-node EP-powered batch scaling
🔄 Computation-communication overlap
⚖️ Load balancing
Statistics of DeepSeek's Online Service:
⚡ 73.7k/14.8k input/output tokens per second per H800 node
🚀 Cost profit margin 545%
💡 We hope this week's insights offer value to the community and contribute to our shared AGI goals.
📖 Deep Dive: bit.ly/4ihZUiO
778
1.2K
9.2K
4M
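The service statistics in the Day 6 tweet above can be unpacked with a little arithmetic. The sketch below makes two assumptions that the tweet does not spell out: that "cost profit margin" means profit divided by cost, and that the per-node token rates are sustained for a full 24 hours. The helper names are hypothetical.

```python
SECONDS_PER_DAY = 86_400

def revenue_multiple(cost_profit_margin_pct):
    # Assumes margin = (revenue - cost) / cost, so revenue = cost * (1 + margin).
    return 1.0 + cost_profit_margin_pct / 100.0

def tokens_per_day(tokens_per_second):
    # Assumes the quoted per-node rate holds for a full day.
    return tokens_per_second * SECONDS_PER_DAY

multiple = revenue_multiple(545)          # revenue per unit of cost
input_per_day = tokens_per_day(73_700)    # input tokens per H800 node per day
output_per_day = tokens_per_day(14_800)   # output tokens per H800 node per day
```

Under these assumptions, a 545% cost profit margin corresponds to revenue of about 6.45 times cost, and the quoted rates work out to roughly 6.4 billion input tokens and 1.3 billion output tokens per node per day.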