DeepSeek
@deepseek_ai
Followers
974K
Following
32
Media
96
Statuses
157
Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Joined October 2023
β οΈ Heads-up to anyone using the DeepSeek-V3.2-Exp inference demo: earlier versions had a RoPE implementation mismatch in the indexer module that could degrade performance. Indexer RoPE expects non-interleaved input, MLA RoPE expects interleaved. Fixed in https://t.co/2BDzSyt1cW.
103
153
2K
π Open Source Release π¦ DeepSeek-V3.2 Model: https://t.co/Kh8HzHl3uX π¦ DeepSeek-V3.2-Speciale Model: https://t.co/ZKUg5IC0AJ π Tech report: https://t.co/hduS9hlMpX 5/n
huggingface.co
27
118
1K
π» API Update πΉ V3.2: Same usage pattern as V3.2-Exp. πΉ V3.2-Speciale: Served via a temporary endpoint: base_url=" https://t.co/WoEd0WDQ5x". Same pricing as V3.2, no tool calls, available until Dec 15th, 2025, 15:59 (UTC Time). π‘ V3.2 now supports Thinking in Tool-Use β
16
61
1K
Speak up. Spend freely. Tell Congress to create a de minimis tax exemption for Bitcoin that allows us to spend our money on gas, groceries, and everyday costs with no strings attached. Act today.
0
2
30
π€ Thinking in Tool-Use πΉ Introduces a new massive agent training data synthesis method covering 1,800+ environments & 85k+ complex instructions. πΉ DeepSeek-V3.2 is our first model to integrate thinking directly into tool-use, and also supports tool-use in both thinking and
9
68
1K
π World-Leading Reasoning πΉ V3.2: Balanced inference vs. length. Your daily driver at GPT-5 level performance. πΉ V3.2-Speciale: Maxed-out reasoning capabilities. Rivals Gemini-3.0-Pro. π₯ Gold-Medal Performance: V3.2-Speciale attains gold-level results in IMO, CMO, ICPC World
73
306
3K
π Launching DeepSeek-V3.2 & DeepSeek-V3.2-Speciale β Reasoning-first models built for agents! πΉ DeepSeek-V3.2: Official successor to V3.2-Exp. Now live on App, Web & API. πΉ DeepSeek-V3.2-Speciale: Pushing the boundaries of reasoning capabilities. API-only for now. π Tech
767
2K
16K
π Open Source Release π Model: https://t.co/kORJG3nCWN π Tech report: https://t.co/X8Wcqbhg5a π Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!) 4/n
github.com
Contribute to deepseek-ai/DeepSeek-V3.2-Exp development by creating an account on GitHub.
20
101
841
π» API Update π Lower costs, same access! π° DeepSeek API prices drop 50%+, effective immediately. πΉ For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://t.co/3RNKA89gHR πΉ Feedback welcome:
25
93
1K
β‘οΈ Efficiency Gains π€ DSA achieves fine-grained sparse attention with minimal impact on output quality β boosting long-context performance & reducing compute cost. π Benchmarks show V3.2-Exp performs on par with V3.1-Terminus. 2/n
9
49
693
π Introducing DeepSeek-V3.2-Exp β our latest experimental model! β¨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context. π Now live on App, Web, and API. π° API prices cut by 50%+! 1/n
326
922
7K
π DeepSeek-V3.1-Terminus delivers more stable & reliable outputs across benchmarks compared to the previous version. π Available now on: App / Web / API π Open-source weights here: https://t.co/Jh4RudofKm Thanks to everyone for your feedback. It drives us to keep improving
19
64
859
π DeepSeek-V3.1 β DeepSeek-V3.1-Terminus The latest update builds on V3.1βs strengths while addressing key user feedback. β¨ Whatβs improved? π Language consistency: fewer CN/EN mix-ups & no more random chars. π€ Agent upgrades: stronger Code Agent & Search Agent performance.
189
549
5K
If you work in finance, ignoring Ethereum is no longer optional. Come to ETHConf NYC to: - map the shift from legacy rails to programmable settlement - learn how L2s, RWAs & stablecoins change market structure - meet the people shaping the next generation of finance
0
0
0
Pricing Changes π³ πΉ New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time) πΉ Until then, APIs follow current pricing π Pricing page: https://t.co/IyYitNzedg 5/5
33
59
950
Model Update π€ πΉ V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3 πΉ Tokenizer & chat template updated β new tokenizer config: https://t.co/r3y717EVFp π V3.1 Base Open-source weights: https://t.co/5wlDui34hH π V3.1 Open-source weights:
14
49
880
Tools & Agents Upgrades π§° π Better results on SWE / Terminal-Bench π Stronger multi-step reasoning for complex search tasks β‘οΈ Big gains in thinking efficiency 3/5
10
53
844
API Update βοΈ πΉ deepseek-chat β non-thinking mode πΉ deepseek-reasoner β thinking mode π§΅ 128K context for both π Anthropic API format supported: https://t.co/DcWmJMA1CP β
Strict Function Calling supported in Beta API: https://t.co/jFhJQ4wyN3 π More API resources, smoother
13
43
819
PostgreSQL tuning is complex β and mostly manual. At AI DBA, Luigi Nardi (DBtune) shows how agentic AI can automatically tune PostgreSQL server parameters in real-world environments. Jan 23 β’ Free virtual conference Register for free
0
0
13
Introducing DeepSeek-V3.1: our first step toward the agent era! π π§ Hybrid inference: Think & Non-Think β one model, two modes β‘οΈ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 π οΈ Stronger agent skills: Post-training boosts tool use and
chat.deepseek.com
Chat with DeepSeek AI.
519
2K
15K
π DeepSeek-R1-0528 is here! πΉ Improved benchmark performance πΉ Enhanced front-end capabilities πΉ Reduced hallucinations πΉ Supports JSON output & function calling β
Try it now: https://t.co/IMbTch8Pii π No change to API usage β docs here: https://t.co/Qf97ASptDD π
564
2K
10K
π DeepSeek-V3-0324 is out now! πΉ Major boost in reasoning performance πΉ Stronger front-end development skills πΉ Smarter tool-use capabilities β
For non-complex reasoning tasks, we recommend using V3 β just turn off βDeepThinkβ π API usage remains unchanged π Models are
683
2K
12K
π Day 6 of #OpenSourceWeek: One More Thing β DeepSeek-V3/R1 Inference System Overview Optimized throughput and latency via: π§ Cross-node EP-powered batch scaling π Computation-communication overlap βοΈ Load balancing Statistics of DeepSeek's Online Service: β‘ 73.7k/14.8k
github.com
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation - deepseek-ai/open-infra-index
782
1K
9K
Earn unlimited cash back on your trades when you open a Lightspeed account!
38
22
475