
DeepSeek
@deepseek_ai
Followers 973K · Following 32 · Media 87 · Statuses 140
Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Joined October 2023
To prevent any potential harm, we reiterate that @deepseek_ai is our sole official account on Twitter/X. Any accounts:
- representing us
- using identical avatars
- using similar names
are impersonations. Please stay vigilant to avoid being misled!
4K · 6K · 78K
DeepSeek-V3-0324 is out now!
- Major boost in reasoning performance
- Stronger front-end development skills
- Smarter tool-use capabilities
For non-complex reasoning tasks, we recommend using V3 - just turn off "DeepThink". API usage remains unchanged. Models are
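As a rough illustration of the "turn off DeepThink for non-complex tasks" advice, here is a minimal client-side sketch. It assumes DeepSeek's OpenAI-compatible endpoint at https://api.deepseek.com and the model names "deepseek-chat" (V3, DeepThink off) and "deepseek-reasoner" (DeepThink on); the endpoint and model names are assumptions, not stated in this post, so verify them against the official API docs.

```python
# Sketch only: assumes an OpenAI-compatible DeepSeek endpoint and the model
# names "deepseek-chat" (V3, DeepThink off) / "deepseek-reasoner" (DeepThink on).
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",       # placeholder
    base_url="https://api.deepseek.com",   # assumed endpoint
)

def ask(prompt: str, deep_think: bool = False) -> str:
    """Route non-complex prompts to V3 (chat); reserve the reasoner for hard ones."""
    model = "deepseek-reasoner" if deep_think else "deepseek-chat"
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

print(ask("Summarize the difference between NVLink and RDMA in two sentences."))
```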
679 · 2K · 12K
Day 6 of #OpenSourceWeek: One More Thing - DeepSeek-V3/R1 Inference System Overview
Optimized throughput and latency via:
- Cross-node EP-powered batch scaling
- Computation-communication overlap
- Load balancing
Statistics of DeepSeek's Online Service: 73.7k/14.8k
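To make "computation-communication overlap" concrete, the toy sketch below pipelines micro-batches so that one micro-batch's (simulated) all-to-all transfer runs while another's compute proceeds. It is a scheduling illustration only, with made-up compute/all_to_all stand-ins, not the actual V3/R1 inference system.

```python
# Conceptual sketch of computation-communication overlap (not DeepSeek's code).
# Micro-batches are interleaved so the expert all-to-all of one overlaps with
# the expert compute of another. compute() and all_to_all() are placeholders.
import time
from concurrent.futures import ThreadPoolExecutor

def all_to_all(mb: int) -> int:   # simulated cross-node EP dispatch/combine
    time.sleep(0.2)
    return mb

def compute(mb: int) -> int:      # simulated expert FFN compute
    time.sleep(0.2)
    return mb

def serial(micro_batches):
    for mb in micro_batches:
        compute(all_to_all(mb))

def overlapped(micro_batches):
    with ThreadPoolExecutor(max_workers=1) as comm:
        pending = comm.submit(all_to_all, micro_batches[0])
        for nxt in micro_batches[1:]:
            ready = pending.result()               # wait for previous transfer
            pending = comm.submit(all_to_all, nxt)  # launch the next transfer...
            compute(ready)                          # ...while computing on this one
        compute(pending.result())

for fn in (serial, overlapped):
    t0 = time.time()
    fn(list(range(6)))
    print(f"{fn.__name__}: {time.time() - t0:.2f}s")
```

With 6 micro-batches the overlapped schedule finishes in roughly 1.4s versus 2.4s serially, which is the whole point of hiding the transfer behind compute.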
787 · 1K · 9K
Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access
Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks.
- 6.6 TiB/s aggregate read throughput in a 180-node cluster
- 3.66 TiB/min
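For context, splitting the quoted 6.6 TiB/s aggregate across the quoted 180 nodes gives the average per-node read bandwidth; the figure below is plain unit arithmetic, not a number from the post.

```python
# Back-of-the-envelope: average per-node read bandwidth implied by the post's
# aggregate figure (6.6 TiB/s over a 180-node cluster). Pure unit arithmetic.
aggregate_tib_per_s = 6.6
nodes = 180

per_node_gib_per_s = aggregate_tib_per_s * 1024 / nodes   # 1 TiB = 1024 GiB
print(f"~{per_node_gib_per_s:.1f} GiB/s of read bandwidth per node on average")
# -> ~37.5 GiB/s
```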
532 · 1K · 11K
Day 4 of #OpenSourceWeek: Optimized Parallelism Strategies
- DualPipe - a bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training
- EPLB - an expert-parallel load balancer for V3/R1
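To give a feel for what an expert-parallel load balancer does, here is a minimal greedy sketch: replicate the hottest experts and place expert replicas on GPUs so per-GPU token load evens out. The heuristic, the toy load numbers, and the function names are illustrative and are not taken from the EPLB repository.

```python
# Conceptual sketch of expert-parallel load balancing (not the EPLB algorithm).
# Greedy heuristic: replicate the hottest experts, then place expert replicas
# largest-load-first onto the currently least-loaded GPU.
import heapq

def balance(expert_load: dict[str, float], num_gpus: int, replicate: int = 2):
    # Replicate the hottest experts and split their load across replicas.
    hottest = sorted(expert_load, key=expert_load.get, reverse=True)[:replicate]
    shards = []
    for name, load in expert_load.items():
        k = 2 if name in hottest else 1
        shards += [(load / k, f"{name}#{i}") for i in range(k)]

    # Largest-first greedy placement onto the least-loaded GPU so far.
    gpus = [(0.0, g, []) for g in range(num_gpus)]
    heapq.heapify(gpus)
    for load, shard in sorted(shards, reverse=True):
        total, g, placed = heapq.heappop(gpus)
        heapq.heappush(gpus, (total + load, g, placed + [shard]))
    return sorted(gpus, key=lambda x: x[1])

load = {"e0": 9.0, "e1": 7.0, "e2": 3.0, "e3": 2.0, "e4": 2.0, "e5": 1.0}
for total, gpu, placed in balance(load, num_gpus=2):
    print(f"GPU {gpu}: load={total:.1f} experts={placed}")
```

On this toy input both GPUs end up with a load of 12.0, whereas naive round-robin placement of unreplicated experts would leave one GPU far hotter than the other.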
451 · 841 · 6K
Day 3 of #OpenSourceWeek: DeepGEMM
Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/R1 training and inference.
- Up to 1350+ FP8 TFLOPS on Hopper GPUs
- No heavy dependency, as clean as a tutorial
- Fully Just-In-Time compiled
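As a concept-level illustration of a scaled low-precision GEMM (not DeepGEMM's API or kernel), the NumPy sketch below quantizes with per-row/per-column scales, multiplies in the quantized domain, and dequantizes the result. NumPy has no FP8 type, so an int8 grid stands in for the e4m3 format; all names here are made up.

```python
# Concept sketch of a scaled low-precision GEMM (NOT DeepGEMM).
# int8 stands in for FP8: per-row scales for A, per-column scales for B,
# multiply in the quantized domain, dequantize via the outer product of scales.
import numpy as np

def quantize(x: np.ndarray, axis: int):
    scale = np.max(np.abs(x), axis=axis, keepdims=True) / 127.0 + 1e-12
    q = np.round(x / scale).astype(np.int8)
    return q, scale

def scaled_gemm(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    qa, sa = quantize(a, axis=1)                       # one scale per row of A
    qb, sb = quantize(b, axis=0)                       # one scale per column of B
    acc = qa.astype(np.int32) @ qb.astype(np.int32)    # low-precision accumulate
    return acc * (sa * sb)                             # dequantize: (M,1)*(1,N)

rng = np.random.default_rng(0)
a, b = rng.standard_normal((64, 128)), rng.standard_normal((128, 32))
err = np.abs(scaled_gemm(a, b) - a @ b).max()
print(f"max abs error vs float64 GEMM: {err:.4f}")
```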
473 · 1K · 7K
Day 2 of #OpenSourceWeek: DeepEP
Excited to introduce DeepEP - the first open-source EP communication library for MoE model training and inference.
- Efficient and optimized all-to-all communication
- Both intranode and internode support with NVLink and RDMA
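The sketch below shows the pattern an EP all-to-all serves: each rank groups its tokens by the rank that owns the routed expert, exchanges the groups, and later returns expert outputs to the originating rank and slot. It is a single-process simulation of the communication pattern, not DeepEP's API; the helper names are made up.

```python
# Single-process simulation of the MoE expert-parallel all-to-all pattern
# (dispatch tokens to the rank owning their routed expert, then combine the
# expert outputs back). Illustrative only; this is not DeepEP's API.
import numpy as np

RANKS = 4                  # one expert group per rank, for simplicity
DIM = 8                    # toy hidden size

def dispatch(tokens_per_rank, routing_per_rank):
    """All-to-all: bucket each rank's tokens by destination rank."""
    inbox = [[] for _ in range(RANKS)]
    for src, (tokens, dests) in enumerate(zip(tokens_per_rank, routing_per_rank)):
        for slot, dst in enumerate(dests):
            inbox[dst].append((src, slot, tokens[slot]))
    return inbox

def combine(inbox, expert_outputs, num_tokens_per_rank):
    """Reverse all-to-all: send expert outputs back to the source rank/slot."""
    out = [np.zeros((n, DIM)) for n in num_tokens_per_rank]
    for dst, entries in enumerate(inbox):
        for (src, slot, _), y in zip(entries, expert_outputs[dst]):
            out[src][slot] = y
    return out

rng = np.random.default_rng(0)
tokens = [rng.standard_normal((5, DIM)) for _ in range(RANKS)]    # 5 tokens/rank
routing = [rng.integers(0, RANKS, size=5) for _ in range(RANKS)]  # top-1 routing

inbox = dispatch(tokens, routing)
expert_out = [[x * 2.0 for (_, _, x) in entries] for entries in inbox]  # toy expert
combined = combine(inbox, expert_out, [5] * RANKS)
assert np.allclose(combined[0], tokens[0] * 2.0)
print("dispatch/combine round-trip OK")
```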
519 · 1K · 8K
Day 1 of #OpenSourceWeek: FlashMLA
Honored to share FlashMLA - our efficient MLA decoding kernel for Hopper GPUs, optimized for variable-length sequences and now in production.
- BF16 support
- Paged KV cache (block size 64)
- 3000 GB/s memory-bound & 580 TFLOPS
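To show what "paged KV cache (block size 64)" refers to, here is a minimal block-table sketch: keys/values live in fixed-size physical blocks and a per-sequence table maps logical token positions to (block, offset), so variable-length sequences never need contiguous cache memory. This is a data-structure illustration, not FlashMLA's implementation; names and shapes are made up.

```python
# Minimal paged KV cache sketch (data-structure illustration, not FlashMLA).
import numpy as np

BLOCK_SIZE = 64            # tokens per block, matching the post
HEAD_DIM = 16              # toy head dimension

class PagedKVCache:
    def __init__(self, num_blocks: int):
        self.k = np.zeros((num_blocks, BLOCK_SIZE, HEAD_DIM))
        self.v = np.zeros((num_blocks, BLOCK_SIZE, HEAD_DIM))
        self.free = list(range(num_blocks))
        self.block_tables: dict[int, list[int]] = {}   # seq_id -> physical blocks
        self.lengths: dict[int, int] = {}              # seq_id -> tokens written

    def append(self, seq_id: int, k: np.ndarray, v: np.ndarray) -> None:
        """Append one token's K/V, allocating a new block at each boundary."""
        pos = self.lengths.setdefault(seq_id, 0)
        table = self.block_tables.setdefault(seq_id, [])
        if pos % BLOCK_SIZE == 0:                      # crossed a block boundary
            table.append(self.free.pop())
        block, offset = table[pos // BLOCK_SIZE], pos % BLOCK_SIZE
        self.k[block, offset], self.v[block, offset] = k, v
        self.lengths[seq_id] = pos + 1

    def gather(self, seq_id: int) -> tuple[np.ndarray, np.ndarray]:
        """Materialize the sequence's K/V in logical order (for checking)."""
        n, table = self.lengths[seq_id], self.block_tables[seq_id]
        idx = [(table[i // BLOCK_SIZE], i % BLOCK_SIZE) for i in range(n)]
        rows, cols = zip(*idx)
        return self.k[rows, cols], self.v[rows, cols]

cache = PagedKVCache(num_blocks=8)
for t in range(130):                                   # 130 tokens -> 3 blocks
    cache.append(seq_id=0, k=np.full(HEAD_DIM, t), v=np.full(HEAD_DIM, -t))
k, _ = cache.gather(0)
print(k.shape, cache.block_tables[0])                  # (130, 16) and 3 block ids
```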
561 · 1K · 11K
Day 0: Warming up for #OpenSourceWeek!
We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,
1K · 3K · 21K