deepseek_ai Profile Banner
DeepSeek Profile
DeepSeek

@deepseek_ai

Followers
973K
Following
32
Media
87
Statuses
140

Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.

Joined October 2023
Don't wanna be here? Send us removal request.
@deepseek_ai
DeepSeek
5 months
To prevent any potential harm, we reiterate that @deepseek_ai is our sole official account on Twitter/X. Any accounts:.- representing us.- using identical avatars.- using similar names.are impersonations. Please stay vigilant to avoid being misled!.
4K
6K
78K
@deepseek_ai
DeepSeek
1 month
πŸš€ DeepSeek-R1-0528 is here!. πŸ”Ή Improved benchmark performance.πŸ”Ή Enhanced front-end capabilities.πŸ”Ή Reduced hallucinations.πŸ”Ή Supports JSON output & function calling. βœ… Try it now: πŸ”Œ No change to API usage β€” docs here: πŸ”—
Tweet media one
Tweet media two
484
2K
10K
@deepseek_ai
DeepSeek
3 months
πŸš€ DeepSeek-V3-0324 is out now!. πŸ”Ή Major boost in reasoning performance.πŸ”Ή Stronger front-end development skills.πŸ”Ή Smarter tool-use capabilities. βœ… For non-complex reasoning tasks, we recommend using V3 β€” just turn off β€œDeepThink”.πŸ”Œ API usage remains unchanged.πŸ“œ Models are
Tweet media one
Tweet media two
679
2K
12K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview. Optimized throughput and latency via:.πŸ”§ Cross-node EP-powered batch scaling.πŸ”„ Computation-communication overlap.βš–οΈ Load balancing. Statistics of DeepSeek's Online Service:.⚑ 73.7k/14.8k.
787
1K
9K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access. Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks. ⚑ 6.6 TiB/s aggregate read throughput in a 180-node cluster.⚑ 3.66 TiB/min.
532
1K
11K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 4 of #OpenSourceWeek: Optimized Parallelism Strategies. βœ… DualPipe - a bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training. πŸ”— βœ… EPLB - an expert-parallel load balancer for V3/R1. πŸ”—.
451
841
6K
@deepseek_ai
DeepSeek
4 months
🚨 Off-Peak Discounts Alert!. Starting today, enjoy off-peak discounts on the DeepSeek API Platform from 16:30–00:30 UTC daily:. πŸ”Ή DeepSeek-V3 at 50% off.πŸ”Ή DeepSeek-R1 at a massive 75% off. Maximize your resources smarter β€” save more during these high-value hours!
Tweet media one
544
713
7K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 3 of #OpenSourceWeek: DeepGEMM. Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/R1 training and inference. ⚑ Up to 1350+ FP8 TFLOPS on Hopper GPUs.βœ… No heavy dependency, as clean as a tutorial.βœ… Fully Just-In-Time compiled.
473
1K
7K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 2 of #OpenSourceWeek: DeepEP. Excited to introduce DeepEP - the first open-source EP communication library for MoE model training and inference. βœ… Efficient and optimized all-to-all communication.βœ… Both intranode and internode support with NVLink and RDMA.βœ….
519
1K
8K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 1 of #OpenSourceWeek: FlashMLA. Honored to share FlashMLA - our efficient MLA decoding kernel for Hopper GPUs, optimized for variable-length sequences and now in production. βœ… BF16 support.βœ… Paged KV cache (block size 64).⚑ 3000 GB/s memory-bound & 580 TFLOPS.
561
1K
11K
@deepseek_ai
DeepSeek
4 months
πŸš€ Day 0: Warming up for #OpenSourceWeek! . We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,.
1K
3K
21K
@deepseek_ai
DeepSeek
5 months
πŸš€ Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference!. Core components of NSA:.β€’ Dynamic hierarchical sparse strategy.β€’ Coarse-grained token compression.β€’ Fine-grained token selection. πŸ’‘ With
Tweet media one
Tweet media two
Tweet media three
Tweet media four
901
2K
16K
@deepseek_ai
DeepSeek
5 months
πŸŽ‰ Excited to see everyone’s enthusiasm for deploying DeepSeek-R1! Here are our recommended settings for the best experience:. β€’ No system prompt.β€’ Temperature: 0.6.β€’ Official prompts for search & file upload: β€’ Guidelines to mitigate model bypass.
703
2K
16K
@deepseek_ai
DeepSeek
5 months
πŸ“’ Terminology Correction: DeepSeek-R1’s code and models are released under the MIT License.
340
74
916
@deepseek_ai
DeepSeek
6 months
🌐 API Access & Pricing. βš™οΈ Use DeepSeek-R1 by setting model=deepseek-reasoner.πŸ’° $0.14 / million input tokens (cache hit).πŸ’° $0.55 / million input tokens (cache miss).πŸ’° $2.19 / million output tokens. πŸ“– API guide: πŸ‹ 5/n
Tweet media one
Tweet media two
244
361
4K
@deepseek_ai
DeepSeek
6 months
πŸ› οΈ DeepSeek-R1: Technical Highlights. πŸ“ˆ Large-scale RL in post-training.πŸ† Significant performance boost with minimal labeled data.πŸ”’ Math, code, and reasoning tasks on par with OpenAI-o1.πŸ“„ More details: πŸ‹ 4/n
Tweet media one
241
833
5K
@deepseek_ai
DeepSeek
6 months
πŸ“œ License Update!. πŸ”„ DeepSeek-R1 is now MIT licensed for clear open access.πŸ”“ Open for the community to leverage model weights & outputs.πŸ› οΈ API outputs can now be used for fine-tuning & distillation. πŸ‹ 3/n.
79
423
5K
@deepseek_ai
DeepSeek
6 months
πŸ”₯ Bonus: Open-Source Distilled Models!. πŸ”¬ Distilled from DeepSeek-R1, 6 small models fully open-sourced.πŸ“ 32B & 70B models on par with OpenAI-o1-mini.🀝 Empowering the open-source community. 🌍 Pushing the boundaries of **open AI**!. πŸ‹ 2/n
Tweet media one
75
344
3K
@deepseek_ai
DeepSeek
6 months
πŸš€ DeepSeek-R1 is here!. ⚑ Performance on par with OpenAI-o1.πŸ“– Fully open-source model & technical report.πŸ† MIT licensed: Distill & commercialize freely!. 🌐 Website & API are live now! Try DeepThink at today!. πŸ‹ 1/n
Tweet media one
2K
7K
37K
@deepseek_ai
DeepSeek
6 months
⚠️ Important Notice:. βœ… 100% FREE - No ads, no in-app purchases.πŸ›‘οΈ Download only from official channels to avoid being misled.πŸ“² Search "DeepSeek" in your app store or visit our website for direct links. 🌟 3/3.
104
139
2K