DeepSeek-R1
671B
Model parameter quantity
DeepSeek
Affiliated organization
Open Source
License Type
January 20, 2025
Release time
Model Introduction
DeepSeek-R1 extensively utilized reinforcement learning techniques during the post-training phase, significantly enhancing the model's reasoning capabilities with only a minimal amount of annotated data. On tasks involving mathematics, coding, and natural language inference, its performance is on par with the official release of OpenAI's o1.
Comprehensive score
Language dialogue
Knowledge reserve
Reasoning association
Mathematical calculation
Code writing
Command following
Swipe left and right to view more


Language comprehension ability
Capable of understanding complex contexts and generating logically coherent sentences, though occasionally off in tone control.
7.5


Knowledge coverage scope
Covers more than 200 specialized fields, integrating the latest research findings and cross-cultural knowledge in real time.
9.0


Reasoning ability
Can perform logical reasoning with more than three steps, though efficiency drops when handling nonlinear relationships.
8.5
Model comparison
DeepSeek-R1 vs Qwen2.5-7B-Instruct
Like Qwen2, the Qwen2.5 language models support up to 128K tokens and can generate up to 8K tokens. They also maintain multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
DeepSeek-R1 vs Gemini-2.5-Pro-Preview-05-06
Gemini 2.5 Pro is a model released by Google DeepMind artificial intelligence research team, using version number Gemini-2.5-Pro-Preview-05-06.
DeepSeek-R1 vs GPT-4o-mini-20240718
GPT-4o-mini is an API model produced by OpenAI, with the specific version number being gpt-4o-mini-2024-07-18.
DeepSeek-R1 vs Doubao-1.5-thinking-pro-250415
The new deep thinking model Doubao-1.5 performs outstandingly in professional fields such as mathematics, programming, scientific reasoning, and general tasks such as creative writing. It has reached or is close to the industry's top tier level on multiple authoritative benchmarks such as AIME 2024, Codeforces, and GPQA
Related model
DeepSeek-V2-Chat-0628
DeepSeek-V2 is a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token. Compared with DeepSeek 67B, DeepSeek-V2 achieves stronger performance, and meanwhile saves 42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum generation throughput to 5.76 times.
DeepSeek-V2.5
DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions.
DeepSeek-V3-0324
DeepSeek-V3 outperforms other open-source models such as Qwen2.5-72B and Llama-3.1-405B in multiple evaluations and matches the performance of top-tier closed-source models like GPT-4 and Claude-3.5-Sonnet.
DeepSeek-V2-Lite-Chat
DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model presented by DeepSeek, DeepSeek-V2-Lite is a lite version of it.
DeepSeek-V2-Chat
DeepSeek-V2 is a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token. Compared with DeepSeek 67B, DeepSeek-V2 achieves stronger performance, and meanwhile saves 42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum generation throughput to 5.76 times.
Relevant documents
AI-Powered NoteGPT Transforms YouTube Learning Experience
In today’s fast-moving world, effective learning is essential. NoteGPT is a dynamic Chrome extension that revolutionizes how you engage with YouTube content. By harnessing AI, it offers concise summar
Community Union and Google Partner to Boost AI Skills for UK Workers
Editor’s Note: Google has teamed up with Community Union in the UK to demonstrate how AI skills can enhance the capabilities of both office and operational workers. This pioneering program is part of
Magi-1 Unveils Revolutionary Open-Source AI Video Generation Technology
The realm of AI-powered video creation is advancing rapidly, and Magi-1 marks a transformative milestone. This innovative open-source model offers unmatched precision in controlling timing, motion, an
AI Ethics: Navigating Risks and Responsibilities in Technology Development
Artificial intelligence (AI) is reshaping industries, from healthcare to logistics, offering immense potential for progress. Yet, its rapid advancement brings significant risks that require careful ov
AI-Driven Interior Design: ReRoom AI Transforms Your Space
Aspiring to revamp your home but short on design expertise or funds for a professional? Artificial intelligence is reshaping interior design, delivering user-friendly and creative solutions. ReRoom AI