Name: DeepSeek-R1
Rating: 1 (58 reviews)
Author: DeepSeek

Home

List of Al models

DeepSeek-R1

Add comparison

671B

Model parameter quantity

DeepSeek

Affiliated organization

Open Source

License Type

January 20, 2025

Release time

Official website

Model documentation

Technical report

Related figures

Zhenda Xie

Kai Dong

Qihao Zhu

Daya Guo

Liang Wenfeng

Model Introduction

DeepSeek-R1 extensively utilized reinforcement learning techniques during the post-training phase, significantly enhancing the model's reasoning capabilities with only a minimal amount of annotated data. On tasks involving mathematics, coding, and natural language inference, its performance is on par with the official release of OpenAI's o1.

Comprehensive score Language dialogue Knowledge reserve Reasoning association Mathematical calculation Code writing Command following

Swipe left and right to view more

Language comprehension ability

Capable of understanding complex contexts and generating logically coherent sentences, though occasionally off in tone control.

7.5

Knowledge coverage scope

Covers more than 200 specialized fields, integrating the latest research findings and cross-cultural knowledge in real time.

9.0

Reasoning ability

Can perform logical reasoning with more than three steps, though efficiency drops when handling nonlinear relationships.

8.5

Model comparison

DeepSeek-R1 vs Qwen2.5-7B-Instruct Like Qwen2, the Qwen2.5 language models support up to 128K tokens and can generate up to 8K tokens. They also maintain multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

DeepSeek-R1 vs Hunyuan-T1-20250822 The deep reasoning model independently developed by Tencent adopts the version number hunyuan-t1-20250822.

DeepSeek-R1 vs Spark-X1 The inference model Spark X1 released by iFlytek, on the basis of leading domestic mathematical tasks, benchmarks the performance of general tasks such as inference, text generation, and language understanding against OpenAI o1 and DeepSeek R1.

DeepSeek-R1 vs Doubao-Seed-1.6-thinking-250715 The latest version of the seed series model launched by ByteDance, which supports the thinking mode.

DeepSeek-R1 vs Doubao-Seed-1.6-251015 (Thinking) The deep reasoning model released by ByteDance, which supports manual switching of deep reasoning, and its performance is significantly improved compared to doubao-1.5.

Related model

DeepSeek-V3.2 The latest version of Deepseek V3 series models.

DeepSeek-V3.2-Exp The latest experimental version of Deepseek V3 series models.

DeepSeek-R1-0528 The latest version of Deepseek R1.

DeepSeek-V3-0324 DeepSeek-V3 outperforms other open-source models such as Qwen2.5-72B and Llama-3.1-405B in multiple evaluations and matches the performance of top-tier closed-source models like GPT-4 and Claude-3.5-Sonnet.

DeepSeek-R1-0528 The latest version of Deepseek R1.

Relevant documents

Yaoke Media's First AIGC Drama 'The Mystery of the Bronze in Qinling' Launches Today with AI-Signed Leads Today marks the official launch of Yaoke Media's AIGC fantasy mystery short drama, "The Secret Story of the Qinling Bronze." Starring the company's first two signed AI actors, Qin Lingyue and Lin Xiyanyan, the story unfolds in the enigmatic Qinling m

Satya Nadella ready to exploit new OpenAI deal On Wednesday, a Wall Street analyst asked Microsoft CEO Satya Nadella directly how the revised OpenAI partnership would affect the company’s financials.Nadella described the new agreement as a win for everyone. “We feel good about our partnership wit

WordPress.com now allows AI agents to write and publish posts, plus more WordPress.com, the popular web hosting and publishing platform, is now embracing AI agents—a move that could reshape the look and feel of the web. The company announced Friday that it will allow AI agents to draft, edit, and publish content on custom

Anthropic's experimental AI Claude completes negotiations and transactions in e-commerce test As artificial intelligence advances rapidly, Anthropic quietly rolled out an internal experiment called "Project Deal" last Friday, showcasing AI's potential in e-commerce. The experiment had its AI model Claude autonomously handle buying, selling, a

DeepSeek Code poised for launch As AI technology accelerates, DeepSeek is at a thrilling juncture. The AI company recently revealed it has secured over 70 billion yuan in funding. Leadership has emphasized a commitment to groundbreaking AI research over immediate commercial gains.

Model comparison

Start the comparison