DeepSeek-R1
Model parameter quantity: 671B
Affiliated organization: DeepSeek
License Type: Open Source
Release time: January 20, 2025

Model Introduction
DeepSeek-R1 is a model trained through large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step. Its performance on mathematics, coding, and reasoning tasks is comparable to that of OpenAI-o1.
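For readers who want to query DeepSeek-R1 programmatically, the minimal sketch below assumes DeepSeek's OpenAI-compatible API surface and the deepseek-reasoner model identifier; both are assumptions to verify against the official DeepSeek documentation before use.

```python
from openai import OpenAI

# Assumed endpoint and model name for DeepSeek-R1; check the official
# DeepSeek API docs, as these may change.
client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.deepseek.com")

response = client.chat.completions.create(
    model="deepseek-reasoner",  # DeepSeek-R1
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
)
print(response.choices[0].message.content)
```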
Language comprehension ability (score: 7.8)
Capable of understanding complex contexts and generating logically coherent sentences, though tone control is occasionally imprecise.

Knowledge coverage scope (score: 8.9)
Possesses core knowledge of mainstream disciplines, but coverage of cutting-edge interdisciplinary fields is limited.

Reasoning ability (score: 9.1)
Capable of building multi-level logical frameworks, achieving over 99% accuracy on complex mathematical modeling.
Related models
DeepSeek-V3-0324: DeepSeek-V3 outperforms other open-source models such as Qwen2.5-72B and Llama-3.1-405B in multiple evaluations and matches the performance of top-tier closed-source models like GPT-4 and Claude-3.5-Sonnet.
DeepSeek-R1-0528: The latest version of DeepSeek-R1.
DeepSeek-V2-Chat-0628: DeepSeek-V2 is a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference (a toy illustration of MoE routing follows this list). It comprises 236B total parameters, of which 21B are activated for each token. Compared with DeepSeek 67B, DeepSeek-V2 achieves stronger performance while saving 42.5% of training costs, reducing the KV cache by 93.3%, and boosting maximum generation throughput to 5.76 times.
DeepSeek-V2.5: DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions.
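The phrase "21B activated for each token" refers to MoE routing: a learned gate picks a few experts per token, so only a fraction of the total parameters runs in any forward pass. The PyTorch sketch below illustrates generic top-k routing; the layer sizes, expert count, and k are illustrative placeholders, not DeepSeek-V2's actual configuration.

```python
import torch
import torch.nn as nn


class TopKMoE(nn.Module):
    """Toy top-k mixture-of-experts feed-forward layer (illustrative only)."""

    def __init__(self, d_model=64, d_ff=128, num_experts=8, k=2):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )
        self.gate = nn.Linear(d_model, num_experts)  # router: token -> expert scores
        self.k = k

    def forward(self, x):  # x: (num_tokens, d_model)
        weights, idx = self.gate(x).topk(self.k, dim=-1)  # top-k experts per token
        weights = weights.softmax(dim=-1)                 # mixing weights
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e                  # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out


tokens = torch.randn(10, 64)
layer = TopKMoE()
print(layer(tokens).shape)  # torch.Size([10, 64]); only 2 of 8 experts ran per token
```

Because each token touches only k experts, compute per token scales with the active parameters rather than the total parameter count, which is how a 236B-parameter model can run with roughly 21B parameters per token.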
Relevant documents
US Senate Drops AI Moratorium from Budget Bill Amid Controversy
Why AI Fell Short in 2025 Texas Floods: Critical Disaster Response Lessons
Last Chance to Score Discounted Tickets for TechCrunch Sessions: AI Event Tomorrow
AI-Powered Newsletter Automation Guide: Streamline Your Workflow with Ease
Hawaiian Beach Escapades: New Bonds and Surprising Turns