Introducing DeepSeek-V3: Your AI Companion

MihiCherub · Jan 24, 2025

The DeepSeek R1 model is a versatile, powerful AI tool that’s set to revolutionize how businesses and individuals interact with technology. Whether you’re automating workflows or enhancing user experiences, the R1 is a game-changer.

Open-Source Model You Can Run Locally

DeepSeek-R1-Evaluation

Category	Benchmark (Metric)	Claude-3.5-Sonnet-1022	GPT-4o 0513	DeepSeek V3	OpenAI o1-mini	OpenAI o1-1217	DeepSeek R1
	Architecture	-	-	MoE	-	-	MoE
	# Activated Params	-	-	37B	-	-	37B
	# Total Params	-	-	671B	-	-	671B
English	MMLU (Pass@1)	88.3	87.2	88.5	85.2	91.8	90.8
	MMLU-Redux (EM)	88.9	88.0	89.1	86.7	-	92.9
	MMLU-Pro (EM)	78.0	72.6	75.9	80.3	-	84.0
	DROP (3-shot F1)	88.3	83.7	91.6	83.9	90.2	92.2
	IF-Eval (Prompt Strict)	86.5	84.3	86.1	84.8	-	83.3
	GPQA-Diamond (Pass@1)	65.0	49.9	59.1	60.0	75.7	71.5
	SimpleQA (Correct)	28.4	38.2	24.9	7.0	47.0	30.1
	FRAMES (Acc.)	72.5	80.5	73.3	76.9	-	82.5
	AlpacaEval2.0 (LC-winrate)	52.0	51.1	70.0	57.8	-	87.6
	ArenaHard (GPT-4-1106)	85.2	80.4	85.5	92.0	-	92.3
Code	LiveCodeBench (Pass@1-COT)	33.8	34.2	-	53.8	63.4	65.9
	Codeforces (Percentile)	20.3	23.6	58.7	93.4	96.6	96.3
	Codeforces (Rating)	717	759	1134	1820	2061	2029
	SWE Verified (Resolved)	50.8	38.8	42.0	41.6	48.9	49.2
	Aider-Polyglot (Acc.)	45.3	16.0	49.6	32.9	61.7	53.3
Math	AIME 2024 (Pass@1)	16.0	9.3	39.2	63.6	79.2	79.8
	MATH-500 (Pass@1)	78.3	74.6	90.2	90.0	96.4	97.3
	CNMO 2024 (Pass@1)	13.1	10.8	43.2	67.6	-	78.8
Chinese	CLUEWSC (EM)	85.4	87.9	90.9	89.9	-	92.8
	C-Eval (EM)	76.7	76.0	86.5	68.9	-	91.8
	C-SimpleQA (Correct)	55.4	58.7	68.0	40.3	-	63.7

DeepSeek R1 release https://api-docs.deepseek.com/news/news250120
- Online chat https://chat.deepseek.com/
- Deepseek coder https://huggingface.co/spaces/akhaliq/anychat
- Higgingface https://huggingface.co/deepseek-ai/DeepSeek-R1

DeepSeek Math Capabilities: Precision Meets Power
DeepSeek’s math capabilities are designed to handle a wide range of mathematical tasks, from basic arithmetic to advanced calculus, linear algebra, and statistical analysis. Its AI-driven engine ensures accurate problem-solving, step-by-step explanations, and the ability to tackle complex equations with ease. Whether you're a student, educator, or professional, DeepSeek can simplify math challenges and provide clear, logical solutions.

Honey Bunch · Jan 24, 2025

මාත් ඊයේ ඉන්ස්ටා එකේ දැක්කා.දැන් ඔයාගේ අර්ටිකල් එක බලලා චැට් එකට ගියා සට සට ගාලා උත්තර දෙනවා.ඩේටා සෙෆ්ටිය කොහොමද දන්නේ නෑ මේකේ.චීනේ හදපු එකක් නිසා.ට්‍රම්ප් අත්සන් කරන වෙලාවටම තමා මේක ලෝන්ච් කරා කියන්නේ

hi.dushan · Jan 25, 2025

ඩේටා සෙෆ්ටිය කොහොමද දන්නේ නෑ මේකේ.චීනේ හදපු එකක් නිසා (2)

MihiCherub · Jan 25, 2025

Honey Bunch said:
මාත් ඊයේ ඉන්ස්ටා එකේ දැක්කා.දැන් ඔයාගේ අර්ටිකල් එක බලලා චැට් එකට ගියා සට සට ගාලා උත්තර දෙනවා.ඩේටා සෙෆ්ටිය කොහොමද දන්නේ නෑ මේකේ.චීනේ හදපු එකක් නිසා.ට්‍රම්ප් අත්සන් කරන වෙලාවටම තමා මේක ලෝන්ච් කරා කියන්නේ

ලෝකල් රන් කලා නම් හරි.

Sen-lu · Jan 25, 2025

hi.dushan said:
ඩේටා සෙෆ්ටිය කොහොමද දන්නේ නෑ මේකේ.චීනේ හදපු එකක් නිසා (2)

අපේ මොනා හොරකම් කරන්නද බං

.

Draco Malfoy · Jan 25, 2025

mamath test karala baluwa. patta bn

Al Baik · Jan 25, 2025

https://elakiri.com/threads/open-ai-o3-model-will-be-free-for-all-users.2195998/post-30497741

devops · Jan 31, 2025

Search

Latest ads

Introducing DeepSeek-V3: Your AI Companion

MihiCherub

Well-known member

DeepSeek-R1-Evaluation

Honey Bunch

Well-known member

hi.dushan

Well-known member

MihiCherub

Well-known member

Sen-lu

Well-known member

Draco Malfoy

Well-known member

Al Baik

Well-known member

devops

Well-known member

Similar threads

Introducing DeepSeek-V3: Your AI Companion

Well-known member

​

DeepSeek-R1-Evaluation​

Well-known member

Well-known member

Well-known member

Well-known member

Well-known member

Well-known member

Well-known member

Similar threads

DeepSeek-R1-Evaluation