
deepseek-ai / DeepSeek-V3.1-Terminus

Pricing per 1M tokens: 94.5 DZD in, 332.5 DZD out, 45.5 DZD cached.
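As a rough illustration of how the listed prices translate into the cost of a single request, here is a minimal sketch. The function name and the billing rule for cached tokens (cached tokens are treated as a subset of input tokens and billed at the cached rate) are assumptions for illustration, not a statement of how billing is actually computed.

```python
# Listed prices in DZD per 1M tokens, copied from the pricing line above.
PRICE_IN = 94.5
PRICE_OUT = 332.5
PRICE_CACHED = 45.5
TOKENS_PER_UNIT = 1_000_000


def estimate_cost_dzd(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the cost of one request in DZD.

    Assumption (hypothetical billing rule): cached_tokens is the part of
    input_tokens served from cache and is billed at the cached rate
    instead of the regular input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_IN
        + cached_tokens * PRICE_CACHED
        + output_tokens * PRICE_OUT
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 10,000 input tokens (4,000 of them cached) and 2,000 output tokens.
    print(estimate_cost_dzd(10_000, 2_000, cached_tokens=4_000))
```

Under these assumptions, a request with 10,000 input tokens (4,000 cached) and 2,000 output tokens comes to roughly 1.414 DZD.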

DeepSeek-V3.1-Terminus is an update to DeepSeek-V3.1 that keeps the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, and further optimizes performance in coding and search agents. It is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes, and it extends the DeepSeek-V3 base with a two-phase long-context training process. Users can control the reasoning behaviour with the reasoning `enabled` boolean. Learn more in our docs.

The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.
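To make the reasoning toggle concrete, the sketch below builds a chat request body with the boolean set. Only the existence of a reasoning `enabled` boolean comes from the description above; the surrounding payload layout (the `model`, `messages`, and nested `reasoning` keys) and the helper name `build_chat_request` are assumptions for illustration, so check the provider's docs for the actual request schema.

```python
import json


def build_chat_request(prompt: str, reasoning_enabled: bool) -> dict:
    """Build a request body that switches thinking mode on or off.

    Hypothetical payload layout: only the `enabled` boolean is named in the
    description; the other field names are assumed for illustration.
    """
    return {
        "model": "deepseek-ai/DeepSeek-V3.1-Terminus",
        "messages": [{"role": "user", "content": prompt}],
        "reasoning": {"enabled": reasoning_enabled},
    }


if __name__ == "__main__":
    # Thinking mode on; pass False to request the non-thinking mode instead.
    print(json.dumps(build_chat_request("1+1=?", reasoning_enabled=True), indent=2))
```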

Public | fp4 | 163,840 | JSON | Function | Project

Architecture: fp4

Context Window: 163k

| Model | #Total Params | #Activated Params | Context Length | Download |
|---|---|---|---|---|
| DeepSeek-V3.1-Base | 671B | 37B | 128K | HuggingFace / ModelScope |
| DeepSeek-V3.1 | 671B | 37B | 128K | HuggingFace / ModelScope |

    ## Tools
    You have access to the following tools:

    ### {tool_name1}
    Description: {description}

    Parameters: {json.dumps(parameters)}

    IMPORTANT: ALWAYS adhere to this exact format for tool use:
    <｜tool_calls_begin｜><｜tool_call_begin｜>tool_call_name<｜tool_sep｜>tool_call_arguments<｜tool_call_end｜>[{additional_tool_calls}]<｜tool_calls_end｜>

    Where:
    - `tool_call_name` must be an exact match to one of the available tools
    - `tool_call_arguments` must be valid JSON that strictly follows the tool's Parameters Schema
    - For multiple tool calls, chain them directly without separators or spaces
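The tool-call format above can be exercised with a small serializer and parser. This is a minimal sketch, not the model's or any library's official implementation: the delimiter strings are copied from the format description, while the helper names (`format_tool_calls`, `parse_tool_calls`) and the example tool names are hypothetical.

```python
import json
import re

# Delimiters copied verbatim from the format description above.
CALLS_BEGIN = "<｜tool_calls_begin｜>"
CALLS_END = "<｜tool_calls_end｜>"
CALL_BEGIN = "<｜tool_call_begin｜>"
CALL_SEP = "<｜tool_sep｜>"
CALL_END = "<｜tool_call_end｜>"

# One call: begin marker, tool name, separator, JSON arguments, end marker.
_CALL_RE = re.compile(
    re.escape(CALL_BEGIN) + r"(.*?)" + re.escape(CALL_SEP) + r"(.*?)" + re.escape(CALL_END),
    re.DOTALL,
)


def format_tool_calls(calls):
    """Serialize (name, arguments) pairs, chaining calls with no separators."""
    body = "".join(
        f"{CALL_BEGIN}{name}{CALL_SEP}{json.dumps(args)}{CALL_END}"
        for name, args in calls
    )
    return f"{CALLS_BEGIN}{body}{CALLS_END}"


def parse_tool_calls(text, known_tools):
    """Extract (name, arguments) pairs from model output.

    Raises ValueError for a tool name that is not in known_tools or for
    arguments that are not valid JSON (json.JSONDecodeError is a ValueError).
    Returns an empty list when the text contains no tool-call block.
    """
    start = text.find(CALLS_BEGIN)
    end = text.find(CALLS_END, start) if start != -1 else -1
    if start == -1 or end == -1:
        return []
    block = text[start + len(CALLS_BEGIN):end]
    calls = []
    for name, raw_args in _CALL_RE.findall(block):
        if name not in known_tools:
            raise ValueError(f"unknown tool: {name!r}")
        calls.append((name, json.loads(raw_args)))
    return calls


if __name__ == "__main__":
    # Hypothetical tools, used only to show a round trip.
    text = format_tool_calls([("get_weather", {"city": "Algiers"}), ("get_time", {"tz": "UTC"})])
    print(parse_tool_calls(text, {"get_weather", "get_time"}))
```

The parser checks the name against the set of available tools and runs the arguments through `json.loads`, which mirrors the first two rules in the format description. It does not validate the arguments against each tool's Parameters Schema, which would need a schema validator.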

| Category | Benchmark (Metric) | DeepSeek V3.1-NonThinking | DeepSeek V3 0324 | DeepSeek V3.1-Thinking | DeepSeek R1 0528 |
|---|---|---|---|---|---|
| General | MMLU-Redux (EM) | 91.8 | 90.5 | 93.7 | 93.4 |
| General | MMLU-Pro (EM) | 83.7 | 81.2 | 84.8 | 85.0 |
| General | GPQA-Diamond (Pass@1) | 74.9 | 68.4 | 80.1 | 81.0 |
| General | Humanity's Last Exam (Pass@1) | - | - | 15.9 | 17.7 |
| Search Agent | BrowseComp | - | - | 30.0 | 8.9 |
| Search Agent | BrowseComp_zh | - | - | 49.2 | 35.7 |
| Search Agent | Humanity's Last Exam (Python + Search) | - | - | 29.8 | 24.8 |
| Search Agent | SimpleQA | - | - | 93.4 | 92.3 |
| Code | LiveCodeBench (2408-2505) (Pass@1) | 56.4 | 43.0 | 74.8 | 73.3 |
| Code | Codeforces-Div1 (Rating) | - | - | 2091 | 1930 |
| Code | Aider-Polyglot (Acc.) | 68.4 | 55.1 | 76.3 | 71.6 |
| Code Agent | SWE Verified (Agent mode) | 66.0 | 45.4 | - | 44.6 |
| Code Agent | SWE-bench Multilingual (Agent mode) | 54.5 | 29.3 | - | 30.5 |
| Code Agent | Terminal-bench (Terminus 1 framework) | 31.3 | 13.3 | - | 5.7 |
| Math | AIME 2024 (Pass@1) | 66.3 | 59.4 | 93.1 | 91.4 |
| Math | AIME 2025 (Pass@1) | 49.8 | 51.3 | 88.4 | 87.5 |
| Math | HMMT 2025 (Pass@1) | 33.5 | 29.2 | 84.2 | 79.4 |

    import transformers

    tokenizer = transformers.AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3.1")

    messages = [
        {"role": "system", "content": "You are a helpful assistant"},
        {"role": "user", "content": "Who are you?"},
        {"role": "assistant", "content": "<think>Hmm</think>I am DeepSeek"},
        {"role": "user", "content": "1+1=?"}
    ]

    # Thinking mode: the rendered prompt ends with an opening <think> tag.
    tokenizer.apply_chat_template(messages, tokenize=False, thinking=True, add_generation_prompt=True)
    # '<｜begin_of_sentence｜>You are a helpful assistant<｜User｜>Who are you?<｜Assistant｜><think>Hmm</think>I am DeepSeek<｜end_of_sentence｜><｜User｜>1+1=?<｜Assistant｜><think>'

    # Non-thinking mode: the rendered prompt ends with a closing </think> tag.
    tokenizer.apply_chat_template(messages, tokenize=False, thinking=False, add_generation_prompt=True)
    # '<｜begin_of_sentence｜>You are a helpful assistant<｜User｜>Who are you?<｜Assistant｜></think>I am DeepSeek<｜end_of_sentence｜><｜User｜>1+1=?<｜Assistant｜></think>'
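The rendered prompts above end with `<think>` in thinking mode and with `</think>` in non-thinking mode. Assuming, as those prompts suggest, that a thinking-mode completion carries the reasoning first, then a closing `</think>`, then the final answer, a caller might separate the two parts as sketched below. The helper name `split_completion` is hypothetical, and the completion layout is an inference from the example rather than a documented guarantee.

```python
THINK_OPEN = "<think>"
THINK_CLOSE = "</think>"


def split_completion(completion: str):
    """Split generated text into (reasoning, answer).

    Assumption: in thinking mode the completion looks like
    'reasoning</think>answer' because the prompt already ends with '<think>'.
    In non-thinking mode the prompt ends with '</think>', so the completion
    is expected to hold only the answer and the reasoning part is empty.
    """
    reasoning, sep, answer = completion.partition(THINK_CLOSE)
    if not sep:
        # No closing tag found: treat the whole text as the answer.
        return "", completion.strip()
    # Tolerate an opening tag echoed at the start of the completion.
    if reasoning.startswith(THINK_OPEN):
        reasoning = reasoning[len(THINK_OPEN):]
    return reasoning.strip(), answer.strip()


if __name__ == "__main__":
    print(split_completion("Hmm</think>I am DeepSeek"))
    print(split_completion("2"))
```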
