Skip to content

Commit

Permalink
Update README.md & DeepSeek_R1.pdf
Browse files Browse the repository at this point in the history
  • Loading branch information
stack-heap-overflow committed Jan 23, 2025
1 parent 89c71e0 commit e06298d
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 2 deletions.
Binary file modified DeepSeek_R1.pdf
Binary file not shown.
6 changes: 4 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,6 +56,8 @@ we introduce DeepSeek-R1, which incorporates cold-start data before RL.
DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks.
To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSeek-R1, and six dense models distilled from DeepSeek-R1 based on Llama and Qwen. DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

**NOTE: Before running DeepSeek-R1 series models locally, we kindly recommend reviewing the [Usage Recommendation](#usage-recommendations) section.**

<p align="center">
<img width="80%" src="figures/benchmark.jpg">
</p>
Expand Down Expand Up @@ -202,8 +204,8 @@ python3 -m sglang.launch_server --model deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
**We recommend adhering to the following configurations when utilizing the DeepSeek-R1 series models, including benchmarking, to achieve the expected performance:**

1. Set the temperature within the range of 0.5-0.7 (0.6 is recommended) to prevent endless repetitions or incoherent outputs.
2. Avoid adding a system prompt; all instructions should be contained within the user prompt.

This comment has been minimized.

Copy link
@rajuroyal-web

rajuroyal-web Feb 6, 2025

..

3. For mathematical problems, it is advisable to include a directive in your prompt such as: "put your final answer within \boxed{}".
2. **Avoid adding a system prompt; all instructions should be contained within the user prompt.**
3. For mathematical problems, it is advisable to include a directive in your prompt such as: "Please reason step by step, and put your final answer within \boxed{}."
4. When evaluating model performance, it is recommended to conduct multiple tests and average the results.

## 7. License
Expand Down

4 comments on commit e06298d

@a188208
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

怎么安装

@zhu578616673
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PDF这么小

@sinonion-ma
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

下载下来看看

@HOUQIANG107
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

XXXXX下载以下

Please sign in to comment.