Are there any recipes or repositories for performing post-training on the DeepSeek-R1 model, not based on the distilled models? #475

JiayiFu · 2025-02-21T07:58:12Z

Hi everyone,
I want to run some SFT experiments on the 671B DeepSeek-R1 model. However, I couldn’t find any recipes in this repository or on Hugging Face.
Does anyone know if there are any recipes or repositories for performing post-training on the DeepSeek-R1 model?
Thx!

alan008 · 2025-02-21T21:36:16Z

https://github.com/open-thoughts/open-thoughts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Are there any recipes or repositories for performing post-training on the DeepSeek-R1 model, not based on the distilled models? #475

Are there any recipes or repositories for performing post-training on the DeepSeek-R1 model, not based on the distilled models? #475

JiayiFu commented Feb 21, 2025

alan008 commented Feb 21, 2025

Are there any recipes or repositories for performing post-training on the DeepSeek-R1 model, not based on the distilled models? #475

Are there any recipes or repositories for performing post-training on the DeepSeek-R1 model, not based on the distilled models? #475

Comments

JiayiFu commented Feb 21, 2025

alan008 commented Feb 21, 2025