Skip to content

Commit

Permalink
docs:data distillation pipline for distilling high-quality maths reas…
Browse files Browse the repository at this point in the history
…oning data with thought process (Long Cot data)from deepseek R1 (#1532)

Co-authored-by: “yifeng.wang” <“[email protected];q:wqqgit config --global user.name “yifeng.wang”git config --global user.email “[email protected]>
Co-authored-by: Wendong <[email protected]>
Co-authored-by: Wendong-Fan <[email protected]>
  • Loading branch information
4 people authored Feb 3, 2025
1 parent 388b640 commit c572a5c
Show file tree
Hide file tree
Showing 3 changed files with 1,111 additions and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -280,6 +280,7 @@ Practical guides and tutorials for implementing specific functionalities in CAME
| **[CoT Data Generation and SFT Qwen with Unsolth](https://docs.camel-ai.org/cookbooks/data_generation/cot_data_gen_sft_qwen_unsolth_upload_huggingface.html)** | Discover how to generate CoT data using CAMEL and SFT Qwen with Unsolth, and seamlessly upload your data and model to Huggingface. |
| **[Agentic Data Generation, Evaluation & Filtering with Reward Models](https://docs.camel-ai.org/cookbooks/data_generation/synthetic_dataevaluation&filter_with_reward_model.html)** | Discover methods for generating, evaluating, and filtering agentic data using reward models to enhance the quality and efficiency of your synthetic data pipelines. |
| **[Data Model Generation and Structured Output with Qwen Model](https://docs.camel-ai.org/cookbooks/data_generation/data_model_generation_and_structured_output_with_qwen.html)** |Learn how to generate data models and structured outputs using the Qwen Model for improved data representation.|
| **[Distill Math Reasoning Data from DeepSeek R1](https://docs.camel-ai.org/cookbooks/data_generation/distill_math_reasoning_data_from_deepseek_r1.html)** |Learn how to set up and leverage CAMEL's data distillation pipline for distilling high-quality maths reasoning data with thought process (Long CoT data)from deepseek R1, and uploading the results to Hugging Face.|
### Multi-Agent Systems & Applications
Expand Down

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions docs/cookbooks/data_generation/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -19,3 +19,4 @@ Model Training and Fine-tuning
cot_data_gen_sft_qwen_unsolth_upload_huggingface
synthetic_dataevaluation&filter_with_reward_model
data_model_generation_and_structured_output_with_qwen
distill_math_reasoning_data_from_deepseek_r1

0 comments on commit c572a5c

Please sign in to comment.