Fine-tune a DeepSeek distilled variant with a reasoning dataset #492

timo-kurtz · 2025-02-24T00:35:49Z

I want to fine-tune a distilled variant with a reasoning dataset. Should each training example contain two responses (one for the reasoning and a separate one for the final answer), or should the reasoning and the final answer be combined into a single response? Any other suggestions are welcome.

deep_seek_prompt = """ <｜User｜>{}<｜end▁of▁sentence｜> <｜Assistant｜> {} <｜end▁of▁sentence｜> <｜Assistant｜>{}"""

or

deep_seek_prompt = """ <｜User｜>{}<｜end▁of▁sentence｜> <｜Assistant｜> {} {}"""
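To make the second (single-response) option concrete, here is a minimal sketch of how one training example could be assembled. The `format_example` helper, the `<think>...</think>` wrapper around the reasoning, and the closing `<｜end▁of▁sentence｜>` token are assumptions added for illustration and are not confirmed by the model's documentation in this thread; the token layout otherwise follows the second template above.

```python
# Sketch of the single-response layout: reasoning and answer share one assistant turn.
# Assumptions (not confirmed in this thread): the reasoning is wrapped in
# <think>...</think>, and the turn is closed with an end-of-sentence token.
deep_seek_prompt = (
    "<｜User｜>{}<｜end▁of▁sentence｜>"
    "<｜Assistant｜><think>\n{}\n</think>\n\n{}<｜end▁of▁sentence｜>"
)


def format_example(question: str, reasoning: str, answer: str) -> str:
    """Hypothetical helper: fill the template with one dataset row."""
    return deep_seek_prompt.format(
        question.strip(), reasoning.strip(), answer.strip()
    )


example = format_example(
    "What is 2 + 3?",
    "Adding 2 and 3 gives 5.",
    "5",
)
print(example)
```

With this layout every example has exactly one assistant turn, so the loss can be computed over the reasoning and the answer together instead of over two separate responses.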
