Skip to content

KhushalM/GRPO_Finetuning

Repository files navigation

GRPO_Finetuning

Finetuning a base LLM with custom GRPO trainer to answer questions in RIchard Feymann style based on first principles.

About

Finetuning a base LLM with custom GRPO trainer to answer questions in RIchard Feymann style based on first principles.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors