You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Okay so mostly temperature is at 0.0-0.2, with 1.0-1.2 repetition penalty since DRY have low support
Further hints on how for math and coding, keeping it lower than general reasoning recommendations (less than 0.6) are still important https://www.reddit.com/r/LocalLLaMA/comments/1ias719/comment/m9cknjb/
But at the table there is no mention of repetition penalty only presence_penalty and frequency_penaltyhttps://docs.litellm.ai/docs/completion/input
P.S. Does anyone want to support DRY and other methods of preventing repetition? It's like asking for Min P instead of Top P
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Alternate low-temp guidelines for R1 (other than just the recommendation of 0.5-0.7 for temperature):
Some who wants to follow high temp will sometimes recommend mixing in 1.0 repetition penalty (along with the usual mixing in Top P and/or Top K with default values) https://community.appsmith.com/content/guide/building-chat-app-deepseek-r1-and-togetherai-under-5-minutes
Okay so mostly temperature is at 0.0-0.2, with 1.0-1.2 repetition penalty since DRY have low support
Further hints on how for math and coding, keeping it lower than general reasoning recommendations (less than 0.6) are still important https://www.reddit.com/r/LocalLLaMA/comments/1ias719/comment/m9cknjb/
But at the table there is no mention of
repetition penalty
onlypresence_penalty
andfrequency_penalty
https://docs.litellm.ai/docs/completion/inputP.S. Does anyone want to support DRY and other methods of preventing repetition? It's like asking for Min P instead of Top P
Beta Was this translation helpful? Give feedback.
All reactions