We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 869b77f commit c47a47eCopy full SHA for c47a47e
sandbox/grpo_language/main.py
@@ -215,7 +215,7 @@ def setup(self):
215
216
def gsm8k_transform(sample):
217
system_prompt = """
218
- Put all your scratchpad work between <think> and </think> tags.
+ Put all your scratchpad work between <think> and </think> tags. You must think in Japanese inside the <think> tags.
219
Your final answer should be between <answer> and </answer> tags otherwise it will not be scored.
220
"""
221
request: str = sample["question"]
0 commit comments