fix: resolve duplicate `-g` flag and `type=bool` argparse bug in sft.py by vominh1919 · Pull Request #1329 · PrimeIntellect-ai/verifiers

vominh1919 · 2026-05-10T13:41:09Z

Problem

Two bugs in scripts/sft.py's argument parser:

1. Duplicate `-g` flag: Both `--gradient-accumulation-steps` (line 69) and `--max-grad-norm` (line 73) use `-g` as their short flag. The second definition shadows the first, so `python sft.py -g 4` silently sets `max_grad_norm=4.0` instead of `gradient_accumulation_steps=4`.
2. `type=bool` doesn't work in argparse: `bool("False")` is `True` in Python because any non-empty string is truthy. The only way to disable `--push-to-hub` via the CLI is `--push-to-hub ""`, which is unintuitive.
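The `type=bool` pitfall can be reproduced in isolation. This is a minimal sketch that copies only the old `--push-to-hub` definition from the diff into a throwaway parser; the variable names are illustrative and not taken from `scripts/sft.py`.

```python
import argparse

# Throwaway parser holding only the old --push-to-hub definition.
parser = argparse.ArgumentParser()
parser.add_argument("--push-to-hub", "-p", type=bool, default=True)

# argparse passes the raw string to bool(), and any non-empty string is truthy.
as_false = parser.parse_args(["--push-to-hub", "False"]).push_to_hub
# Only the empty string converts to False.
as_empty = parser.parse_args(["--push-to-hub", ""]).push_to_hub

print(f"bool('False')={bool('False')}, 'False' -> {as_false}, '' -> {as_empty}")
```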

Fix

1. Change the `--gradient-accumulation-steps` short flag from `-g` to `-G` (uppercase, since `--max-grad-norm` already owns `-g`).
2. Replace `type=bool` with `action=argparse.BooleanOptionalAction`, which provides both `--push-to-hub` and `--no-push-to-hub`.
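The two changes can be sketched together as follows. Only the `--max-grad-norm` and `--push-to-hub` lines match the diff; the `int` type and default of `1` for `--gradient-accumulation-steps`, and the `build_parser` helper itself, are assumptions made for illustration. `BooleanOptionalAction` requires Python 3.9 or newer.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical helper holding only the three flags discussed in this PR."""
    parser = argparse.ArgumentParser()
    # Assumed type/default; the PR only states that the short flag becomes -G.
    parser.add_argument("--gradient-accumulation-steps", "-G", type=int, default=1)
    parser.add_argument("--max-grad-norm", "-g", type=float, default=0.1)
    # BooleanOptionalAction also registers --no-push-to-hub automatically.
    parser.add_argument(
        "--push-to-hub", "-p", action=argparse.BooleanOptionalAction, default=True
    )
    return parser


parser = build_parser()
print(parser.parse_args(["-G", "4"]))
print(parser.parse_args(["-g", "4"]))
print(parser.parse_args(["--no-push-to-hub"]))
```

With this shape, `-g` and `-G` each reach their own destination, and omitting the push flag keeps the default of `True`.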

Before vs After

| Scenario | Before | After |
| --- | --- | --- |
| `sft.py -g 4` | Sets `max_grad_norm=4.0` (wrong!) | Sets `max_grad_norm=4.0` (correct, `-G` needed for grad accum) |
| `sft.py -G 4` | Error: unrecognized argument | Sets `gradient_accumulation_steps=4` ✓ |
| `sft.py --no-push-to-hub` | Error: unrecognized argument | Sets `push_to_hub=False` ✓ |
| `sft.py --push-to-hub False` | Sets `push_to_hub=True` (wrong!) | N/A, use `--no-push-to-hub` instead |

Tests

Verified syntax with python3 -c "import ast; ast.parse(open('scripts/sft.py').read())"

Note

Low Risk
Low risk: only adjusts CLI argument definitions in scripts/sft.py, changing how flags are parsed but not training logic or data handling.

Overview
Fixes the scripts/sft.py CLI so --gradient-accumulation-steps no longer conflicts with --max-grad-norm by switching its short flag from -g to -G.

Updates --push-to-hub to use argparse.BooleanOptionalAction, enabling --push-to-hub / --no-push-to-hub instead of the broken type=bool parsing.

Reviewed by Cursor Bugbot for commit d2ae717. Bugbot is set up for automated code reviews on this repo.

Two bugs in the SFT training script's argument parser:

1. Both `--gradient-accumulation-steps` and `--max-grad-norm` use `-g` as their short flag. The second definition shadows the first, so `python sft.py -g 4` silently sets `max_grad_norm=4.0` instead of `gradient_accumulation_steps=4`.
2. `type=bool` doesn't work as expected in argparse: `bool('False')` is `True` in Python, so `--push-to-hub False` still sets the value to `True`. Use `BooleanOptionalAction` instead, which provides `--no-push-to-hub`.
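How a repeated short flag behaves depends on how the parser is constructed, which this page does not show. As a sketch under that uncertainty: a default `ArgumentParser` rejects a repeated option string with `argparse.ArgumentError` when the second `add_argument` call runs, while `conflict_handler="resolve"` produces the shadowing described in the commit message. The helper name and the `int` type and default of `1` for `--gradient-accumulation-steps` are assumptions.

```python
import argparse


def add_conflicting_flags(parser: argparse.ArgumentParser) -> argparse.ArgumentParser:
    """Hypothetical helper registering both flags with the same -g short form."""
    parser.add_argument("--gradient-accumulation-steps", "-g", type=int, default=1)
    parser.add_argument("--max-grad-norm", "-g", type=float, default=0.1)
    return parser


# Default conflict handling: the second add_argument call raises.
try:
    add_conflicting_flags(argparse.ArgumentParser())
    default_outcome = "accepted"
except argparse.ArgumentError:
    default_outcome = "ArgumentError"

# With conflict_handler="resolve", the later definition takes over -g.
resolving = add_conflicting_flags(argparse.ArgumentParser(conflict_handler="resolve"))
shadowed = resolving.parse_args(["-g", "4"])

print(default_outcome, shadowed)
```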

Cursor Bugbot has reviewed your changes and found 1 potential issue.

cursor · 2026-05-10T13:42:48Z

    parser.add_argument("--weight-decay", "-w", type=float, default=0.01)
    parser.add_argument("--max-grad-norm", "-g", type=float, default=0.1)
-    parser.add_argument("--push-to-hub", "-p", type=bool, default=True)
+    parser.add_argument("--push-to-hub", "-p", action=argparse.BooleanOptionalAction, default=True)


Parsed push_to_hub value never used in SFTConfig

High Severity

The --push-to-hub argparse fix is incomplete. While the CLI now correctly parses --no-push-to-hub via BooleanOptionalAction, the SFTConfig on line 49 hardcodes push_to_hub=True instead of using args.push_to_hub. This means --no-push-to-hub is silently ignored — the model is always pushed to the hub regardless of the flag.

Additional Locations (1)

scripts/sft.py#L48-L49
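One way to address the review comment is to pass the parsed value through instead of a literal `True`. The sketch below uses a hypothetical dataclass, `TrainingConfigStandIn`, in place of `SFTConfig` so that it runs without extra dependencies; `build_config` and the `output_dir` value are likewise assumptions, not code from `scripts/sft.py`.

```python
import argparse
from dataclasses import dataclass


@dataclass
class TrainingConfigStandIn:
    """Hypothetical stand-in for SFTConfig; only the relevant field is modelled."""

    output_dir: str
    push_to_hub: bool


def build_config(args: argparse.Namespace) -> TrainingConfigStandIn:
    # Forward the CLI value rather than hardcoding push_to_hub=True.
    return TrainingConfigStandIn(output_dir="outputs", push_to_hub=args.push_to_hub)


parser = argparse.ArgumentParser()
parser.add_argument(
    "--push-to-hub", "-p", action=argparse.BooleanOptionalAction, default=True
)

config = build_config(parser.parse_args(["--no-push-to-hub"]))
print(config)
```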

vominh1919 wants to merge 1 commit into PrimeIntellect-ai:main from vominh1919:fix/sft-cli-duplicate-flags
