Open
Description
if self.training and (self.config.halt_max_steps > 1):
In the training loop, the TRM model uses q_head to introduce halting, so it will not reach
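To make the question concrete, here is a minimal sketch of how a halting check gated by that condition could behave. It is not the repository's code: `HaltConfig`, `should_halt`, `steps_taken`, and the rule "halt when the q_head logit is positive" are all assumptions made for illustration. Only `halt_max_steps` and the `training` gate come from the quoted line.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class HaltConfig:
    # Hypothetical stand-in for self.config; only halt_max_steps is taken from the issue.
    halt_max_steps: int


def should_halt(step: int, q_halt_logit: float, config: HaltConfig, training: bool) -> bool:
    """Decide whether to stop after `step` (1-based) reasoning steps."""
    # The step budget always applies, in training and in evaluation.
    halted = step >= config.halt_max_steps
    # Early halting is only considered under the condition quoted above.
    if training and (config.halt_max_steps > 1):
        # Assumed rule: a positive q_head logit means "halt now".
        halted = halted or (q_halt_logit > 0.0)
    return halted


def steps_taken(q_halt_logits: Sequence[float], config: HaltConfig, training: bool) -> int:
    """Return how many steps run before halting, given one logit per step."""
    for step, logit in enumerate(q_halt_logits, start=1):
        if should_halt(step, logit, config, training):
            return step
    return len(q_halt_logits)


if __name__ == "__main__":
    config = HaltConfig(halt_max_steps=4)
    logits = [-1.0, 2.0, -1.0, -1.0]
    print("training:", steps_taken(logits, config, training=True))
    print("eval:", steps_taken(logits, config, training=False))
```

Under these assumptions, a training run can stop before `halt_max_steps`, while an evaluation run always uses the full step budget.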