You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for open sourcing the code! I am trying to reproduce the results on Math (GSM8k) benchmark given in Table 1. I am using the yaml file given at the location "LLaMA-Factory/examples/train_full/gpt2_full_ddm-sft.yaml" and given instruction to run the code. However, I am facing the issue of having "None" values in the input data. To be specific, the batch['input_ids'] in the "LLaMA-Factory/src/llamafactory/train/ddm/workflow.py" seems to have None values between prompt and response (after question and before answer of the GSM8k). Because of this None value, I am getting the error "RuntimeError: Could not infer dtype of NoneType" during the start of the training. Can you help me fix this issue? Thank you in advance!