Skip to content

'None' values in the dataset preprocessing when reproducing the result on GSM8k #3

@kulinshah98

Description

@kulinshah98

Hi,

Thanks for open sourcing the code! I am trying to reproduce the results on Math (GSM8k) benchmark given in Table 1. I am using the yaml file given at the location "LLaMA-Factory/examples/train_full/gpt2_full_ddm-sft.yaml" and given instruction to run the code. However, I am facing the issue of having "None" values in the input data. To be specific, the batch['input_ids'] in the "LLaMA-Factory/src/llamafactory/train/ddm/workflow.py" seems to have None values between prompt and response (after question and before answer of the GSM8k). Because of this None value, I am getting the error "RuntimeError: Could not infer dtype of NoneType" during the start of the training. Can you help me fix this issue? Thank you in advance!

Best,
Kulin

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions