Skip to content

Error occurred when fine-tuning using LoRA and IA3 methods. #44

@Gerald-li

Description

@Gerald-li

When fine-tuning the ProSST model (tested both 2046 and 4096 versions) and Prime model, we encountered errors while using LoRA and IA3 fine-tuning methods. The error message is approximately:

File "/hpcfs/fhome/puchx/.cache/huggingface/modules/transformers_modules/modeling_prosst.py", line 868, in forward
inputs_embeds.masked_fill_(
RuntimeError: a leaf Variable that requires grad is being used in an in-place operation.

Do you know what causes this error? Other fine-tuning methods (freeze, full, and ses-adapter) work without any issues.
Thanks!!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions