You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
FP16 and deepspeed zero3 both break training in my testing ([rank0]: RuntimeError: weight should have at least three dimensions), this config works#15
Open
Julian2002 wants to merge 1 commit into2U1:master2U1/Llama3.2-Vision-Finetune:masterfrom Julian2002:8-bit_config_patchJulian2002/Llama3.2-Vision-Finetune:8-bit_config_patchCopy head branch name to clipboard