
fix: Explicitly load checkpoint to CPU to avoid CUDA error #219

Merged · 1 commit merged into main on Mar 3, 2025

Conversation

gau-nernst (Contributor) commented:

The checkpoint was serialized directly from CUDA tensors, so by default torch.load() will attempt to reload the weights onto CUDA even when CUDA is not available, e.g. on CPU-only machines.

Without this fix, I get this error on my MacBook:

RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.

This PR fixes it.
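A minimal sketch of the change, following the remedy named in the error message itself (the checkpoint path here is a placeholder, not the actual file the PR touches):

```python
import torch

# Before (fails on CPU-only machines): the checkpoint stores CUDA tensors,
# so torch.load tries to restore them onto a CUDA device by default.
#   state_dict = torch.load("checkpoint.pt")

# After: explicitly map every stored tensor to CPU at load time.
# map_location=torch.device("cpu") works equally well.
state_dict = torch.load("checkpoint.pt", map_location="cpu")
```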

@gau-nernst requested a review from tuanlda78202 on March 3, 2025, 03:54

@tuanlda78202 (Contributor) left a comment:


lgtm!

@gau-nernst merged commit 02e2f79 into main on Mar 3, 2025
@gau-nernst deleted the fix/load_model_on_cpu branch on March 3, 2025, 04:11