[torchao safetensors] integrate torchao safetensors support with transformers #40735
Conversation
[For maintainers] Suggested jobs to run (before merge): run-slow: torchao_integration
@@ -727,6 +729,7 @@ def _load_state_dict_into_meta_model(
     keep_in_fp32_regex: Optional[re.Pattern] = None,
     unexpected_keys: Optional[list[str]] = None,  # passing `unexpected` for cleanup from quantization items
     device_mesh: Optional["torch.distributed.device_mesh.DeviceMesh"] = None,
+    metadata: Optional[dict] = None,
Let's use `metadata_dict` here for clarity.
if isinstance(state_dict, tuple):
    state_dict, metadata = state_dict
This might be a bit confusing, I feel; I think we should use a different API to get `tensor_data_dict` and `metadata_dict`.
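For illustration, a minimal sketch of that split using the existing `safetensors.safe_open` API; the helper name `load_tensor_data_and_metadata` is hypothetical and not part of this PR:

```python
from safetensors import safe_open

def load_tensor_data_and_metadata(path: str):
    """Hypothetical helper: return the tensors and the file-level metadata
    separately, instead of packing both into a (state_dict, metadata) tuple."""
    with safe_open(path, framework="pt") as f:
        metadata_dict = f.metadata() or {}
        tensor_data_dict = {key: f.get_tensor(key) for key in f.keys()}
    return tensor_data_dict, metadata_dict
```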
@@ -4286,7 +4301,8 @@ def save_pretrained(
         if safe_serialization:
             # At some point we will need to deal better with save_function (used for TPU and other distributed
             # joyfulness), but for now this enough.
-            safe_save_file(shard, os.path.join(save_directory, shard_file), metadata={"format": "pt"})
+            metadata["format"] = "pt"
Should this be the default value at L4022?
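A small sketch of what that default could look like (hedged; `shard`, `save_directory`, and `shard_file` are the variables from the diff above):

```python
import os

from safetensors.torch import save_file as safe_save_file

# Sketch only: default the metadata so callers that pass nothing
# still end up with {"format": "pt"}, matching the previous behavior.
metadata = metadata if metadata is not None else {}
metadata.setdefault("format", "pt")
safe_save_file(shard, os.path.join(save_directory, shard_file), metadata=metadata)
```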
@@ -279,6 +286,9 @@ def create_quantized_param(

         quantize_(module, self.quantization_config.get_apply_tensor_subclass())

+    def transform_state_dict(self, tensor_data, metadata):
Nit: I think we can make this a bit clearer, e.g. `transform_state_dict_before_saving`.
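The rename would only touch the signature, along the lines of:

```python
def transform_state_dict_before_saving(self, tensor_data, metadata):
    # Same behavior as transform_state_dict; the name now signals
    # that it runs on the save path.
    ...
```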
@@ -297,10 +307,13 @@ def _process_model_after_weight_loading(self, model, **kwargs):

     def is_serializable(self, safe_serialization=None) -> bool:
         if safe_serialization:
+            from torchao.quantization import Float8WeightOnlyConfig
Also `Float8DynamicActivationFloat8WeightConfig`?
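A hedged sketch of the check with both float8 configs included (assuming the torchao config instance lives on `self.quantization_config.quant_type`, and eliding the existing non-safetensors checks):

```python
from torchao.quantization import (
    Float8DynamicActivationFloat8WeightConfig,
    Float8WeightOnlyConfig,
)

def is_serializable(self, safe_serialization=None) -> bool:
    if safe_serialization:
        # Sketch only: allow safetensors for the configs whose tensor
        # subclasses this PR knows how to flatten.
        return isinstance(
            self.quantization_config.quant_type,
            (Float8WeightOnlyConfig, Float8DynamicActivationFloat8WeightConfig),
        )
    ...  # existing (non-safetensors) serializability checks unchanged
```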
def setUp(self):
    self.quant_config = TorchAoConfig(self.quant_scheme, **self.quant_scheme_kwargs)
    dtype = torch.bfloat16 if self.quant_scheme == "int4_weight_only" else "auto"
    from torchao.quantization import Float8WeightOnlyConfig
Maybe add a new test instead of overriding the old one?
with tempfile.TemporaryDirectory() as tmpdirname:
    self.quantized_model.save_pretrained(tmpdirname, safe_serialization=False)
    self.quantized_model.save_pretrained(tmpdirname, safe_serialization=True)
We can add a new test for `Float8WeightOnlyConfig` and `Float8DynamicActivationFloat8WeightConfig`, I think, and revert all the changes to the previous tests.
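A hedged sketch of such a test; the class name and model id are illustrative, and the harness mirrors the existing setup:

```python
import tempfile
import unittest

import torch
from transformers import AutoModelForCausalLM, TorchAoConfig


class TorchAoSafeSerializationTest(unittest.TestCase):
    # Illustrative model id; the real suite uses its own fixture model.
    model_name = "facebook/opt-125m"

    def _save_and_reload(self, torchao_config):
        quant_config = TorchAoConfig(quant_type=torchao_config)
        model = AutoModelForCausalLM.from_pretrained(
            self.model_name, torch_dtype=torch.bfloat16, quantization_config=quant_config
        )
        with tempfile.TemporaryDirectory() as tmpdirname:
            model.save_pretrained(tmpdirname, safe_serialization=True)
            return AutoModelForCausalLM.from_pretrained(tmpdirname)

    def test_float8_weight_only_safetensors(self):
        from torchao.quantization import Float8WeightOnlyConfig

        self.assertIsNotNone(self._save_and_reload(Float8WeightOnlyConfig()))

    def test_float8_dynamic_activation_safetensors(self):
        from torchao.quantization import Float8DynamicActivationFloat8WeightConfig

        self.assertIsNotNone(self._save_and_reload(Float8DynamicActivationFloat8WeightConfig()))
```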
Context

Currently, we need to use `safe_serialization=False` while saving models, as shown here. This PR enables safetensors support for torchao so that users can now save and load checkpoints using safetensors. Currently, only `Float8Tensor` is supported, but allowing other subclasses should involve minimal code changes.

Summary

Changes to the transformers code include:
- In `TorchAoHfQuantizer`, we provide `get_state_dict` and `transform_state_dict`, which flatten/unflatten a model state dict with tensor subclasses by calling functionality built out in this PR.
- In `modeling_utils.py`, we make the changes needed to propagate the metadata from tensor subclasses. We also add logic, similar to `hqq` and `bnb`, to load directly onto `cpu` rather than `meta`.

Test Plan

Modified the unit test to allow safe serialization. Run using:

python tests/quantization/torchao_integration/test_torchao.py
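For reference, a hedged end-to-end sketch of the workflow this PR enables (model id and save path are illustrative):

```python
import torch
from torchao.quantization import Float8WeightOnlyConfig
from transformers import AutoModelForCausalLM, TorchAoConfig

# Quantize with torchao through transformers.
quant_config = TorchAoConfig(quant_type=Float8WeightOnlyConfig())
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-125m", torch_dtype=torch.bfloat16, quantization_config=quant_config
)

# Previously this required safe_serialization=False; with this PR the
# Float8Tensor subclass is flattened into safetensors-compatible form.
model.save_pretrained("opt-125m-float8", safe_serialization=True)

# On load, the saved metadata is used to reconstruct the tensor subclasses.
reloaded = AutoModelForCausalLM.from_pretrained("opt-125m-float8")
```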