Add support for Int4GroupwisePreshuffleTensor for fbgemm #2421

Merged: 1 commit merged into main on Jul 3, 2025

Conversation

Contributor

@jerryzh168 jerryzh168 commented Jun 22, 2025


pytorch-bot bot commented Jun 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2421

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 851dc01 with merge base 5a50667, one new CI job failure was reported.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168 added a commit that referenced this pull request Jun 22, 2025
Note: slice is not working yet; other ops are working.

Test Plan:
python test/dtypes/test_int4_groupwise_preshuffle.py

Reviewers:

Subscribers:

Tasks:

Tags:

stack-info: PR: #2421, branch: jerryzh168/stack/1
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/1 branch from 565e596 to 65a1373 Compare June 22, 2025 04:28
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 22, 2025
@jerryzh168 jerryzh168 changed the title from "Summary:" to "Add Int4GroupwisePreshuffleTensor for fbgemm" Jun 22, 2025
@jerryzh168 jerryzh168 added the topic: new feature Use this tag if this PR adds a new feature label Jun 22, 2025
@jerryzh168 jerryzh168 changed the title from "Add Int4GroupwisePreshuffleTensor for fbgemm" to "Summary:" Jun 22, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/1 branch from 65a1373 to 8dcecf4 Compare June 22, 2025 04:35
@jerryzh168 jerryzh168 changed the title from "Summary:" to "Add support for Int4GroupwisePreshuffleTensor for fbgemm" Jun 22, 2025
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/1 branch 3 times, most recently from 44b79dd to 6ce4c7b Compare June 24, 2025 22:25
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/1 branch from 6ce4c7b to 027648f Compare June 24, 2025 22:28
import importlib.util  # assumed to be imported near the top of the file

if importlib.util.find_spec("fbgemm_gpu") is None:
    quantize_int4_preshuffle = None
else:
    from fbgemm_gpu.experimental.gen_ai.quantize import quantize_int4_preshuffle
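
A hedged sketch of how a call site might guard on this optional-dependency sentinel from the snippet above; the helper below is hypothetical and not part of the PR:

```python
def _require_quantize_int4_preshuffle():
    """Hypothetical helper (not from this PR): raise a clear error when
    fbgemm_gpu is missing, since the guard above leaves
    quantize_int4_preshuffle set to None in that case."""
    if quantize_int4_preshuffle is None:
        raise ImportError(
            "Int4GroupwisePreshuffleTensor requires fbgemm_gpu; "
            "quantize_int4_preshuffle could not be imported."
        )
    return quantize_int4_preshuffle
```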
Contributor

Is this a prototype API? If yes, should the torchao version also be prototype? What does "experimental" mean in the folder structure here?

Contributor Author

@jerryzh168 jerryzh168 Jun 25, 2025

It is stable, production-ready, and used in production. It's just bad naming according to @jwfromm, and they have a plan to get rid of it.

shape: shape of the original Tensor

Note:
preshuffle means the weight is rearranged for more efficient use of loading instructions
Contributor

It would be good to share the specifics of the preshuffle transformation, either here or via a link a user can follow.
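
As a purely illustrative aside (not fbgemm's actual layout, which is exactly what the comment above asks to document or link), here is a toy example of the general idea of a tile-friendly weight rearrangement:

```python
import torch

def toy_preshuffle(w: torch.Tensor, tile: int = 8) -> torch.Tensor:
    """Toy illustration only, NOT fbgemm's real preshuffle: rearrange an
    (N, K) weight so each tile x tile block is contiguous in memory, the kind
    of reordering that lets a kernel issue wide, aligned loads."""
    n, k = w.shape
    assert n % tile == 0 and k % tile == 0
    return (
        w.reshape(n // tile, tile, k // tile, tile)
        .permute(0, 2, 1, 3)   # bring each tile's elements next to each other
        .contiguous()
        .view(n, k)            # same elements, new memory order
    )

# Example: a 16x16 weight regrouped into 8x8 blocks keeps its shape.
print(toy_preshuffle(torch.arange(256.0).reshape(16, 16)).shape)  # torch.Size([16, 16])
```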

Int4GroupwisePreshuffleTensor,
)

Int4GroupwisePreshuffleTensor.__module__ = "torchao.quantization"
Contributor

Can we confirm (by actually testing it) that we can change the directory location later without breaking BC?

Contributor Author

I have a test that verifies the loaded weight has the module path torchao.quantization.Int4GroupwisePreshuffleTensor. This (type(tensor)) is what the load code path uses: https://github.com/pytorch/pytorch/blob/d4b8857e51a089b7e0e722689398c5c3ada274c9/torch/_tensor.py#L262, which gives us good confidence that it will keep working as long as we do this.

But I can do an e2e test a bit later by uploading the file to the Hugging Face Hub and changing the path locally to verify as well.
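
A minimal sketch of the idea, using a hypothetical stand-in class rather than the real tensor subclass:

```python
class Int4Demo:  # hypothetical stand-in for Int4GroupwisePreshuffleTensor
    pass

# The path recorded for a serialized object comes from type(obj).__module__,
# not from the file the class happens to be defined in, so pinning __module__
# to the public API path lets the defining file move later without changing
# what gets saved in checkpoints.
Int4Demo.__module__ = "torchao.quantization"

obj = Int4Demo()
print(type(obj).__module__, type(obj).__qualname__)
# prints: torchao.quantization Int4Demo
```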

Contributor Author

Added in #2437.

Contributor

@vkuzo vkuzo left a comment

Thanks for making the changes!

Summary:
Note: slice is not working yet; other ops are working.

Test Plan:
python test/dtypes/test_int4_groupwise_preshuffle.py

Reviewers:

Subscribers:

Tasks:

Tags:

stack-info: PR: #2421, branch: jerryzh168/stack/1
@jerryzh168 jerryzh168 force-pushed the jerryzh168/stack/1 branch from e664a1e to 851dc01 Compare July 2, 2025 20:36
@jerryzh168 jerryzh168 merged commit 2d61be8 into main Jul 3, 2025
18 of 19 checks passed
Labels
CLA Signed, topic: new feature