move six workflows to prototype #3491
base: main
Stack from ghstack (oldest at bottom):
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3491
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 1 Pending
As of commit 00a23fa with merge base ff6d9e2.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Summary:

Moves the following workflows to prototype:
* Int8DynamicActivationInt4WeightConfig
* Int4DynamicActivationInt4WeightConfig
* GemliteUIntXWeightOnlyConfig
* Float8StaticActivationFloat8WeightConfig
* UIntXWeightOnlyConfig
* FPXWeightOnlyConfig

Specifically:
1. move the deprecated workflows to `torchao/prototype/quantization/quant_api.py`
2. move utils to `torchao/quantization/utils.py` as needed
3. make some imports happen at runtime to avoid circular imports
4. change the warnings in the six workflows to reflect their new state

For now, keep BC by importing them from the old location. So, we get the benefits of the simpler `quant_api.py` file with this PR, and we will defer breaking BC until a future time to amortize the eng cost.

Test Plan: CI

Reviewers:
Subscribers:
Tasks:
Tags:

ghstack-source-id: 6e22be1
ghstack-comment-id: 3657392357
Pull-Request: #3491
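For readers unfamiliar with the pattern, here is a minimal, self-contained sketch of what "keep BC by importing them from the old location" plus a function-local (runtime) import can look like. It is not the torchao code: `DemoWeightOnlyConfig`, `demo_prototype_quant_api`, `demo_quant_api`, and `build_config_lazily` are hypothetical names, the warning text is made up, and the two modules are built in memory only so the example runs on its own.

```python
import sys
import types
import warnings


class DemoWeightOnlyConfig:
    """Hypothetical stand-in for one of the moved workflow configs."""

    def __init__(self, group_size: int = 32):
        # Warn on construction so callers learn about the workflow's status.
        warnings.warn(
            "DemoWeightOnlyConfig lives in the prototype namespace and its "
            "API may change.",
            UserWarning,
            stacklevel=2,
        )
        self.group_size = group_size


# New home: a hypothetical prototype module, built in memory for this demo.
new_home = types.ModuleType("demo_prototype_quant_api")
new_home.DemoWeightOnlyConfig = DemoWeightOnlyConfig
sys.modules[new_home.__name__] = new_home

# Old home: re-exports the same object, so existing imports keep working.
old_home = types.ModuleType("demo_quant_api")
old_home.DemoWeightOnlyConfig = new_home.DemoWeightOnlyConfig
sys.modules[old_home.__name__] = old_home


def build_config_lazily(group_size: int) -> DemoWeightOnlyConfig:
    # Importing inside the function defers name resolution to call time,
    # which is one common way to break an import cycle between two modules.
    from demo_quant_api import DemoWeightOnlyConfig as Config

    return Config(group_size=group_size)
```

Because the old module re-exports the very same class object, `isinstance` checks and equality between the two import paths keep working while the old path remains in place.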
Our framework makes it straightforward to add tensor parallel support to your custom quantized tensor subclass. Check out our [tensor parallel tutorial](tutorials/developer_api_guide/tensor_parallel.py) to see how a quantized tensor subclass can be extended to support column and row-wise tensor sharding while maintaining compatibility with `torch.compile`.

### Custom Kernels
this is no longer relevant