Support DeepSeekV3-style block FP8 quantization #372
Conversation
Signed-off-by: mgoin <[email protected]>
Can you produce a test model on nm-testing and add it to this PR?
src/compressed_tensors/compressors/quantized_compressors/nvfp4_quantized.py
Signed-off-by: shanjiaz <[email protected]>
Looks good!
awesome work! clear with nice tests
Quite a few things are packed into this one PR, but the goal is to support the 128x128 weight and 1x128 input quantization adopted by DeepSeek-V3 and Qwen3 models. See examples: https://huggingface.co/deepseek-ai/DeepSeek-V3 and https://huggingface.co/Qwen/Qwen3-0.6B-FP8

block_structure was previously an "NxM" string; it is now a Python list of two integers (e.g. [128, 128]). I added a pydantic validator that converts the string form automatically, so old checkpoints that use it still load. Here is the scheme I am proposing to support this:
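A minimal sketch of what that scheme could look like, using the QuantizationScheme/QuantizationArgs classes from compressed_tensors.quantization (the exact field values here are illustrative, not lifted from the PR diff):

```python
from compressed_tensors.quantization import (
    QuantizationArgs,
    QuantizationScheme,
    QuantizationStrategy,
    QuantizationType,
)

# DeepSeek-V3-style block FP8: static 128x128 weight blocks, with
# dynamic per-token-group (1x128) FP8 input activations.
scheme = QuantizationScheme(
    targets=["Linear"],
    weights=QuantizationArgs(
        num_bits=8,
        type=QuantizationType.FLOAT,
        strategy=QuantizationStrategy.BLOCK,
        symmetric=True,
        dynamic=False,
        block_structure=[128, 128],  # list of ints, not "128x128"
    ),
    input_activations=QuantizationArgs(
        num_bits=8,
        type=QuantizationType.FLOAT,
        strategy=QuantizationStrategy.GROUP,
        symmetric=True,
        dynamic=True,
        group_size=128,  # 1x128 groups along the token dimension
    ),
)
```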
Added this test model to Hugging Face: nm-testing/Qwen3-0.6B-FP8-BLOCK
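For the backward-compatibility path mentioned above, here is a rough sketch of the string-to-list conversion; QuantizationArgsSketch is a hypothetical stand-in, and the actual validator in this PR may differ:

```python
from typing import Any, List, Optional

from pydantic import BaseModel, field_validator


class QuantizationArgsSketch(BaseModel):
    """Illustrative stand-in for the real QuantizationArgs model."""

    block_structure: Optional[List[int]] = None

    @field_validator("block_structure", mode="before")
    @classmethod
    def validate_block_structure(cls, value: Any) -> Any:
        # Old checkpoints stored block_structure as an "NxM" string;
        # convert it to a list of two ints for the new representation.
        if isinstance(value, str):
            return [int(dim) for dim in value.lower().split("x")]
        return value


# Old string form is accepted and normalized...
assert QuantizationArgsSketch(block_structure="128x128").block_structure == [128, 128]
# ...and the new list form passes through unchanged.
assert QuantizationArgsSketch(block_structure=[128, 128]).block_structure == [128, 128]
```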