Commit fafe981
Speed up nvfp4 pack/unpack w/ torch.compile (vllm-project#400)
* Speed up nvfp4 pack/unpack w/ torch.compile
Signed-off-by: Fynn Schmitt-Ulms <[email protected]>
* Add `dynamic=True` to torch.compile call in nvfp4 packing
Signed-off-by: Fynn Schmitt-Ulms <[email protected]>
---------
Signed-off-by: Fynn Schmitt-Ulms <[email protected]>
1 parent abddd38 commit fafe981
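The diff body is not preserved here, so the following is only a minimal sketch of the technique the commit message names: wrapping 4-bit pack/unpack helpers in `torch.compile` with `dynamic=True`, so that tensors of different shapes are less likely to trigger a recompile for each new shape. The function names (`pack_nibbles`, `unpack_nibbles`, `compiled_packers`) and the low-nibble-first byte layout are assumptions made for illustration, not the library's actual code.

```python
import torch


def pack_nibbles(codes: torch.Tensor) -> torch.Tensor:
    """Pack pairs of 4-bit codes (values 0..15) along the last dim into bytes.

    Assumed layout (hypothetical): even positions go to the low nibble,
    odd positions go to the high nibble.
    """
    wide = codes.to(torch.int32)  # widen so the shift cannot overflow
    low = wide[..., 0::2]
    high = wide[..., 1::2]
    return (low | (high << 4)).to(torch.uint8)


def unpack_nibbles(packed: torch.Tensor) -> torch.Tensor:
    """Inverse of pack_nibbles: split each byte back into two 4-bit codes."""
    wide = packed.to(torch.int32)
    low = wide & 0x0F
    high = (wide >> 4) & 0x0F
    # Interleave low/high so the original element order is restored.
    return torch.stack((low, high), dim=-1).flatten(-2).to(torch.uint8)


def compiled_packers():
    """Return compiled versions of both helpers.

    dynamic=True asks the compiler to trace with symbolic shapes up front.
    Compilation itself is lazy and happens on the first call of each
    returned function.
    """
    return (
        torch.compile(pack_nibbles, dynamic=True),
        torch.compile(unpack_nibbles, dynamic=True),
    )
```

Calling `compiled_packers()` once and reusing the returned functions across weight tensors of varying shapes is the intended usage of this sketch. The eager helpers stay available as a fallback on platforms where the compiler backend is unavailable.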
File tree
1 file changed: +5 -4 lines changed
- src/compressed_tensors/compressors/quantized_compressors
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 123-125 | 123-125 | |
| | 126 | + |
| 126-128 | 127-129 | |
| 145-147 | 146-148 | |
| 148-150 | | - |
| | 149-150 | + |
| 151-152 | 151-152 | |
| 153 | | - |
| | 153 | + |
| 154-156 | 154-156 | |
| 174-176 | 174-176 | |
| | 177 | + |
| 177-179 | 178-180 | |