support ascend w8a8 graph_mode #3267
yao-fengchen
commented
Mar 17, 2025
- support dlinfer smooth_quant
- support ascend w8a8 graph_mode
- move the per_channel_quant kernel from the cuda folder to the default folder so that it can be shared with cuda
- modify FunctionDispatcher so that load_func can find the dlinfer folder
- fix a dynamo error in QTensor
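For readers unfamiliar with the term, per-channel quantization in a w8a8 scheme can be sketched as follows. This is a minimal pure-Python illustration that assumes symmetric int8 quantization with one scale per output channel (row). The function name and signature here are hypothetical and are not taken from the lmdeploy kernel, which operates on tensors and may differ in detail.

```python
def per_channel_quant_sketch(weight, n_bits=8):
    """Symmetric per-row quantization (illustrative only, not the lmdeploy kernel).

    weight: list of rows, each row a list of floats (one row per output channel).
    Returns (quantized_rows, scales), where quantized values are Python ints.
    """
    q_max = 2 ** (n_bits - 1) - 1  # 127 for int8
    quantized, scales = [], []
    for row in weight:
        abs_max = max(abs(v) for v in row)
        # Avoid division by zero for an all-zero channel.
        scale = abs_max / q_max if abs_max > 0 else 1.0
        scales.append(scale)
        # Round to nearest integer and clamp to the symmetric range.
        quantized.append(
            [max(-q_max, min(q_max, round(v / scale))) for v in row]
        )
    return quantized, scales


def dequant_sketch(quantized, scales):
    """Inverse of the sketch above: multiply each row by its scale."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]
```

Because each row carries its own scale, a channel with small weights is not forced to share the dynamic range of a channel with large weights, which is the usual motivation for quantizing per channel rather than per tensor.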
5120be9 to e823c48
e823c48 to 5ffb3b3
2137f28 to dcb3a1e
dcb3a1e to 4c9ead8
lmdeploy/lite/apis/smooth_quant.py
Outdated
@@ -7,9 +7,12 @@
import torch
from torch import nn

import lmdeploy.pytorch.devices.device_manager as device_manager
It is not a good idea to import pytorch module in lite.
I modified this in the subsequent commit.
lmdeploy/serve/vl_async_engine.py
Outdated
@@ -6,7 +6,7 @@
import PIL

from lmdeploy.messages import PytorchEngineConfig, TurbomindEngineConfig, VisionConfig
from lmdeploy.pytorch.check_env import try_import_deeplink
from lmdeploy.pytorch.check_env import check_env_deeplink
do not import pytorch in serve
removed
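The layering rule raised in both review threads (no imports of the pytorch package from lite or serve) could be checked mechanically. The sketch below is one possible approach that uses only the standard-library `ast` module. The helper name `find_forbidden_imports` is hypothetical and is not part of lmdeploy.

```python
import ast
from typing import List


def find_forbidden_imports(source: str, forbidden_prefix: str) -> List[str]:
    """Return absolute imports in `source` that fall under `forbidden_prefix`.

    Hypothetical helper for illustration; it only inspects absolute
    `import x` and `from x import y` statements.
    """
    hits = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            names = [node.module]
        else:
            continue
        for name in names:
            # Match the package itself or any submodule of it.
            if name == forbidden_prefix or name.startswith(forbidden_prefix + "."):
                hits.append(name)
    return hits


# The import flagged in lmdeploy/lite/apis/smooth_quant.py above:
flagged = find_forbidden_imports(
    "import lmdeploy.pytorch.devices.device_manager as device_manager",
    "lmdeploy.pytorch",
)
```

Running such a check over the files under lite and serve would surface both imports that the reviewer pointed out, while leaving plain `import torch` untouched.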