[WIP] Enable Ascend NPU Backend with Custom Ops Integration for NF4 Support #1695

Open
wants to merge 5 commits into base: main

Conversation

SlightwindSec

What does this PR do?

This PR ports the Ascend NPU backend changes from the multi-backend-refactor branch and integrates them with custom ops. It enables the Ascend build and translates the existing kernels and ops to Ascend-compatible operators. Because the AscendC-based high-performance NF4 implementation is still in progress, a temporary PyTorch implementation is used for now. From the user's standpoint, the build steps are unchanged.
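For readers unfamiliar with NF4, the following is a minimal, dependency-free sketch of what a blockwise NF4 quantize/dequantize round trip looks like. It is not the code in this PR: the function names `quantize_nf4` and `dequantize_nf4`, the default block size of 64, and the high-nibble-first packing order are assumptions chosen for illustration, and plain Python lists stand in for tensors.

```python
# Illustrative sketch only; NOT the implementation from this PR.
# The 16 NF4 code values as commonly listed for 4-bit NormalFloat
# (treated here as an assumption, not read from the PR's diff).
NF4_CODE = [
    -1.0, -0.6961928009986877, -0.5250730514526367, -0.39491748809814453,
    -0.28444138169288635, -0.18477343022823334, -0.09105003625154495, 0.0,
    0.07958029955625534, 0.16093020141124725, 0.24611230194568634,
    0.33791524171829224, 0.44070982933044434, 0.5626170039176941,
    0.7229568362236023, 1.0,
]


def quantize_nf4(values, blocksize=64):
    """Quantize a flat list of floats to packed 4-bit codes plus per-block absmax."""
    if len(values) % 2 != 0:
        raise ValueError("need an even number of values to pack two codes per byte")
    codes, absmax = [], []
    for start in range(0, len(values), blocksize):
        block = values[start:start + blocksize]
        # Per-block scale; fall back to 1.0 for an all-zero block.
        scale = max(abs(v) for v in block) or 1.0
        absmax.append(scale)
        for v in block:
            normed = v / scale
            # Nearest code value by absolute distance.
            codes.append(min(range(16), key=lambda i: abs(NF4_CODE[i] - normed)))
    # Pack two codes per byte, first code in the high nibble (assumed order).
    packed = bytes((codes[i] << 4) | codes[i + 1] for i in range(0, len(codes), 2))
    return packed, absmax


def dequantize_nf4(packed, absmax, blocksize=64):
    """Invert quantize_nf4: unpack nibbles, look up code values, rescale per block."""
    out = []
    for byte in packed:
        for code in (byte >> 4, byte & 0x0F):
            scale = absmax[len(out) // blocksize]
            out.append(NF4_CODE[code] * scale)
    return out


packed, absmax = quantize_nf4([-2.0, 0.0, 2.0, 1.0], blocksize=4)
restored = dequantize_nf4(packed, absmax, blocksize=4)
```

The round trip is lossy: values that fall between two code points (such as `1.0` in the example, which normalizes to `0.5`) come back as the nearest code value times the block scale.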

Collaborators

@ji-huazhong @Ginray @Runningwater23
cc @Titus-von-Koeller @matthewdouglas @amathews-amd @sunway513

@SlightwindSec SlightwindSec changed the title Enable Ascend NPU Backend with Custom Ops Integration for NF4 Support [WIP] Enable Ascend NPU Backend with Custom Ops Integration for NF4 Support Jul 4, 2025
@matthewdouglas matthewdouglas self-requested a review July 8, 2025 16:41
@matthewdouglas matthewdouglas self-assigned this Jul 8, 2025
@matthewdouglas matthewdouglas added the Ascend NPU Related to Ascend NPU backend label Jul 8, 2025
Labels
Ascend NPU Related to Ascend NPU backend
2 participants