Labels
module: onnx (Related to torch.onnx)
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Description
🚀 The feature, motivation and pitch
Hi, when using `enable_gqa` with `scaled_dot_product_attention`, the ONNX export fails; this is documented.
However, since GQA (grouped-query attention) is currently very popular and the ONNX Attention op already supports it, I was wondering whether there is any plan to add support for it in the exporter, and if so, how soon. Thanks.
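For context, here is a minimal sketch of what the flag does and of a possible interim workaround. It assumes the behavior described in the PyTorch documentation, where `enable_gqa=True` is equivalent to repeating the key and value heads with `repeat_interleave` so that they match the number of query heads. The helper name `sdpa_with_expanded_kv` and the tensor shapes are illustrative assumptions, and whether this form exports cleanly has not been verified against the exporter.

```python
import torch
import torch.nn.functional as F


def sdpa_with_expanded_kv(query, key, value, is_causal=False):
    """Hypothetical helper: emulate GQA by repeating K/V heads explicitly.

    Assumed shapes: query (B, Hq, L, D), key/value (B, Hkv, S, D),
    where Hq is a multiple of Hkv.
    """
    n_q_heads, n_kv_heads = query.size(-3), key.size(-3)
    if n_q_heads % n_kv_heads != 0:
        raise ValueError("query heads must be a multiple of key/value heads")
    group_size = n_q_heads // n_kv_heads
    # Each K/V head is shared by `group_size` consecutive query heads.
    key = key.repeat_interleave(group_size, dim=-3)
    value = value.repeat_interleave(group_size, dim=-3)
    # Plain SDPA call without enable_gqa, since head counts now match.
    return F.scaled_dot_product_attention(query, key, value, is_causal=is_causal)


q = torch.randn(2, 8, 16, 64)  # 8 query heads
k = torch.randn(2, 2, 16, 64)  # 2 key/value heads
v = torch.randn(2, 2, 16, 64)
out = sdpa_with_expanded_kv(q, k, v)
print(out.shape)  # torch.Size([2, 8, 16, 64])
```

The trade-off is that the expanded K/V tensors are materialized explicitly, so any memory savings from GQA are lost in the exported graph; native support in the exporter would avoid that.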
Alternatives
No response
Additional context
No response