Fix model copying for QDQ stripping #784

Open
mklimenk wants to merge 4 commits into ovep-develop

Conversation

mklimenk

Description

This PR fixes the model copying step required by QDQ stripping for GPU and by the bfloat16->float16 conversion. Copying broke when the upstream repo converted initializers to OrtValues.
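
For context, a minimal sketch of what OrtValue-aware initializer copying can look like. GetOrtValueInitializer and AddInitializedOrtValue are assumptions about the upstream onnxruntime::Graph API after the OrtValue refactor; this is not the PR's literal code:

```cpp
// Sketch only: copy initializers from src to dst, preferring the OrtValue
// path now that upstream stores initializer data as OrtValues.
// GetOrtValueInitializer / AddInitializedOrtValue are assumed accessors;
// names and signatures may differ in the actual tree.
#include "core/graph/graph.h"

onnxruntime::common::Status CopyInitializers(const onnxruntime::Graph& src,
                                             onnxruntime::Graph& dst) {
  for (const auto& [name, tensor_proto] : src.GetAllInitializedTensors()) {
    OrtValue ort_value;
    if (src.GetOrtValueInitializer(name, ort_value, /*check_outer_scope=*/false)) {
      // The data lives in an OrtValue rather than inside the TensorProto,
      // so hand the OrtValue across instead of duplicating raw bytes.
      ORT_RETURN_IF_ERROR(dst.AddInitializedOrtValue(*tensor_proto, ort_value));
    } else {
      // Fallback: the initializer is still a self-contained TensorProto.
      dst.AddInitializedTensor(*tensor_proto);
    }
  }
  return onnxruntime::common::Status::OK();
}
```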

To keep the changes in a single place, PR #768, which contains a small related fix, is embedded in this PR.

https://jira.devtools.intel.com/browse/CVS-171536

@mklimenk
Author

@ankitm3k @sfatimar could you please review it?

ankitm3k requested a review from Copilot on August 21, 2025 at 16:21

@ankitm3k left a comment


LGTM


Copilot AI left a comment


Pull Request Overview

This PR fixes model copying functionality required for QDQ (Quantize/Dequantize) stripping when processing GPU models with bfloat16->float16 conversions and 16-bit integer quantization. The changes update the model copying mechanism to handle OrtValue initializers and refine the conditions for QDQ optimization.

  • Updated model copying logic to use OrtValue-based initializer handling instead of TensorProto copying
  • Added support for INT16/UINT16 data types in type checking, moving the logic from the experimental GPU-only section to general support (see the sketch after this list)
  • Enhanced QDQ graph detection to specifically identify graphs with 16-bit quantization for targeted GPU optimization
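
As an illustration of the second bullet, a hedged sketch of moving INT16/UINT16 into the generally supported element types; the container and function names below are illustrative, not the EP's actual data_ops.cc structures:

```cpp
// Illustrative only: the real data_ops.cc keeps versioned per-device lists.
// The point is that INT16/UINT16 now sit in the common set rather than
// behind an experimental GPU-only branch.
#include <set>
#include "onnx/onnx_pb.h"

static const std::set<int> kGenerallySupportedTypes = {
    ONNX_NAMESPACE::TensorProto_DataType_FLOAT,
    ONNX_NAMESPACE::TensorProto_DataType_FLOAT16,
    ONNX_NAMESPACE::TensorProto_DataType_INT8,
    ONNX_NAMESPACE::TensorProto_DataType_UINT8,
    ONNX_NAMESPACE::TensorProto_DataType_INT16,   // moved out of the GPU-only block
    ONNX_NAMESPACE::TensorProto_DataType_UINT16,  // moved out of the GPU-only block
};

bool IsTypeSupported(int onnx_elem_type) {
  return kGenerallySupportedTypes.count(onnx_elem_type) > 0;
}
```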

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

Files changed:

  • qdq_scales_fix.cpp: simplified model copying to use OrtValue initializers and removed the redundant TensorProto copying logic
  • data_ops.cc: moved INT16/UINT16 type support from the experimental GPU-only section to general type support
  • backend_manager.cc: added detection of QDQ graphs with 16-bit quantization and refined the GPU optimization conditions (a sketch of the detection idea follows below)
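
To make the backend_manager.cc detection concrete, a sketch under assumed names (GraphHas16BitQDQ is hypothetical, not the PR's code): flag a graph for the GPU-specific QDQ path if any Q/DQ node touches int16/uint16 tensors.

```cpp
// Assumed helper, not the PR's literal code: report whether any Q/DQ node
// in the graph touches int16/uint16 tensors.
#include "core/graph/graph_viewer.h"

static bool Is16BitElem(const ONNX_NAMESPACE::TypeProto* type) {
  if (type == nullptr || !type->has_tensor_type()) return false;
  const auto elem = type->tensor_type().elem_type();
  return elem == ONNX_NAMESPACE::TensorProto_DataType_INT16 ||
         elem == ONNX_NAMESPACE::TensorProto_DataType_UINT16;
}

bool GraphHas16BitQDQ(const onnxruntime::GraphViewer& graph_viewer) {
  for (auto node_idx : graph_viewer.GetNodesInTopologicalOrder()) {
    const auto* node = graph_viewer.GetNode(node_idx);
    if (node->OpType() != "QuantizeLinear" && node->OpType() != "DequantizeLinear")
      continue;
    // A 16-bit Q produces int16/uint16 outputs; a 16-bit DQ consumes them.
    for (const auto* def : node->InputDefs())
      if (Is16BitElem(def->TypeAsProto())) return true;
    for (const auto* def : node->OutputDefs())
      if (Is16BitElem(def->TypeAsProto())) return true;
  }
  return false;
}
```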


ankitm3k requested a review from MayureshV1 on August 22, 2025 at 06:28

@MayureshV1 left a comment


LGTM! The impact of this PR is limited to GPU and to INT16, UINT16, and BF16 only.
Waiting on confirmation from validation of GPU customer models before we merge.
