Skip to content

Conversation

@byungilm
Copy link
Contributor

@byungilm byungilm commented Nov 21, 2025

Description of the issue

  • iGPU execution showed worse performance than CPU
    benchmark_app -m irsmodel.onnx -d GPU -ip f16 Result is 160 FPS
    benchmark_app -m irsmodel.onnx -d CPU -ip f16 Result is 250 FPS
  • Root cause is that deconvolution with simple format execution showed worse performance.
  • cpu : image
  • igpu :
    image

The code and line that caused this issue

  • Convolution layers are only counted while selecting blocked formats even though Deconvolution shows lower performance with simple format :
    program.cpp:1622

Reproduction step and snapshot

  • Reproduced by benchmark
    .\benchmark_app.exe -m models\irsmodel\irsmodel.onnx -d GPU.0 -t 10 -b 1

Checklist

  • Is it a proper fix?
  • Did you include test case for this fix, if necessary?
  • Did you review existing test that can be extended to cover this scenario? Passed llm_bench

Tickets:

@byungilm byungilm requested review from a team as code owners November 21, 2025 05:33
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Nov 21, 2025
@byungilm byungilm changed the title [TEMP] Apply blocked format only for deconv Apply blocked format for deconv Nov 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: GPU OpenVINO GPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant