Legalizer support for vectors in G_FPEXT #548

Sameeranjoshi · 2025-07-16T23:38:23Z

G_FPEXT crashed for vector types when converting <32xbfloat> -> <32xfloat> as vectors were not handled in legalizer. This patch adds support for vectors with a custom legalizer rule using following pattern

   VDst = G_FPEXT VSrc

converts to something like below

   ZeroVec = G_AIE_BROADCAST_VECTOR VSrc
   VShuffleLow = G_AIE_SHUFFLE_VECTOR ZeroVec, VSrc, 2
   VShuffleHigh = G_AIE_SHUFFLE_VECTOR ZeroVec, VSrc, 3
   VShuffleLow = G_BITCAST VShuffleLow
   VShuffleHigh = G_BITCAST VShuffleHigh
   VDst = G_CONCAT_VECTORS VShuffleLow, VShuffleHigh

Patch adds tests to make sure all the tests with `vector.multi_reduction` generate successfully pass Peano legalizer and generate efficient vectorized code. This patch checks only the IREE side to keep the dependency minimun on Peano. (Depends on Peano: 1. Xilinx/llvm-aie#548 2. Xilinx/llvm-aie#557 ) 1. `reassociateFpReductions=true` is must else code is scalarized. This flag could be added into the IREE vectorization pipeline to trigger automatically. 2. bf16/i32/f32 all types with different sizes work now.

…d code. This patch is dependent on Xilinx#548 and Xilinx#557. Previously bf16 and f32 failed to generate fully vectorized code and used to scalarize, this test makes sures different types and vector sizes work and are fully vectorized. This is a supplementary patch for verifying below pipeline: Part 1: `vector.multi_reduction` to `vector.reduction` to `llvm.vector.reduce.fadd.*` nod-ai/iree-amd-aie#1336 Part 2: Further lowers to AIE2P instructions.(This patch)

Sameeranjoshi · 2025-08-07T22:00:19Z

Squashed into #604

[AutoBump] Merge with fixes of 977d744 (Jan 20) (10) (May need downstream changes)

Sameeranjoshi added 7 commits July 11, 2025 09:28

fpext, customFor way

b0add75

Instruction selection method

a097383

G_FPEXT passes legalizer for bf16->f32 vector types.

19925db

CustomIf condition checks

c584c63

More tests and non-powers of 2, less < 256 bits

67942ea

tests

0c980a0

Clang-format, fix failing tests

7c3257c

Sameeranjoshi requested review from F-Stuckmann, SagarMaheshwari99, abhinay-anubola, abnikant, andcarminati, katerynamuts, khallouh, konstantinschwarz, martien-de-jong, niwinanto and stephenneuendorffer as code owners July 16, 2025 23:38

This was referenced Jul 16, 2025

Direct codegen vectorized lowering of reduction operation nod-ai/iree-amd-aie#1306

Closed

Handle Vector types in G_FADD using G_FPEXT #557

Closed

Sameeranjoshi mentioned this pull request Jul 21, 2025

[Reduction] Verify different sizes and types work for reduction. nod-ai/iree-amd-aie#1336

Open

Sameeranjoshi closed this Aug 7, 2025

mgehre-amd pushed a commit that referenced this pull request Aug 21, 2025

Merge pull request #548 from Xilinx/bump_to_977d744b

815db3c

[AutoBump] Merge with fixes of 977d744 (Jan 20) (10) (May need downstream changes)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Legalizer support for vectors in G_FPEXT #548

Legalizer support for vectors in G_FPEXT #548

Uh oh!

Sameeranjoshi commented Jul 16, 2025 •

edited

Loading

Uh oh!

Sameeranjoshi commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Legalizer support for vectors in G_FPEXT #548

Legalizer support for vectors in G_FPEXT #548

Uh oh!

Conversation

Sameeranjoshi commented Jul 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Sameeranjoshi commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Sameeranjoshi commented Jul 16, 2025 •

edited

Loading