Test vector fpadd #562

Sameeranjoshi · 2025-07-22T00:35:10Z

No description provided.

1. Use a custom legalizer for bf16(only works for smaller than 32 elements) 2. Use underlying G_FPEXT to first convert bf16 -> f32 then perform add. G_FADD(G_FPEXT(V1), G_FPEXT(V2)) Convert back to original types and shapes if needed. 3. Can see fully vectorized code which wasn't seen before.

Break <64xbf16> into 2 chunks of <32xbf16>.

Pending check: Not sure about how to verify pad and unpad logic, seems it's unrolling into a lot of boilerplate code.

Less than Vectors of f32 = 16, 32 gets converted to 64xf32 as those are legal. Vectors of bf16 = 32xbf16 is a custom case converts other sizes into this corresponding vector.

…d code. This patch is dependent on Xilinx#548 and Xilinx#557. Previously bf16 and f32 failed to generate fully vectorized code and used to scalarize, this test makes sures different types and vector sizes work and are fully vectorized. This is a supplementary patch for verifying below pipeline: Part 1: `vector.multi_reduction` to `vector.reduction` to `llvm.vector.reduce.fadd.*` nod-ai/iree-amd-aie#1336 Part 2: Further lowers to AIE2P instructions.(This patch)

Implement support in legalizer for Float32 types.

Sameeranjoshi · 2025-08-07T21:59:42Z

Squashed into #604

Sameeranjoshi added 13 commits July 11, 2025 09:28

fpext, customFor way

b0add75

Instruction selection method

a097383

G_FPEXT passes legalizer for bf16->f32 vector types.

19925db

CustomIf condition checks

c584c63

More tests and non-powers of 2, less < 256 bits

67942ea

tests

0c980a0

Clang-format, fix failing tests

7c3257c

[AIE2p] Handle failing <64xbf16> test case.

019b723

Break <64xbf16> into 2 chunks of <32xbf16>.

Clang format

5d78388

[AIE2P] Add test to validate vector of bf16 with 16,32,64 shape.

b966462

Pending check: Not sure about how to verify pad and unpad logic, seems it's unrolling into a lot of boilerplate code.

[AIE2P] Fix bf16 and f32 crash

543ca14

Less than Vectors of f32 = 16, 32 gets converted to 64xf32 as those are legal. Vectors of bf16 = 32xbf16 is a custom case converts other sizes into this corresponding vector.

Sameeranjoshi requested review from F-Stuckmann, SagarMaheshwari99, abhinay-anubola, abnikant, andcarminati, katerynamuts, khallouh, konstantinschwarz, martien-de-jong, niwinanto and stephenneuendorffer as code owners July 22, 2025 00:35

Sameeranjoshi mentioned this pull request Jul 22, 2025

Direct codegen vectorized lowering of reduction operation nod-ai/iree-amd-aie#1306

Closed

Sameeranjoshi changed the title ~~Sam test vector fpadd~~ Test vector fpadd Aug 3, 2025

Sameeranjoshi changed the title ~~Test vector fpadd~~ Test vector fpadd Aug 3, 2025

[AIE2P] Fix wrong modes for VSHUFFLE.

6e4bb40

Implement support in legalizer for Float32 types.

Sameeranjoshi closed this Aug 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Test vector fpadd #562

Test vector fpadd #562

Uh oh!

Sameeranjoshi commented Jul 22, 2025

Uh oh!

Sameeranjoshi commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Test vector fpadd #562

Test vector fpadd #562

Uh oh!

Conversation

Sameeranjoshi commented Jul 22, 2025

Uh oh!

Sameeranjoshi commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant