[AIE2] Combiners for 8x8->8x8 and 8x4->4x8 matrix transposes #76

ValentijnvdBeek · 2024-06-15T12:34:49Z

This merge request adds the same framework for Shuffle Vector combinations used in #41 to the AIE2 backend. It also defines a new generator that produces the pattern used by a decent subset of the shuffle vector modes. Concretely, it matches mode 35 (4x4 -> 4x4 matrix transpose) and mode 29 (8x4->4x8 matrix transpose). Finally, it adds a generic opcode for the AIE vshuffle instruction.

konstantinschwarz

The shuffle modes description is difficult to read at first. Each element type and matrix dimension uses its own shuffle mode.

llvm/test/CodeGen/AIE/aie2/GlobalISel/prelegalizercombiner-shufflevector.mir

konstantinschwarz · 2024-06-21T22:43:17Z

+    %1:_(<64 x s8>) = COPY $x0
+    %2:_(<64 x s8>) = COPY $x1
+    %0:_(<64 x s8>) = G_SHUFFLE_VECTOR %1:_(<64 x s8>), %2:_, shufflemask(0, 16, 32, 48, 1, 17, 33, 49, 2, 18, 34, 50, 3, 19, 35, 51, 4, 20, 36, 52, 5, 21, 37, 53, 6, 22, 38, 54, 7, 23, 39, 55, 8, 24, 40, 56, 9, 25, 41, 57, 10, 26, 42, 58, 11, 27, 43, 59, 12, 28, 44, 60, 13, 29, 45, 61, 14, 30, 46, 62, 15, 31, 47, 63)


Mode 35 operates on an 8-bit element 8x8 matrix.
That would need to match shufflemask(0, 8, 16, 24, 1, 9, 17, 25, ...)?

ValentijnvdBeek · 2024-07-16

Yes, it does, but vshuffle takes 1024 bits of input and ignores the higher bits. Mode 35 operates on an 8x8 matrix of 8-bit elements, i.e. a 64x8 vector, which is 512 bits. In the first case, those two are split into two 4x8 8-bit vectors; the second case is the "common" case where we just ignore the higher-order bits.

What you propose would be a 4x8 8-bit match, which is a 32x8 vector, or 256 bits.
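
The index arithmetic behind these masks can be checked with a small sketch. It assumes a row-major matrix whose elements all come from the first shuffle operand. `transpose_mask` is a hypothetical helper for illustration, not part of the PR, and it says nothing about which vshuffle mode the hardware assigns to a given shape.

```python
def transpose_mask(rows: int, cols: int) -> list[int]:
    """Shuffle mask that transposes a row-major rows x cols matrix.

    Output lane (c * rows + r) reads input lane (r * cols + c).
    """
    return [r * cols + c for c in range(cols) for r in range(rows)]


# Reproduces the 64-lane mask in the MIR test above: 0, 16, 32, 48, 1, 17, ...
test_mask = transpose_mask(4, 16)

# Reproduces the prefix proposed in the review: 0, 8, 16, 24, 1, 9, 17, 25, ...
proposed_mask = transpose_mask(4, 8)

# A square 8x8 transpose walks a full column of 8 lanes before moving on.
square_mask = transpose_mask(8, 8)

print(test_mask[:8], len(test_mask))
print(proposed_mask[:8], len(proposed_mask))
print(square_mask[:9], len(square_mask))
```

Under this reading, the proposed mask covers 32 lanes (256 bits of 8-bit elements), while the test mask and the square 8x8 mask both cover 64 lanes (512 bits).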

ValentijnvdBeek · 2024-06-24T16:21:12Z

The shuffle modes description is difficult to read at first. Each element type and matrix dimension uses its own shuffle mode.

Yeah, it is a strange part of the ISA. At the moment I am orienting myself by the image descriptions, since from the perspective of G_SHUFFLE_VECTOR I don't really care what the original input was. I'll double-check whether I got that right, but being a bit more stringent on the size requirements seems prudent to me.

ValentijnvdBeek · 2024-08-15T13:40:53Z

llvm/test/CodeGen/AIE/aie2/GlobalISel/prelegalizercombiner-shufflevector.mir

---
-name:            concat_vector_reverse_32_512_random
-legalized:       false
-body:             |
-  bb.1.entry:
-    liveins: $wl2, $wl4
-    ; CHECK-LABEL: name: concat_vector_reverse_32_512_random
-    ; CHECK: liveins: $wl2, $wl4
-    ; CHECK-NEXT: {{  $}}
-    ; CHECK-NEXT: [[COPY:%[0-9]+]]:_(<8 x s32>) = COPY $wl2
-    ; CHECK-NEXT: [[COPY1:%[0-9]+]]:_(<8 x s32>) = COPY $wl4
-    ; CHECK-NEXT: [[CONCAT_VECTORS:%[0-9]+]]:_(<16 x s32>) = G_CONCAT_VECTORS [[COPY1]](<8 x s32>), [[COPY]](<8 x s32>)


These got moved somehow; I will fix it when I get to it.

ValentijnvdBeek added the llvm:globalisel (Code that modifies the Global Instruction Selection), vectorization (Support for vector instructions), llvm:instcombine (Code that modifies the combiner), and backend:aie2 labels Jun 15, 2024

ValentijnvdBeek self-assigned this Jun 15, 2024

ValentijnvdBeek requested review from abhinay-anubola, abnikant, andcarminati, gbossu, khallouh, konstantinschwarz, martien-de-jong, SagarMaheshwari99 and stephenneuendorffer as code owners June 15, 2024 12:34

konstantinschwarz requested changes Jun 21, 2024

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch from 0e664aa to f872cf4 Compare June 24, 2024 16:16

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch from f872cf4 to 73a92c2 Compare June 25, 2024 09:36

ValentijnvdBeek changed the title ~~[AIE2] Combiners for 4x4->4x4 and 8x4->4x8 matrix transposes~~ [AIE2] Combiners for 8x8->8x8 and 8x4->4x8 matrix transposes Jun 25, 2024

ValentijnvdBeek force-pushed the vvandebe.vshuffle.impl branch from be3751a to 5654047 Compare June 25, 2024 16:02

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch from 73a92c2 to e160d1c Compare June 26, 2024 11:39

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch from e160d1c to 07244e7 Compare July 15, 2024 15:06

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch from 07244e7 to 4836f6a Compare August 1, 2024 16:46

ValentijnvdBeek force-pushed the vvandebe.vshuffle.impl branch from 5654047 to 84f3995 Compare August 2, 2024 11:22

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch 2 times, most recently from b41f4e1 to ea44d18 Compare August 7, 2024 18:27

ValentijnvdBeek force-pushed the vvandebe.vshuffle.impl branch from 84f3995 to 5c3b1a6 Compare August 7, 2024 18:35

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch 2 times, most recently from 4c022df to b49d34c Compare August 12, 2024 10:38

[AIE2] Enable G_CONCAT_VECTOR optimizations for AIE2

2ae63a8

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch 2 times, most recently from f855c29 to 4d6af83 Compare August 13, 2024 14:46

ValentijnvdBeek added 6 commits August 15, 2024 09:44

[GISel][CombinerHelper] Add a generator that counts from 0 to n

a3ae452

[GISel][CombinerHelper] Use a stream to check for G_CONCAT_VECTOR

df516f9

We check for iterative shift masks which corresponds to the CONCAT_VECTOR instruction.

[GISel][CombinerHelper] Add a helper that unmerges a vector to a target size

b8652cc

[GISel][CombinerHelper] Add two patterns that extract the first two chunks of a vector

6cb3e09

[GISel][CombinerHelper] Add a function that chains a list of generators together

40e0c76

[GISel][CombinerHelper] Add a combiner to concatenate the first halfs of two vectors together

13cc82a
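
The commits above describe matching shuffle masks against generated index sequences. A minimal sketch of that idea follows, in Python rather than the C++ of the actual patches. `count_range`, `chain_generators`, `mask_matches`, and the three pattern functions are hypothetical names, the example masks are illustrative rather than copied from the tests, and treating `-1` (undefined) lanes as wildcards is an assumption rather than something the PR states.

```python
from itertools import chain
from typing import Iterable, Iterator

UNDEF = -1  # assumption: undefined mask lanes are encoded as -1


def count_range(start: int, stop: int) -> Iterator[int]:
    """Yield start, start + 1, ..., stop - 1."""
    return iter(range(start, stop))


def chain_generators(*gens: Iterable[int]) -> Iterator[int]:
    """Run several index generators back to back."""
    return chain(*gens)


def mask_matches(mask: list[int], expected: Iterable[int]) -> bool:
    """True if every defined lane of `mask` equals the generated index."""
    expected = list(expected)
    if len(mask) != len(expected):
        return False
    return all(m == UNDEF or m == e for m, e in zip(mask, expected))


def concat_pattern(n: int) -> Iterator[int]:
    """Lanes 0..2n-1: first source followed by second source."""
    return count_range(0, 2 * n)


def reversed_concat_pattern(n: int) -> Iterator[int]:
    """Second source (lanes n..2n-1) followed by first source (0..n-1)."""
    return chain_generators(count_range(n, 2 * n), count_range(0, n))


def first_halves_pattern(n: int) -> Iterator[int]:
    """First half of the first source, then first half of the second."""
    return chain_generators(count_range(0, n // 2), count_range(n, n + n // 2))


# Two 8-lane sources, matching the <8 x s32> operand shape in the tests.
n = 8
reversed_mask = [8, 9, 10, 11, 12, 13, 14, 15, 0, 1, 2, 3, 4, 5, 6, 7]
halves_mask = [0, 1, UNDEF, 3, 8, 9, 10, 11]

print(mask_matches(reversed_mask, reversed_concat_pattern(n)))
print(mask_matches(reversed_mask, concat_pattern(n)))
print(mask_matches(halves_mask, first_halves_pattern(n)))
```

With this structure, each new pattern only needs its own generator; the comparison loop stays the same.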

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch from 4d6af83 to 13cc82a Compare August 15, 2024 08:52

ValentijnvdBeek added 2 commits August 15, 2024 10:02

[AIE2] AIE2 custom shuffle vector mask support

6aa86b5

[AIE2][Combiner] Add generator for alternating sequences

302563b

ValentijnvdBeek force-pushed the vvandebe.vshuffle.impl branch 2 times, most recently from e05b018 to aec1600 Compare August 15, 2024 13:21

ValentijnvdBeek added 3 commits August 15, 2024 14:25

[AIE2] Helper function for creating VSHUFFLE instructions

4cf13dd

[AIE2] Replace 8x8->8x8 & 8x4 ->4x8 tranpose shuffle vector with vshuffle

4c349de

[AIE2] Implement vshuffle instruction selection

d1d0a3a

ValentijnvdBeek force-pushed the vvandebe.vshuffle.impl branch from aec1600 to d1d0a3a Compare August 15, 2024 13:35

ValentijnvdBeek commented Aug 15, 2024

ValentijnvdBeek force-pushed the vvandebe.shufflevector.pattern.optimization branch 2 times, most recently from c38562a to ebe6489 Compare September 23, 2024 22:36
