SIMD/Loop framework upgrade #2937

AlexandreEichenberger · 2024-09-10T18:27:30Z

Added SIMD support for loops whose iteration count is not a multiple of the vector length (VL).

Because we can now generate SIMD code using either Krnl, Affine, or SCF, it was painful to have a different way to generate loops for each. I now have a unified interface that creates loops across all 3 dialects:

  void forLoopIE(IndexExpr lb, IndexExpr ub, int64_t step, bool useParallel,
      mlir::function_ref<void(SCFBuilder &, mlir::ValueRange)> bodyFn) const;

which takes the lower and upper bounds as IndexExpr values, the step, a boolean selecting whether the loop is sequential or parallel, and the function that generates the loop body.

An example is shown here:

  // Invocation of the (possibly parallel) SIMD loop.
  if constexpr (std::is_same<BUILDER, KrnlBuilder>::value ||
                std::is_same<BUILDER, AffineBuilder>::value ||
                std::is_same<BUILDER, SCFBuilder>::value)
    builder.forLoopIE(lb, simdUb, VL, useParallel, simdLoopBody);
  else
    llvm_unreachable("BUILDER type not supported\n");
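
The `simdUb` bound in the call above suggests that the VL-stepped loop only covers whole vectors. As a minimal sketch in plain C++ (not onnx-mlir code), one way to handle a trip count that is not a multiple of VL is a main loop up to `simdUb` followed by a scalar remainder loop. The helper names `computeSimdUb` and `addOne` are hypothetical, and the scalar remainder is an assumption: the PR text does not say how the leftover iterations are generated.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper: largest bound <= ub that is reachable from lb
// in whole steps of VL.
int64_t computeSimdUb(int64_t lb, int64_t ub, int64_t VL) {
  assert(VL > 0 && lb <= ub);
  return lb + ((ub - lb) / VL) * VL;
}

// Adds 1.0f to every element of data[lb, ub).
void addOne(std::vector<float> &data, int64_t lb, int64_t ub, int64_t VL) {
  int64_t simdUb = computeSimdUb(lb, ub, VL);
  // Main loop: steps by VL; the inner loop models one VL-wide vector op.
  for (int64_t i = lb; i < simdUb; i += VL)
    for (int64_t lane = 0; lane < VL; ++lane)
      data[i + lane] += 1.0f;
  // Remainder loop (assumed scalar): at most VL-1 leftover iterations.
  for (int64_t i = simdUb; i < ub; ++i)
    data[i] += 1.0f;
}
```

With 10 iterations and VL = 4, the main loop covers indices 0 to 7 and the remainder loop covers indices 8 and 9.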

This complements the 3 SIMD calls: simdIterateIE, simdReduceIE, and simdReduce2DIE. The last 2 calls both perform reductions, but simdReduceIE uses horizontal/do-across reductions (e.g. available on z16 with integer add), whereas simdReduce2DIE uses shuffles to mix VL consecutive reductions.
All SIMD calls now work with an arbitrary number of loop iterations (whether a multiple of the hardware vector length or not).
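
To make the two reduction styles mentioned above concrete, here is a small plain C++ model with VL fixed at 4 and a vector represented as a `std::array`. It is a sketch under assumptions, not the onnx-mlir implementation: the names `horizontalAdd` and `shuffleReduce4` are hypothetical, and the particular two-stage shuffle pattern is one common choice that the PR does not confirm.

```cpp
#include <array>
#include <cassert>

constexpr int VL = 4;
using Vec = std::array<int, VL>;

// Lane-wise addition of two vectors.
Vec add(const Vec &a, const Vec &b) {
  Vec r{};
  for (int i = 0; i < VL; ++i)
    r[i] = a[i] + b[i];
  return r;
}

// Horizontal reduction: collapse the lanes of one vector into a scalar.
int horizontalAdd(const Vec &v) {
  int sum = 0;
  for (int x : v)
    sum += x;
  return sum;
}

// Shuffle-based reduction of VL = 4 consecutive partial-sum vectors.
// Lane i of the result holds the full sum of the i-th input vector.
Vec shuffleReduce4(const Vec &a, const Vec &b, const Vec &c, const Vec &d) {
  // Stage 1: combine the low and high halves of each pair of inputs.
  Vec ab = add(Vec{a[0], a[1], b[0], b[1]}, Vec{a[2], a[3], b[2], b[3]});
  Vec cd = add(Vec{c[0], c[1], d[0], d[1]}, Vec{c[2], c[3], d[2], d[3]});
  // Stage 2: combine the even and odd lanes of the stage-1 results.
  return add(Vec{ab[0], ab[2], cd[0], cd[2]}, Vec{ab[1], ab[3], cd[1], cd[3]});
}
```

The shuffle variant produces VL results in a single vector, so they can be written with one vector store, whereas the horizontal variant yields one scalar per reduction.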

To provide the same functionality in both SIMD reduce calls, I now expect one lambda function per output (previously, a single lambda function generated all outputs).
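
The "one lambda per output" shape can be illustrated with a plain C++ sketch. Everything here is hypothetical (`ReduceFn`, `reduceAll`, and the use of scalars instead of MLIR values); it only shows how a driver can apply a separate function to each output's accumulator rather than one function that updates all outputs at once.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical per-output reduction body: combines the running
// accumulator of one output with the next input value.
using ReduceFn = std::function<float(float acc, float x)>;

// Runs every per-output function over the same input stream.
// inits[o] is the initial accumulator for output o; fns[o] is its body.
std::vector<float> reduceAll(const std::vector<float> &input,
    const std::vector<float> &inits, const std::vector<ReduceFn> &fns) {
  assert(inits.size() == fns.size());
  std::vector<float> accs = inits;
  for (float x : input)
    for (std::size_t o = 0; o < fns.size(); ++o)
      accs[o] = fns[o](accs[o], x);
  return accs;
}
```

Because each output has its own function, the driver is free to treat outputs independently, for example by finalizing each one with a different reduction strategy.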

We also had different calls for memory load/store. A common interface is now used for Krnl, Affine, and MemRef, and a nearly identical one for Vector (where the load operation needs the type to determine the VL).

They all use the calls below:

  mlir::Value load(mlir::Value memref, mlir::ValueRange indices = {},
      mlir::ValueRange offsets = {}) const;
  mlir::Value loadIE(mlir::Value memref, mlir::ArrayRef<IndexExpr> indices = {},
      mlir::ValueRange offsets = {}) const;
  void store(mlir::Value val, mlir::Value memref, mlir::ValueRange indices = {},
      mlir::ValueRange offsets = {}) const;
  void storeIE(mlir::Value val, mlir::Value memref,
      mlir::ArrayRef<IndexExpr> indices, mlir::ValueRange offsets = {}) const;
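
The signatures above take optional `indices` and `offsets`. The following plain C++ sketch models one plausible reading of them on a toy row-major buffer; it is an assumption, not taken from the PR text, that offsets are added to the trailing (least significant) indices. `ToyMemRef` and `linearize` are hypothetical stand-ins, and integers replace the MLIR values and IndexExpr of the real interface.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a ranked, row-major memref of floats.
struct ToyMemRef {
  std::vector<int64_t> shape;
  std::vector<float> data;
};

// Assumption: offsets are added to the trailing indices, then the
// resulting indices are linearized in row-major order.
std::size_t linearize(const ToyMemRef &m, std::vector<int64_t> indices,
    const std::vector<int64_t> &offsets) {
  assert(indices.size() == m.shape.size());
  assert(offsets.size() <= indices.size());
  std::size_t skip = indices.size() - offsets.size();
  for (std::size_t i = 0; i < offsets.size(); ++i)
    indices[skip + i] += offsets[i];
  int64_t pos = 0;
  for (std::size_t d = 0; d < indices.size(); ++d) {
    assert(indices[d] >= 0 && indices[d] < m.shape[d]);
    pos = pos * m.shape[d] + indices[d];
  }
  return static_cast<std::size_t>(pos);
}

float load(const ToyMemRef &m, const std::vector<int64_t> &indices = {},
    const std::vector<int64_t> &offsets = {}) {
  return m.data[linearize(m, indices, offsets)];
}

void store(float val, ToyMemRef &m, const std::vector<int64_t> &indices = {},
    const std::vector<int64_t> &offsets = {}) {
  m.data[linearize(m, indices, offsets)] = val;
}
```

In this model, `store(7.0f, m, {1, 0}, {2})` on a 2x3 buffer writes element (1, 2), and the defaulted arguments let a rank-0 buffer be accessed with `load(m)`.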

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

tungld

LGTM. Really appreciate your effort of simplifying these interfaces!

jenkins-droid · 2024-09-18T12:04:06Z

Jenkins Linux amd64 Build #15654 [push] SIMD/Loop framework upgr... started at 07:04

jenkins-droid · 2024-09-18T12:04:07Z

Jenkins Linux s390x Build #15657 [push] SIMD/Loop framework upgr... started at 08:04

jenkins-droid · 2024-09-18T12:05:07Z

Jenkins Linux ppc64le Build #14685 [push] SIMD/Loop framework upgr... started at 08:16

jenkins-droid · 2024-09-18T13:12:49Z

Jenkins Linux amd64 Build #15654 [push] SIMD/Loop framework upgr... passed after 1 hr 8 min

jenkins-droid · 2024-09-18T13:29:23Z

Jenkins Linux s390x Build #15657 [push] SIMD/Loop framework upgr... passed after 1 hr 25 min

jenkins-droid · 2024-09-18T14:09:03Z

Jenkins Linux ppc64le Build #14685 [push] SIMD/Loop framework upgr... passed after 2 hr 3 min

AlexandreEichenberger added 23 commits September 3, 2024 19:24

transformed simd iterate to a list of function

02d4cc0

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

works with collections of function

5d579bd

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

cleanup

3338ec9

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

cleanup

6888081

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

initial new version for 2D reduce

23c4ea6

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

working but not expanded opportunities

940cba5

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

cleanup

2d65922

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

working without extending to non fullSimd

e380d5d

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

update

f2456dc

update

a51656b

comments

396e9bd

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

moved fn defs to top header file

1ca7b33

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

update

135b115

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

common for / parallel interface

8db1952

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

working

ae5d6d2

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

upgrade of tests

0c12c11

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

update

b50bd6a

format

3d376cf

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

added comments

f89c8e7

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

remove {} that are not necessary in load/store anymore

766d84a

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

update

03b2543

flipped debug mode back to normal

9d7be77

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

format

8f3eb75

Signed-off-by: Alexandre Eichenberger <alexe@us.ibm.com>

AlexandreEichenberger requested a review from tungld September 11, 2024 14:27

Merge branch 'main' into simd-framwork-v1

9cdfc55

AlexandreEichenberger requested a review from chentong319 September 11, 2024 14:53

AlexandreEichenberger added 3 commits September 13, 2024 14:29

Merge branch 'main' into simd-framwork-v1

3fbd9fe

Merge branch 'main' into simd-framwork-v1

7add7d9

Merge branch 'main' into simd-framwork-v1

4836f22

tungld approved these changes Sep 18, 2024

AlexandreEichenberger merged commit 9dd7c4a into onnx:main Sep 18, 2024
7 checks passed
