
[compiler][flow] Move cast, reshape and bitcast after transfer op #18742

Open · wants to merge 4 commits into main

Conversation

sogartar (Contributor)

We got incoming IR of the form
```mlir
  %cast = tensor.cast %0 : tensor<2xf32> to tensor<?xf32>
  %2 = flow.tensor.transfer %cast : tensor<?xf32>{%c2} to
    #hal.device.affinity<@__device_0>
  %cast_2 = tensor.cast %2 : tensor<?xf32> to tensor<2xf32>
```

We would like the two casts to fold. We would also like to reduce the dynamism of the transfer op, so that it operates on a tensor with fewer dynamic dimensions.

Signed-off-by: Boian Petkantchin <boian.petkantchin@amd.com>
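
If both goals are met, the IR above should canonicalize to a transfer on the fully static type, roughly (a sketch of the intended end state, not verbatim compiler output):

```mlir
  %2 = flow.tensor.transfer %0 : tensor<2xf32> to
    #hal.device.affinity<@__device_0>
```
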
IanWood1 (Contributor) commented Oct 10, 2024

I've been trying to fix a similar problem with tensor.collapse_shape/tensor.expand_shape here #18729

If this is a recurring problem, maybe there is a more systematic way to address this?

sogartar (Contributor, Author)

@IanWood1 This seems similar. I am not sure which is more appropriate: folding the reshape or moving it.
We may need a more general approach to constructing the new op with the folded shape. Some ops in the Flow dialect, for example, change their operands depending on the dynamic dimensions of the tensor, since the dynamic dimensions must be provided as operands as well. Injecting a constructor into the pattern would probably solve it; see the sketch below.
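
A minimal sketch of that constructor-injection idea (hypothetical names and signatures, not code from this PR), where each op kind supplies a callback that rebuilds itself against the post-transfer value:

```cpp
// Hypothetical: rebuilds `originalOp` so it consumes `newSource` instead of
// its old operand, recomputing any dynamic-dimension operands it carries.
using RebuildOpFn = std::function<Operation *(
    OpBuilder &builder, Location loc, Operation *originalOp, Value newSource)>;

// Hypothetical registration hook; the generic pattern would call `rebuildOp`
// instead of cloning the moved op blindly.
static void populateMoveAfterTransferWithRebuilder(RewritePatternSet &results,
                                                   StringRef movedOpName,
                                                   RebuildOpFn rebuildOp);

// Possible registration for tensor.cast, which carries no extra operands:
static void registerTensorCastMove(RewritePatternSet &results) {
  populateMoveAfterTransferWithRebuilder(
      results, tensor::CastOp::getOperationName(),
      [](OpBuilder &b, Location loc, Operation *original, Value newSource) {
        // Keep the original result type, swap in the new source value.
        return b
            .create<tensor::CastOp>(loc, original->getResult(0).getType(),
                                    newSource)
            .getOperation();
      });
}
```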

benvanik (Collaborator)

Folding is best whenever possible - the reshape/cast ops are only there to carry metadata and should be aggressively removed when the metadata they carry is not useful.

For example,

```mlir
%0 = flow.tensor.reshape %arg : tensor<1x2xf32> -> tensor<2x1xf32>
%1 = flow.tensor.transfer %0 : tensor<2x1xf32> to "target"
%2 = flow.tensor.reshape %1 : tensor<2x1xf32> -> tensor<1x2xf32>
```

should never exist - the canonical form would be:

```mlir
%2 = flow.tensor.transfer %arg : tensor<1x2xf32> to "target"
```

If propagating the reshapes/casts/etc. allows those to fold by producing an intermediate state like:

```mlir
%0 = flow.tensor.transfer %arg : tensor<1x2xf32> to "target"
%1 = flow.tensor.reshape %0 : tensor<1x2xf32> -> tensor<2x1xf32>
%2 = flow.tensor.reshape %1 : tensor<2x1xf32> -> tensor<1x2xf32>
```

and then the existing reshape-reshape canonicalizer kicks in, that's a good thing.


```cpp
// Unranked shapes are always considered to have more dynamic dimensions than
// ranked.
inline bool shapeHasLessOrEqualDynamicDimensions(ShapedType t1, ShapedType t2) {
```
Collaborator

YAGNI
On the 3rd or 4th time a 3-line block of code is used, it's worth hoisting it into a global shared util - but there's a bar for doing this: every shared util pollutes the project, adds a maintenance burden, and has to be worth it. A single use in a single location is not worth it. Just inline this as static where it is used.

(this would also need to be static here - you can't put inline functions in header files)

Contributor Author

I moved it.
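
For readers, a plausible shape of the resulting file-local helper (a sketch assuming the standard `ShapedType` API, not necessarily the PR's exact code), keeping the unranked-shapes-count-as-most-dynamic convention from the original comment:

```cpp
static bool shapeHasLessOrEqualDynamicDimensions(ShapedType t1, ShapedType t2) {
  // An unranked shape is treated as having more dynamic dimensions than any
  // ranked shape, so it is "less or equal" only to another unranked shape.
  if (!t1.hasRank())
    return !t2.hasRank();
  if (!t2.hasRank())
    return true;
  return t1.getNumDynamicDims() <= t2.getNumDynamicDims();
}
```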

```cpp
template <typename Op, unsigned homomorphismOpOperandIndex = 0,
          unsigned homomorphismOpResultIndex = 0>
static void populateMoveOpAfterTransferPattern(RewritePatternSet &results) {
```

Collaborator

Suggested change

Contributor Author

Done.

```cpp
// before the transfer instead. We strive to reduce the dynamism of the
// transfer op. If there will be no strict dynamism improvement, we prefer the
// other op after the transfer.
// TODO: add the analogous move-befor-transfer pattern.
```
Collaborator

Suggested change:

```diff
- // TODO: add the analogous move-befor-transfer pattern.
+ // TODO: add the analogous move-before-transfer pattern.
```

(a spell checker can help in your IDE)

Contributor Author

Done. I have one, but it ignored the hyphenated word.

```cpp
} // namespace

void TensorTransferOp::getCanonicalizationPatterns(RewritePatternSet &results,
                                                   MLIRContext *context) {
  results.insert<ElideRedundantTransfer>(context);
  populateMoveOpAfterTransferPattern<tensor::BitcastOp>(results);
  populateMoveOpAfterTransferPattern<TensorBitCastOp>(results);
```
Collaborator

Suggested change:

```diff
- populateMoveOpAfterTransferPattern<TensorBitCastOp>(results);
+ populateMoveOpAfterTransferPattern<IREE::Flow::TensorBitCastOp>(results);
```

Use namespaced names where there may be ambiguity, to make it clearer to readers.

Contributor Author

Done.

```cpp
  populateMoveOpAfterTransferPattern<TensorBitCastOp>(results);
  populateMoveOpAfterTransferPattern<tensor::CastOp>(results);
  populateMoveOpAfterTransferPattern<tensor::ReshapeOp>(results);
  populateMoveOpAfterTransferPattern<TensorReshapeOp>(results);
```
Collaborator

Suggested change:

```diff
- populateMoveOpAfterTransferPattern<TensorReshapeOp>(results);
+ populateMoveOpAfterTransferPattern<IREE::Flow::TensorReshapeOp>(results);
```

Contributor Author

Done.
