[mlir][amdgpu] Support for 8bit extf for 0d vector type #126102

pashu123 · 2025-02-06T18:00:26Z

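The `ExtFOnFloat8RewritePattern` rewrite crashes on 0-D vector types. This patch builds the `vector.insert` destination as a zero splat of the 0-D result type instead of a scalar zero, and adds a 0-D test.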

llvmbot · 2025-02-06T18:00:58Z

@llvm/pr-subscribers-mlir
@llvm/pr-subscribers-mlir-gpu
@llvm/pr-subscribers-backend-amdgpu

Author: Prashant Kumar (pashu123)

Changes

For 0d vector type the rewrite crashes.

Full diff: https://github.com/llvm/llvm-project/pull/126102.diff

2 Files Affected:

(modified) mlir/lib/Conversion/ArithToAMDGPU/ArithToAMDGPU.cpp (+6-3)
(modified) mlir/test/Conversion/ArithToAMDGPU/8-bit-floats.mlir (+13-1)

diff --git a/mlir/lib/Conversion/ArithToAMDGPU/ArithToAMDGPU.cpp b/mlir/lib/Conversion/ArithToAMDGPU/ArithToAMDGPU.cpp
index 33370566996eee5..60a002c41bfb2f3 100644
--- a/mlir/lib/Conversion/ArithToAMDGPU/ArithToAMDGPU.cpp
+++ b/mlir/lib/Conversion/ArithToAMDGPU/ArithToAMDGPU.cpp
@@ -102,20 +102,23 @@ void ExtFOnFloat8RewritePattern::rewrite(arith::ExtFOp op,
     return rewriter.replaceOp(op, result);
   }
   int64_t numElements = inType.getNumElements();
+
   Value zero = rewriter.create<arith::ConstantOp>(
       loc, outElemType, rewriter.getFloatAttr(outElemType, 0.0));
+  VectorType outType = cast<VectorType>(op.getOut().getType());
+
   if (inType.getShape().empty()) {
+    Value zerodSplat =
+        rewriter.createOrFold<vector::SplatOp>(loc, outType, zero);
     Value scalarIn =
         rewriter.create<vector::ExtractOp>(loc, in, ArrayRef<int64_t>{});
-    // Recurse to send the 0-D vector case to the 1-D vector case
     Value scalarExt =
         rewriter.create<arith::ExtFOp>(loc, outElemType, scalarIn);
-    Value result = rewriter.create<vector::InsertOp>(loc, scalarExt, zero,
+    Value result = rewriter.create<vector::InsertOp>(loc, scalarExt, zerodSplat,
                                                      ArrayRef<int64_t>{});
     return rewriter.replaceOp(op, result);
   }
 
-  VectorType outType = cast<VectorType>(op.getOut().getType());
   VectorType flatTy = VectorType::get(SmallVector<int64_t>{numElements},
                                       outType.getElementType());
   Value result = rewriter.createOrFold<vector::SplatOp>(loc, flatTy, zero);
diff --git a/mlir/test/Conversion/ArithToAMDGPU/8-bit-floats.mlir b/mlir/test/Conversion/ArithToAMDGPU/8-bit-floats.mlir
index bd90facb6154408..985fb532ea74ad3 100644
--- a/mlir/test/Conversion/ArithToAMDGPU/8-bit-floats.mlir
+++ b/mlir/test/Conversion/ArithToAMDGPU/8-bit-floats.mlir
@@ -10,7 +10,19 @@ func.func @scalar_ext(%v: f8E5M2FNUZ) -> f16 {
   return %w : f16
 }
 
-// No 0-D test because arith.extf hasn't been extended to support it.
+// -----
+
+// CHECK-LABEL: func.func @vector_zero_d(
+// CHECK-SAME:   %[[ARG0:[a-zA-Z0-9_]+]]: vector<f8E5M2FNUZ>) -> vector<f32>
+// CHECK: %[[CONST:.+]] = arith.constant dense<0.000000e+00> : vector<f32>
+// CHECK: %[[EXTRACT:.+]] = vector.extract %[[ARG0]][] : f8E5M2FNUZ from vector<f8E5M2FNUZ>
+// CHECK: %[[CONVERT:.+]] = amdgpu.ext_packed_fp8 %[[EXTRACT]][0] : f8E5M2FNUZ to f32
+// CHECK: %[[RESULT:.+]] = vector.insert %[[CONVERT]], %[[CONST]] [] : f32 into vector<f32>
+// CHECK: return %[[RESULT]] : vector<f32>
+func.func @vector_zero_d(%v: vector<f8E5M2FNUZ>) -> vector<f32> {
+  %w = arith.extf %v : vector<f8E5M2FNUZ> to vector<f32>
+  return %w : vector<f32>
+}
 
 // -----
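
A note on why the 0-D branch is keyed on `inType.getShape().empty()` rather than on `numElements`: a 0-D vector has an empty shape, and its element count is the product over that shape, which is 1 for an empty product. A 0-D vector and a one-element 1-D vector therefore report the same count. The sketch below is a standalone C++ model, not MLIR code; `numElements`, `isZeroD`, and `demo` are hypothetical helpers that only mirror the behavior of `getNumElements()` and the shape check in the diff.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <numeric>
#include <vector>

// Hypothetical stand-in for VectorType::getNumElements(): the product of all
// dimensions. The product over an empty shape is 1.
int64_t numElements(const std::vector<int64_t> &shape) {
  return std::accumulate(shape.begin(), shape.end(), int64_t{1},
                         std::multiplies<int64_t>());
}

// Hypothetical stand-in for the `inType.getShape().empty()` check in the diff.
bool isZeroD(const std::vector<int64_t> &shape) { return shape.empty(); }

// Both shapes hold exactly one element, but only the first one is 0-D.
inline void demo() {
  std::vector<int64_t> zeroD = {};  // models vector<f8E5M2FNUZ>
  std::vector<int64_t> oneD = {1};  // models vector<1xf8E5M2FNUZ>
  assert(numElements(zeroD) == numElements(oneD));
  assert(isZeroD(zeroD) && !isZeroD(oneD));
}
```

Calling `demo()` passes both assertions, which illustrates why the element count alone cannot select the 0-D path and the rank has to be checked through the shape.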

krzysz00

Approved, thanks. Can you get truncf in a separate PR?

[mlir][amdgpu] Support for 8bit extf for 0d vector type

6da4acd

For 0d vector type the rewrite crashes.

pashu123 requested a review from krzysz00 February 6, 2025 18:00

llvmbot added backend:AMDGPU mlir:gpu mlir labels Feb 6, 2025

pashu123 mentioned this pull request Feb 6, 2025

[ROCm][Codegen] llama 8b fp8 with attention segfault iree-org/iree#19921

Closed

krzysz00 approved these changes Feb 6, 2025

pashu123 merged commit 97b08b8 into llvm:main Feb 6, 2025
12 checks passed

Icohedron pushed a commit to Icohedron/llvm-project that referenced this pull request Feb 11, 2025

[mlir][amdgpu] Support for 8bit extf for 0d vector type (llvm#126102)

991397c

For 0d vector type the rewrite crashes.
