Commit d4c3fa6

Remove the operand promotion for LHS and RHS.
For unaligned matmul cases, operand promotion leads to a dynamic trip count, which prevents iree-codegen-gpu-fuse-and-hoist-parallel-loops from fusing the forall loops.
1 parent 54b9e29 commit d4c3fa6

1 file changed: +0 -2 lines changed
compiler/src/iree/compiler/Codegen/LLVMGPU/KernelConfig.cpp
@@ -1346,8 +1346,6 @@ static LogicalResult setContractConfig(IREE::GPU::TargetAttr target,
   attrs.emplace_back(b.getStringAttr("reduction"),
                      b.getI64ArrayAttr(reductionTileSizes));
 
-  // Promote operands to use shared memory for LHS and RHS.
-  IREE::GPU::setPromotedOperandList(context, attrs, {0, 1});
   auto configDict = b.getDictionaryAttr(attrs);
   auto loweringConfig =
       IREE::GPU::LoweringConfigAttr::get(context, configDict);
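
The commit message does not spell out why unaligned shapes produce a dynamic trip count. One plausible reading, offered here only as an illustration, is that when a tile size does not evenly divide a matmul dimension, the last tile is partial, so the extent of any per-tile copy depends on the tile index rather than being a single constant. The sketch below shows that arithmetic in isolation. The helpers `numTiles`, `tileExtent`, and `isAligned` are hypothetical names and are not part of IREE.

```cpp
#include <cstdint>

// Hypothetical helper, not an IREE API: number of tiles needed to cover a
// dimension of size `dim` with tiles of size `tile` (ceiling division).
constexpr int64_t numTiles(int64_t dim, int64_t tile) {
  return (dim + tile - 1) / tile;
}

// Hypothetical helper: extent of the tile at index `i`, i.e.
// min(tile, dim - i * tile). For an unaligned dimension this value
// differs for the last tile, so it is not one constant for all tiles.
constexpr int64_t tileExtent(int64_t dim, int64_t tile, int64_t i) {
  const int64_t remaining = dim - i * tile;
  return remaining < tile ? remaining : tile;
}

// Hypothetical helper: a dimension is "aligned" (in the assumed sense) when
// the tile size divides it evenly, so every tile has the same extent.
constexpr bool isAligned(int64_t dim, int64_t tile) {
  return dim % tile == 0;
}
```

Under this assumption, a dimension of 128 tiled by 64 yields two full tiles, while a dimension of 100 tiled by 64 yields a full tile followed by a partial tile of 36 elements.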