[AMDGPU] Use the SchedModel available in SIInstrInfo #110859

jmmartinez · 2024-10-02T15:05:23Z

Instead of allocating an initializing a new instance in GCNHazardRecognizer and AMDGPUInsertDelayAlu.

Instead of allocating an initializing a new instance in GCNHazardRecognizer and AMDGPUInsertDelayAlu.

llvmbot · 2024-10-02T15:05:44Z

@llvm/pr-subscribers-backend-amdgpu

Author: Juan Manuel Martinez Caamaño (jmmartinez)

Changes

Instead of allocating an initializing a new instance in GCNHazardRecognizer and AMDGPUInsertDelayAlu.

Full diff: https://github.com/llvm/llvm-project/pull/110859.diff

3 Files Affected:

(modified) llvm/lib/Target/AMDGPU/AMDGPUInsertDelayAlu.cpp (+3-4)
(modified) llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp (+2-2)
(modified) llvm/lib/Target/AMDGPU/GCNHazardRecognizer.h (+1-1)

diff --git a/llvm/lib/Target/AMDGPU/AMDGPUInsertDelayAlu.cpp b/llvm/lib/Target/AMDGPU/AMDGPUInsertDelayAlu.cpp
index 7619a39bac9c14..3f2bb5df8836bb 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUInsertDelayAlu.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPUInsertDelayAlu.cpp
@@ -30,7 +30,7 @@ class AMDGPUInsertDelayAlu : public MachineFunctionPass {
   const SIInstrInfo *SII;
   const TargetRegisterInfo *TRI;
 
-  TargetSchedModel SchedModel;
+  const TargetSchedModel *SchedModel;
 
   AMDGPUInsertDelayAlu() : MachineFunctionPass(ID) {}
 
@@ -387,7 +387,7 @@ class AMDGPUInsertDelayAlu : public MachineFunctionPass {
       if (Type != OTHER) {
         // TODO: Scan implicit defs too?
         for (const auto &Op : MI.defs()) {
-          unsigned Latency = SchedModel.computeOperandLatency(
+          unsigned Latency = SchedModel->computeOperandLatency(
               &MI, Op.getOperandNo(), nullptr, 0);
           for (MCRegUnit Unit : TRI->regunits(Op.getReg()))
             State[Unit] = DelayInfo(Type, Latency);
@@ -429,8 +429,7 @@ class AMDGPUInsertDelayAlu : public MachineFunctionPass {
 
     SII = ST.getInstrInfo();
     TRI = ST.getRegisterInfo();
-
-    SchedModel.init(&ST);
+    SchedModel = &SII->getSchedModel();
 
     // Calculate the delay state for each basic block, iterating until we reach
     // a fixed point.
diff --git a/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp b/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp
index cc39fd1740683f..44afccb0690d0d 100644
--- a/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp
+++ b/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp
@@ -59,10 +59,10 @@ static bool shouldRunLdsBranchVmemWARHazardFixup(const MachineFunction &MF,
 GCNHazardRecognizer::GCNHazardRecognizer(const MachineFunction &MF)
     : IsHazardRecognizerMode(false), CurrCycleInstr(nullptr), MF(MF),
       ST(MF.getSubtarget<GCNSubtarget>()), TII(*ST.getInstrInfo()),
-      TRI(TII.getRegisterInfo()), UseVALUReadHazardExhaustiveSearch(false),
+      TRI(TII.getRegisterInfo()), TSchedModel(TII.getSchedModel()),
+      UseVALUReadHazardExhaustiveSearch(false),
       ClauseUses(TRI.getNumRegUnits()), ClauseDefs(TRI.getNumRegUnits()) {
   MaxLookAhead = MF.getRegInfo().isPhysRegUsed(AMDGPU::AGPR0) ? 19 : 5;
-  TSchedModel.init(&ST);
   RunLdsBranchVmemWARHazardFixup = shouldRunLdsBranchVmemWARHazardFixup(MF, ST);
 }
 
diff --git a/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.h b/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.h
index e840e2445188fb..adb2278c48eebe 100644
--- a/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.h
+++ b/llvm/lib/Target/AMDGPU/GCNHazardRecognizer.h
@@ -46,7 +46,7 @@ class GCNHazardRecognizer final : public ScheduleHazardRecognizer {
   const GCNSubtarget &ST;
   const SIInstrInfo &TII;
   const SIRegisterInfo &TRI;
-  TargetSchedModel TSchedModel;
+  const TargetSchedModel &TSchedModel;
   bool RunLdsBranchVmemWARHazardFixup;
   BitVector VALUReadHazardSGPRs;
   bool UseVALUReadHazardExhaustiveSearch;

jayfoad

LGTM, thanks.

arsenm

LGTM. I don't think we should have this copy in TargetInstrInfo, it's already present in the MCSubtargetInfo

Instead of allocating an initializing a new instance in `GCNHazardRecognizer` and `AMDGPUInsertDelayAlu`.

[AMDGPU] Use the SchedModel available in SIInstrInfo

7229527

Instead of allocating an initializing a new instance in GCNHazardRecognizer and AMDGPUInsertDelayAlu.

jmmartinez added the backend:AMDGPU label Oct 2, 2024

jmmartinez requested a review from arsenm October 2, 2024 15:05

jmmartinez self-assigned this Oct 2, 2024

jmmartinez mentioned this pull request Oct 2, 2024

[AMDGPU][SIPreEmitPeephole] mustRetainExeczBranch: use BranchProbability and TargetSchedmodel #109818

Merged

jayfoad approved these changes Oct 2, 2024

View reviewed changes

arsenm approved these changes Oct 2, 2024

View reviewed changes

jmmartinez merged commit d617371 into llvm:main Oct 2, 2024
8 of 10 checks passed

xgupta pushed a commit to xgupta/llvm-project that referenced this pull request Oct 4, 2024

[AMDGPU] Use the SchedModel available in SIInstrInfo (llvm#110859)

9b56402

Instead of allocating an initializing a new instance in `GCNHazardRecognizer` and `AMDGPUInsertDelayAlu`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Use the SchedModel available in SIInstrInfo #110859

[AMDGPU] Use the SchedModel available in SIInstrInfo #110859

jmmartinez commented Oct 2, 2024

llvmbot commented Oct 2, 2024

jayfoad left a comment

arsenm left a comment

[AMDGPU] Use the SchedModel available in SIInstrInfo #110859

[AMDGPU] Use the SchedModel available in SIInstrInfo #110859

Conversation

jmmartinez commented Oct 2, 2024

llvmbot commented Oct 2, 2024

jayfoad left a comment

Choose a reason for hiding this comment

arsenm left a comment

Choose a reason for hiding this comment