Implicit Exec Mask Operand Missing From MCInstrDesc of Some Opcodes in AMDGPU backend #89830

matinraayai · 2024-04-23T21:09:47Z

The following Opcodes do not have AMDGPU::EXEC mask as an implicit operand in their MC Instr Desc:
GLOBAL_STORE_DWORD_vi
BUFFER_LOAD_DWORD_OFFEN_vi
GLOBAL_STORE_DWORD_SADDR_vi
The issue appears when constructing a llvm::MachineInstr from a valid llvm::MCInst, in the same manner as llvm-exegesis here:

llvm-project/llvm/tools/llvm-exegesis/lib/Assembler.cpp

Lines 157 to 176 in 859de94

    
             MachineInstrBuilder Builder = BuildMI(MBB, DL, MCID); 
        
             for (unsigned OpIndex = 0, E = Inst.getNumOperands(); OpIndex < E; 
        
                  ++OpIndex) { 
        
               const MCOperand &Op = Inst.getOperand(OpIndex); 
        
               if (Op.isReg()) { 
        
                 const bool IsDef = OpIndex < MCID.getNumDefs(); 
        
                 unsigned Flags = 0; 
        
                 const MCOperandInfo &OpInfo = MCID.operands().begin()[OpIndex]; 
        
                 if (IsDef && !OpInfo.isOptionalDef()) 
        
                   Flags |= RegState::Define; 
        
                 Builder.addReg(Op.getReg(), Flags); 
        
               } else if (Op.isImm()) { 
        
                 Builder.addImm(Op.getImm()); 
        
               } else if (!Op.isValid()) { 
        
                 llvm_unreachable("Operand is not set"); 
        
               } else { 
        
                 llvm_unreachable("Not yet implemented"); 
        
               } 
        
             } 
        
           }

For the MIR to be correct, simply adding the explicit operands should be enough, as the implicit operands are automatically added according to the MCInstrDesc when calling the llvm::BuildMI here:

llvm-project/llvm/tools/llvm-exegesis/lib/Assembler.cpp

Line 157 in 859de94

MachineInstrBuilder Builder = BuildMI(MBB, DL, MCID);

This method will call the llvm::MachineInstr constructor with NoImplicit flag set to false:

llvm-project/llvm/lib/CodeGen/MachineInstr.cpp

Lines 98 to 114 in 859de94

    
           MachineInstr::MachineInstr(MachineFunction &MF, const MCInstrDesc &TID, 
        
                                      DebugLoc DL, bool NoImp) 
        
               : MCID(&TID), NumOperands(0), Flags(0), AsmPrinterFlags(0), 
        
                 DbgLoc(std::move(DL)), DebugInstrNum(0) { 
        
             assert(DbgLoc.hasTrivialDestructor() && "Expected trivial destructor"); 
        
             // Reserve space for the expected number of operands. 
        
             if (unsigned NumOps = MCID->getNumOperands() + MCID->implicit_defs().size() + 
        
                                   MCID->implicit_uses().size()) { 
        
               CapOperands = OperandCapacity::get(NumOps); 
        
               Operands = MF.allocateOperandArray(CapOperands); 
        
             } 
        
             if (!NoImp) 
        
               addImplicitDefUseOperands(MF); 
        
           }

However, when printing the constructed MIR, the implicit exec is nowhere to be seen:

GLOBAL_STORE_DWORD_vi $vgpr0_vgpr1, $vgpr2, 0, 0
BUFFER_LOAD_DWORD_OFFEN_vi $vgpr3, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 60, 0, 0
BUFFER_STORE_DWORD_OFFEN_vi $vgpr1, $vgpr3, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 84, 0, 0

This causes issues when these instructions get verified before running CodeGen passes.

CC: @arsenm @kzhuravl

The text was updated successfully, but these errors were encountered:

llvmbot · 2024-04-23T21:32:27Z

@llvm/issue-subscribers-backend-amdgpu

Author: Matin Raayai (matinraayai)

The following Opcodes do not have `AMDGPU::EXEC` mask as an implicit operand in their MC Instr Desc: `GLOBAL_STORE_DWORD_vi` `BUFFER_LOAD_DWORD_OFFEN_vi` `GLOBAL_STORE_DWORD_SADDR_vi` The issue appears when constructing a `llvm::MachineInstr` from a valid `llvm::MCInst`, in the same manner as llvm-exegesis here: https://github.com/llvm/llvm-project/blob/859de94536425376244940e190e069a09d797737/llvm/tools/llvm-exegesis/lib/Assembler.cpp#L157-L176 For the MIR to be correct, simply adding the explicit operands should be enough, as the implicit operands are automatically added according to the MCInstrDesc when calling the `llvm::BuildMI` here: https://github.com/llvm/llvm-project/blob/859de94536425376244940e190e069a09d797737/llvm/tools/llvm-exegesis/lib/Assembler.cpp#L157 This method will call the `llvm::MachineInstr` constructor with `NoImplicit` flag set to `false`:

llvm-project/llvm/lib/CodeGen/MachineInstr.cpp

Lines 98 to 114 in 859de94

    
           MachineInstr::MachineInstr(MachineFunction &MF, const MCInstrDesc &TID, 
        
                                      DebugLoc DL, bool NoImp) 
        
               : MCID(&TID), NumOperands(0), Flags(0), AsmPrinterFlags(0), 
        
                 DbgLoc(std::move(DL)), DebugInstrNum(0) { 
        
             assert(DbgLoc.hasTrivialDestructor() && "Expected trivial destructor"); 
        
             // Reserve space for the expected number of operands. 
        
             if (unsigned NumOps = MCID->getNumOperands() + MCID->implicit_defs().size() + 
        
                                   MCID->implicit_uses().size()) { 
        
               CapOperands = OperandCapacity::get(NumOps); 
        
               Operands = MF.allocateOperandArray(CapOperands); 
        
             } 
        
             if (!NoImp) 
        
               addImplicitDefUseOperands(MF); 
        
           }

However, when printing the constructed MIR, the implicit exec is nowhere to be seen:

GLOBAL_STORE_DWORD_vi $vgpr0_vgpr1, $vgpr2, 0, 0
BUFFER_LOAD_DWORD_OFFEN_vi $vgpr3, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 60, 0, 0
BUFFER_STORE_DWORD_OFFEN_vi $vgpr1, $vgpr3, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 84, 0, 0

This causes issues when these instructions get verified before running CodeGen passes.

CC: @arsenm @kzhuravl

jayfoad · 2024-04-24T08:29:58Z

This causes issues when these instructions get verified before running CodeGen passes.

Why are you trying to run CodeGen passes and MachineVerifier on Real instructions like GLOBAL_STORE_DWORD_vi?

In normal CodeGen, all the passes run on Pseudo instructions like GLOBAL_STORE_DWORD, which does have the implicit use of EXEC. These Pseudo instructions are only converted to the corresponding Real instruction for the appropriate subtarget as part of final code emission.

matinraayai · 2024-04-24T14:52:50Z

@jayfoad the main reason is instrumentation. We use MachinePasses to leverage things like the AsmPrinter and pseudo instructions to make our job easier.
Now I can, as you said, somehow convert these opcode to their pseudo equivalent; However:

I know there's a way to convert pseudo to MC, I don't know if there's a way to go other way around.
I want to keep the lowered MC operands. There's no point adding an extra conversion step here.
I've encountered other real instructions that have the EXEC mask as an implicit operand. One example is V_MOV_B32_e32_vi. This seems inconsistent with only pseudo instructions having the implicit operands modeled.

IMO even if the normal CodeGen won't encounter real opcodes, that doesn't mean MC shouldn't model the implicit operands correctly. This is especially true if someone is using MC to analyze a binary; They shouldn't need to convert the opcode to its pseudo equivalent first just to get the correctly modeled behavior.

jayfoad · 2024-04-24T15:05:50Z

OK. I certainly have no objection to fixing the implicit uses/defs on Real instructions to match the corresponding Pseudo.

arsenm · 2024-04-24T15:07:28Z

Do the real instructions just not copy the Uses and Defs from the parent pseudo?

Currently, the tablegen files that generate the instruction definitions in lib/Target/AMDGPU/AMDGPUGenInstrInfo.inc often only include implicit operands for the architecture-independent pseudo instructions, but not for the corresponding real instructions. The missing implicit operands (most prominently: the EXEC mask) do not affect code generation, since that operates on pseudo instructions, but they are problematic when working with real instructions, e.g., as a decoding result from the MC layer. This patch copies the implicit Defs and Uses from pseudo instructions to the corresponding real instructions, so that implicit operands are also defined for real instructions. Addresses issue llvm#89830.

ritter-x2a · 2024-05-22T08:25:21Z

Do the real instructions just not copy the Uses and Defs from the parent pseudo?

Currently, that's not the case for many instructions. This PR changes that: #93004

@matinraayai : feel free to try the commit and check if it solves your problem.

Currently, the tablegen files that generate the instruction definitions in lib/Target/AMDGPU/AMDGPUGenInstrInfo.inc often only include implicit operands for the architecture-independent pseudo instructions, but not for the corresponding real instructions. The missing implicit operands (most prominently: the EXEC mask) do not affect code generation, since that operates on pseudo instructions, but they are problematic when working with real instructions, e.g., as a decoding result from the MC layer. This patch copies the implicit Defs and Uses from pseudo instructions to the corresponding real instructions, so that implicit operands are also defined for real instructions. Addresses issue #89830.

ritter-x2a · 2024-06-04T13:36:42Z

Closing this, as the merged PR #93004 should fix the issue.

github-actions bot added the new issue label Apr 23, 2024

EugeneZelenko added backend:AMDGPU and removed new issue labels Apr 23, 2024

kzhuravl assigned ritter-x2a Apr 23, 2024

ritter-x2a mentioned this issue May 22, 2024

[AMDGPU] Copy Defs and Uses from Pseudo to Real Instructions #93004

Merged

ritter-x2a closed this as completed Jun 4, 2024

matinraayai mentioned this issue Jul 23, 2024

Copy flt_scr and Any Remaining Pseudo Op Flags to their Real Counterparts #100187

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implicit Exec Mask Operand Missing From MCInstrDesc of Some Opcodes in AMDGPU backend #89830

Implicit Exec Mask Operand Missing From MCInstrDesc of Some Opcodes in AMDGPU backend #89830

matinraayai commented Apr 23, 2024

llvmbot commented Apr 23, 2024

jayfoad commented Apr 24, 2024

matinraayai commented Apr 24, 2024

jayfoad commented Apr 24, 2024

arsenm commented Apr 24, 2024

ritter-x2a commented May 22, 2024

ritter-x2a commented Jun 4, 2024

Implicit Exec Mask Operand Missing From MCInstrDesc of Some Opcodes in AMDGPU backend #89830

Implicit Exec Mask Operand Missing From MCInstrDesc of Some Opcodes in AMDGPU backend #89830

Comments

matinraayai commented Apr 23, 2024

llvmbot commented Apr 23, 2024

jayfoad commented Apr 24, 2024

matinraayai commented Apr 24, 2024

jayfoad commented Apr 24, 2024

arsenm commented Apr 24, 2024

ritter-x2a commented May 22, 2024

ritter-x2a commented Jun 4, 2024