Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AVR] Fix parsing & emitting relative jumps #102936

Closed
wants to merge 2 commits into from

Conversation

Patryk27
Copy link
Contributor

@Patryk27 Patryk27 commented Aug 12, 2024

Ever since 6859685 (or, precisely, 84428da) relative jumps emitted by the AVR codegen are off by two bytes - this pull request fixes it.

Abstract

As compared to absolute jumps, relative jumps - such as rjmp, rcall or brsh - have an implied pc+2 behavior; that is, jmp 100 is pc = 100, but rjmp 100 gets understood as pc = pc + 100 + 2.

This is not reflected in the AVR codegen:

static void adjustRelativeBranch(unsigned Size, const MCFixup &Fixup,

... which always emits relative jumps that are two bytes too far - or rather it would emit such jumps if not for this check:

... which causes most of the relative jumps to be actually resolved late, by the linker, which applies the offsetting logic on its own, hiding the issue within LLVM.

Some time ago we've had a similar "jumps are off" problem that got solved by touching shouldForceRelocation(), but I think that has worked only by accident. It's exploited the fact that absolute vs relative jumps in the parsed assembly can be distinguished through a "side channel" check relying on the existence of labels (i.e. absolute jumps happen to named labels, but relative jumps are anonymous, so to say). This was an alright idea back then, but it got broken by 6859685.

I propose a different approach:

  • when emitting relative jumps, offset them by -2 (well, -1, strictly speaking, because those instructions rely on right-shifted offset),
  • when parsing relative jumps, treat . as +2 and read rjmp .+1234 as rjmp (1234 + 2).

This approach seems to be sound and now we generate the same assembly as avr-gcc, which can be confirmed with:

// avr-gcc test.c -O3 && avr-objdump -d a.out

int main() {
    asm(
"      foo:\n\t"
"        rjmp  .+2\n\t"
"        rjmp  .-2\n\t"
"        rjmp  foo\n\t"
"        rjmp  .+8\n\t"
"        rjmp  end\n\t"
"        rjmp  .+0\n\t"
"      end:\n\t"
"        rjmp .-4\n\t"
"        rjmp .-6\n\t"
"      x:\n\t"
"        rjmp x\n\t"
"        .short 0xc00f\n\t"
);
}

avr-gcc is also how I got the opcodes for all new tests like inst-brbc.s, so we should be good.

Seizing the day, I've reformatted a couple of other tests (this change got extracted into its own commit).

Closes #102436.

@llvmbot llvmbot added the mc Machine (object) code label Aug 19, 2024
@llvmbot
Copy link
Member

llvmbot commented Aug 19, 2024

@llvm/pr-subscribers-mc

Author: Patryk Wychowaniec (Patryk27)

Changes

AVR relative jumps are encoded relative to the end of the current instruction (i.e. rjmp 0 skips one instruction forward), while LLVM by default seems to assume jumps happen at the beginning of the instruction - fixing this requires offseting them by -2, which is what this commit does.

(note that strictly speaking we have to offset just -1, because the jump's encoding utilizes fact that AVR instructions take two bytes and so it's already divided by two.)

This wasn't a problem up until 6859685 (or rather 84428da), because the AVR backend used to encode all jumps (including those between basic blocks) using relocations, so the problematic code path simply wasn't triggered.

Closes #102436.


Patch is 83.55 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/102936.diff

108 Files Affected:

  • (modified) llvm/lib/Target/AVR/AsmParser/AVRAsmParser.cpp (+11-4)
  • (modified) llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp (+5-6)
  • (added) llvm/test/CodeGen/AVR/jmp.ll (+24)
  • (modified) llvm/test/MC/AVR/inst-adc.s (-2)
  • (modified) llvm/test/MC/AVR/inst-add.s (-2)
  • (modified) llvm/test/MC/AVR/inst-adiw.s (-2)
  • (modified) llvm/test/MC/AVR/inst-and.s (-2)
  • (modified) llvm/test/MC/AVR/inst-andi.s (-2)
  • (modified) llvm/test/MC/AVR/inst-asr.s (-2)
  • (modified) llvm/test/MC/AVR/inst-bld.s (-2)
  • (modified) llvm/test/MC/AVR/inst-brbc.s (+12-11)
  • (modified) llvm/test/MC/AVR/inst-brbs.s (+12-11)
  • (added) llvm/test/MC/AVR/inst-brcc.s (+28)
  • (added) llvm/test/MC/AVR/inst-brcs.s (+28)
  • (modified) llvm/test/MC/AVR/inst-break.s (-2)
  • (added) llvm/test/MC/AVR/inst-breq.s (+28)
  • (added) llvm/test/MC/AVR/inst-brge.s (+24)
  • (added) llvm/test/MC/AVR/inst-brhc.s (+24)
  • (added) llvm/test/MC/AVR/inst-brhs.s (+24)
  • (added) llvm/test/MC/AVR/inst-brid.s (+24)
  • (added) llvm/test/MC/AVR/inst-brie.s (+24)
  • (added) llvm/test/MC/AVR/inst-brlo.s (+24)
  • (added) llvm/test/MC/AVR/inst-brlt.s (+24)
  • (added) llvm/test/MC/AVR/inst-brmi.s (+24)
  • (added) llvm/test/MC/AVR/inst-brne.s (+28)
  • (added) llvm/test/MC/AVR/inst-brpl.s (+24)
  • (added) llvm/test/MC/AVR/inst-brsh.s (+24)
  • (added) llvm/test/MC/AVR/inst-brtc.s (+24)
  • (added) llvm/test/MC/AVR/inst-brts.s (+24)
  • (added) llvm/test/MC/AVR/inst-brvc.s (+24)
  • (added) llvm/test/MC/AVR/inst-brvs.s (+24)
  • (modified) llvm/test/MC/AVR/inst-bst.s (-2)
  • (modified) llvm/test/MC/AVR/inst-call.s (-2)
  • (modified) llvm/test/MC/AVR/inst-cbi.s (-2)
  • (modified) llvm/test/MC/AVR/inst-cbr.s (-2)
  • (modified) llvm/test/MC/AVR/inst-clr.s (-2)
  • (modified) llvm/test/MC/AVR/inst-com.s (-2)
  • (modified) llvm/test/MC/AVR/inst-cp.s (-2)
  • (modified) llvm/test/MC/AVR/inst-cpc.s (-2)
  • (modified) llvm/test/MC/AVR/inst-cpi.s (-2)
  • (modified) llvm/test/MC/AVR/inst-cpse.s (-2)
  • (modified) llvm/test/MC/AVR/inst-dec.s (-2)
  • (modified) llvm/test/MC/AVR/inst-des.s (-2)
  • (modified) llvm/test/MC/AVR/inst-eicall.s (-2)
  • (modified) llvm/test/MC/AVR/inst-eijmp.s (-2)
  • (modified) llvm/test/MC/AVR/inst-elpm.s (-2)
  • (modified) llvm/test/MC/AVR/inst-eor.s (-2)
  • (removed) llvm/test/MC/AVR/inst-family-cond-branch.s (-321)
  • (modified) llvm/test/MC/AVR/inst-fmul.s (-2)
  • (modified) llvm/test/MC/AVR/inst-fmuls.s (-2)
  • (modified) llvm/test/MC/AVR/inst-fmulsu.s (-2)
  • (modified) llvm/test/MC/AVR/inst-icall.s (-2)
  • (modified) llvm/test/MC/AVR/inst-ijmp.s (-2)
  • (modified) llvm/test/MC/AVR/inst-in.s (-2)
  • (modified) llvm/test/MC/AVR/inst-inc.s (-2)
  • (modified) llvm/test/MC/AVR/inst-jmp.s (-2)
  • (modified) llvm/test/MC/AVR/inst-lac.s (-2)
  • (modified) llvm/test/MC/AVR/inst-las.s (-2)
  • (modified) llvm/test/MC/AVR/inst-lat.s (-2)
  • (modified) llvm/test/MC/AVR/inst-ld.s (-2)
  • (modified) llvm/test/MC/AVR/inst-ldi.s (-1)
  • (modified) llvm/test/MC/AVR/inst-lds.s (-1)
  • (modified) llvm/test/MC/AVR/inst-lpm.s (-2)
  • (modified) llvm/test/MC/AVR/inst-lsl.s (-2)
  • (modified) llvm/test/MC/AVR/inst-lsr.s (-2)
  • (modified) llvm/test/MC/AVR/inst-mov.s (-2)
  • (modified) llvm/test/MC/AVR/inst-movw.s (-2)
  • (modified) llvm/test/MC/AVR/inst-mul.s (-1)
  • (modified) llvm/test/MC/AVR/inst-muls.s (-2)
  • (modified) llvm/test/MC/AVR/inst-mulsu.s (-2)
  • (modified) llvm/test/MC/AVR/inst-nop.s (-2)
  • (modified) llvm/test/MC/AVR/inst-or.s (-1)
  • (modified) llvm/test/MC/AVR/inst-ori.s (-2)
  • (modified) llvm/test/MC/AVR/inst-out.s (-2)
  • (modified) llvm/test/MC/AVR/inst-pop.s (-2)
  • (modified) llvm/test/MC/AVR/inst-push.s (-2)
  • (modified) llvm/test/MC/AVR/inst-rcall.s (+17-16)
  • (modified) llvm/test/MC/AVR/inst-ret.s (-2)
  • (modified) llvm/test/MC/AVR/inst-reti.s (-2)
  • (modified) llvm/test/MC/AVR/inst-rjmp.s (+38-31)
  • (modified) llvm/test/MC/AVR/inst-rol.s (-2)
  • (modified) llvm/test/MC/AVR/inst-ror.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbc.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbci.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbi.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbic.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbis.s (-1)
  • (modified) llvm/test/MC/AVR/inst-sbiw.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbr.s (-1)
  • (modified) llvm/test/MC/AVR/inst-sbrc.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sbrs.s (-2)
  • (modified) llvm/test/MC/AVR/inst-ser.s (-1)
  • (modified) llvm/test/MC/AVR/inst-sleep.s (-2)
  • (modified) llvm/test/MC/AVR/inst-spm.s (-2)
  • (modified) llvm/test/MC/AVR/inst-st.s (-1)
  • (modified) llvm/test/MC/AVR/inst-std.s (-1)
  • (modified) llvm/test/MC/AVR/inst-sts.s (-2)
  • (modified) llvm/test/MC/AVR/inst-sub.s (-1)
  • (modified) llvm/test/MC/AVR/inst-subi.s (-1)
  • (modified) llvm/test/MC/AVR/inst-swap.s (-2)
  • (modified) llvm/test/MC/AVR/inst-tst.s (-2)
  • (modified) llvm/test/MC/AVR/inst-wdr.s (-2)
  • (modified) llvm/test/MC/AVR/inst-xch.s (-2)
  • (modified) llvm/test/MC/AVR/modifiers.s (-14)
  • (modified) llvm/test/MC/AVR/relocations.s (+1)
  • (modified) llvm/test/MC/AVR/separator.s (-1)
  • (modified) llvm/test/MC/AVR/syntax-reg-int-literal.s (-1)
  • (modified) llvm/test/MC/AVR/syntax-reg-pair.s (-1)
diff --git a/llvm/lib/Target/AVR/AsmParser/AVRAsmParser.cpp b/llvm/lib/Target/AVR/AsmParser/AVRAsmParser.cpp
index 383dfcc31117c1..c016b2dd91dc67 100644
--- a/llvm/lib/Target/AVR/AsmParser/AVRAsmParser.cpp
+++ b/llvm/lib/Target/AVR/AsmParser/AVRAsmParser.cpp
@@ -72,7 +72,7 @@ class AVRAsmParser : public MCTargetAsmParser {
   int parseRegisterName();
   int parseRegister(bool RestoreOnFailure = false);
   bool tryParseRegisterOperand(OperandVector &Operands);
-  bool tryParseExpression(OperandVector &Operands);
+  bool tryParseExpression(OperandVector &Operands, int64_t offset);
   bool tryParseRelocExpression(OperandVector &Operands);
   void eatComma();
 
@@ -418,7 +418,7 @@ bool AVRAsmParser::tryParseRegisterOperand(OperandVector &Operands) {
   return false;
 }
 
-bool AVRAsmParser::tryParseExpression(OperandVector &Operands) {
+bool AVRAsmParser::tryParseExpression(OperandVector &Operands, int64_t offset) {
   SMLoc S = Parser.getTok().getLoc();
 
   if (!tryParseRelocExpression(Operands))
@@ -437,6 +437,11 @@ bool AVRAsmParser::tryParseExpression(OperandVector &Operands) {
   if (getParser().parseExpression(Expression))
     return true;
 
+  if (offset) {
+    Expression = MCBinaryExpr::createAdd(
+        Expression, MCConstantExpr::create(offset, getContext()), getContext());
+  }
+
   SMLoc E = SMLoc::getFromPointer(Parser.getTok().getLoc().getPointer() - 1);
   Operands.push_back(AVROperand::CreateImm(Expression, S, E));
   return false;
@@ -529,8 +534,9 @@ bool AVRAsmParser::parseOperand(OperandVector &Operands, bool maybeReg) {
     [[fallthrough]];
   case AsmToken::LParen:
   case AsmToken::Integer:
+    return tryParseExpression(Operands, 0);
   case AsmToken::Dot:
-    return tryParseExpression(Operands);
+    return tryParseExpression(Operands, 2);
   case AsmToken::Plus:
   case AsmToken::Minus: {
     // If the sign preceeds a number, parse the number,
@@ -540,7 +546,7 @@ bool AVRAsmParser::parseOperand(OperandVector &Operands, bool maybeReg) {
     case AsmToken::BigNum:
     case AsmToken::Identifier:
     case AsmToken::Real:
-      if (!tryParseExpression(Operands))
+      if (!tryParseExpression(Operands, 0))
         return false;
       break;
     default:
@@ -643,6 +649,7 @@ bool AVRAsmParser::ParseInstruction(ParseInstructionInfo &Info,
     // These specific operands should be treated as addresses/symbols/labels,
     // other than registers.
     bool maybeReg = true;
+
     if (OperandNum == 1) {
       std::array<StringRef, 8> Insts = {"lds", "adiw", "sbiw", "ldi"};
       for (auto Inst : Insts) {
diff --git a/llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp b/llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp
index 0d29912bee2646..46bef75697fcb2 100644
--- a/llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp
+++ b/llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp
@@ -94,6 +94,9 @@ static void adjustRelativeBranch(unsigned Size, const MCFixup &Fixup,
 
   // Rightshifts the value by one.
   AVR::fixups::adjustBranchTarget(Value);
+
+  // Jumps are relative to the current instruction.
+  Value -= 1;
 }
 
 /// 22-bit absolute fixup.
@@ -513,14 +516,10 @@ bool AVRAsmBackend::shouldForceRelocation(const MCAssembler &Asm,
   switch ((unsigned)Fixup.getKind()) {
   default:
     return Fixup.getKind() >= FirstLiteralRelocationKind;
-  // Fixups which should always be recorded as relocations.
   case AVR::fixup_7_pcrel:
   case AVR::fixup_13_pcrel:
-    // Do not force relocation for PC relative branch like 'rjmp .',
-    // 'rcall . - off' and 'breq . + off'.
-    if (const auto *SymA = Target.getSymA())
-      if (SymA->getSymbol().getName().size() == 0)
-        return false;
+    // Always resolve relocations for PC-relative branches
+    return false;
     [[fallthrough]];
   case AVR::fixup_call:
     return true;
diff --git a/llvm/test/CodeGen/AVR/jmp.ll b/llvm/test/CodeGen/AVR/jmp.ll
new file mode 100644
index 00000000000000..30c7850b368442
--- /dev/null
+++ b/llvm/test/CodeGen/AVR/jmp.ll
@@ -0,0 +1,24 @@
+; RUN: llc -filetype=obj -mtriple=avr < %s | llvm-objdump -d --no-show-raw-insn - | FileCheck %s
+
+define i8 @foo(i8 %a) {
+bb0:
+  %0 = tail call i8 @bar(i8 %a)
+  %1 = icmp eq i8 %0, 123
+  br i1 %1, label %bb1, label %bb2
+
+bb1:
+  ret i8 100
+
+bb2:
+  ret i8 200
+}
+
+declare i8 @bar(i8);
+
+; CHECK: rcall   .-2
+; CHECK-NEXT: cpi     r24, 0x7b
+; CHECK-NEXT: brne    .+4
+; CHECK-NEXT: ldi     r24, 0x64
+; CHECK-NEXT: ret
+; CHECK-NEXT: ldi     r24, 0xc8
+; CHECK-NEXT: ret
diff --git a/llvm/test/MC/AVR/inst-adc.s b/llvm/test/MC/AVR/inst-adc.s
index d1157bc7a9b3ba..33ce40849e021b 100644
--- a/llvm/test/MC/AVR/inst-adc.s
+++ b/llvm/test/MC/AVR/inst-adc.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr < %s | llvm-objdump -d - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   adc r0,  r15
   adc r15, r0
   adc r16, r31
diff --git a/llvm/test/MC/AVR/inst-add.s b/llvm/test/MC/AVR/inst-add.s
index 49ad5de80c06b9..5120af12be7c41 100644
--- a/llvm/test/MC/AVR/inst-add.s
+++ b/llvm/test/MC/AVR/inst-add.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr < %s | llvm-objdump -d - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   add r0,  r15
   add r15, r0
   add r16, r31
diff --git a/llvm/test/MC/AVR/inst-adiw.s b/llvm/test/MC/AVR/inst-adiw.s
index 7904965a51d68d..6b83027234c11c 100644
--- a/llvm/test/MC/AVR/inst-adiw.s
+++ b/llvm/test/MC/AVR/inst-adiw.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -mattr=addsubiw -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr -mattr=addsubiw < %s | llvm-objdump --no-print-imm-hex -dr --mattr=addsubiw - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   adiw r26,  12
   adiw r26,  63
 
diff --git a/llvm/test/MC/AVR/inst-and.s b/llvm/test/MC/AVR/inst-and.s
index c4d90bfba37477..19d8a16862dd93 100644
--- a/llvm/test/MC/AVR/inst-and.s
+++ b/llvm/test/MC/AVR/inst-and.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr < %s | llvm-objdump -d - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   and r0,  r15
   and r15, r0
   and r16, r31
diff --git a/llvm/test/MC/AVR/inst-andi.s b/llvm/test/MC/AVR/inst-andi.s
index 96a090173bd786..a68eb66921bc25 100644
--- a/llvm/test/MC/AVR/inst-andi.s
+++ b/llvm/test/MC/AVR/inst-andi.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr < %s | llvm-objdump --no-print-imm-hex -d - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   andi r16, 255
   andi r29, 190
   andi r22, 172
diff --git a/llvm/test/MC/AVR/inst-asr.s b/llvm/test/MC/AVR/inst-asr.s
index 1b59d027dc2bcc..265f8646d4976d 100644
--- a/llvm/test/MC/AVR/inst-asr.s
+++ b/llvm/test/MC/AVR/inst-asr.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr < %s | llvm-objdump -d - | FileCheck -check-prefix=CHECK-INST %s
 
-
 foo:
-
   asr r31
   asr r25
   asr r5
diff --git a/llvm/test/MC/AVR/inst-bld.s b/llvm/test/MC/AVR/inst-bld.s
index 71352c5c0abd80..01a1b8b6973290 100644
--- a/llvm/test/MC/AVR/inst-bld.s
+++ b/llvm/test/MC/AVR/inst-bld.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr < %s | llvm-objdump --no-print-imm-hex -d - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   bld r3, 5
   bld r1, 1
   bld r0, 0
diff --git a/llvm/test/MC/AVR/inst-brbc.s b/llvm/test/MC/AVR/inst-brbc.s
index 4d7d684da4468a..3ef3664cf07bfc 100644
--- a/llvm/test/MC/AVR/inst-brbc.s
+++ b/llvm/test/MC/AVR/inst-brbc.s
@@ -3,7 +3,6 @@
 ; RUN:     | llvm-objdump -d - | FileCheck --check-prefix=INST %s
 
 foo:
-
   brbc 3, .+8
   brbc 0, .-16
   .short 0xf759
@@ -11,14 +10,16 @@ foo:
   .short 0xf74c
   .short 0xf4c7
 
-; CHECK: brvc .Ltmp0+8              ; encoding: [0bAAAAA011,0b111101AA]
-; CHECK:                            ; fixup A - offset: 0, value: .Ltmp0+8, kind: fixup_7_pcrel
-; CHECK: brcc .Ltmp1-16             ; encoding: [0bAAAAA000,0b111101AA]
-; CHECK:                            ; fixup A - offset: 0, value: .Ltmp1-16, kind: fixup_7_pcrel
+; CHECK: brvc (.Ltmp0+8)+2   ; encoding: [0bAAAAA011,0b111101AA]
+; CHECK-NEXT:                ; fixup A - offset: 0, value: (.Ltmp0+8)+2, kind: fixup_7_pcrel
+;
+; CHECK: brcc (.Ltmp1-16)+2  ; encoding: [0bAAAAA000,0b111101AA]
+; CHECK-NEXT:                ; fixup A - offset: 0, value: (.Ltmp1-16)+2, kind: fixup_7_pcrel
 
-; INST: 23 f4   brvc .+8
-; INST: c0 f7   brsh .-16
-; INST: 59 f7   brne .-42
-; INST: 52 f7   brpl .-44
-; INST: 4c f7   brge .-46
-; INST: c7 f4   brid .+48
+; INST-LABEL: <foo>:
+; INST-NEXT: 23 f4   brvc .+8
+; INST-NEXT: c0 f7   brsh .-16
+; INST-NEXT: 59 f7   brne .-42
+; INST-NEXT: 52 f7   brpl .-44
+; INST-NEXT: 4c f7   brge .-46
+; INST-NEXT: c7 f4   brid .+48
diff --git a/llvm/test/MC/AVR/inst-brbs.s b/llvm/test/MC/AVR/inst-brbs.s
index 7987feeec654a1..58db423aae50fa 100644
--- a/llvm/test/MC/AVR/inst-brbs.s
+++ b/llvm/test/MC/AVR/inst-brbs.s
@@ -3,7 +3,6 @@
 ; RUN:     | llvm-objdump -d - | FileCheck --check-prefix=INST %s
 
 foo:
-
   brbs 3, .+8
   brbs 0, .-12
   .short 0xf359
@@ -11,14 +10,16 @@ foo:
   .short 0xf34c
   .short 0xf077
 
-; CHECK: brvs .Ltmp0+8              ; encoding: [0bAAAAA011,0b111100AA]
-; CHECK:                            ; fixup A - offset: 0, value: .Ltmp0+8, kind: fixup_7_pcrel
-; CHECK: brcs .Ltmp1-12             ; encoding: [0bAAAAA000,0b111100AA]
-; CHECK:                            ; fixup A - offset: 0, value: .Ltmp1-12, kind: fixup_7_pcrel
+; CHECK: brvs (.Ltmp0+8)+2   ; encoding: [0bAAAAA011,0b111100AA]
+; CHECK-NEXT:                ; fixup A - offset: 0, value: (.Ltmp0+8)+2, kind: fixup_7_pcrel
+;
+; CHECK: brcs (.Ltmp1-12)+2  ; encoding: [0bAAAAA000,0b111100AA]
+; CHECK-NEXT:                ; fixup A - offset: 0, value: (.Ltmp1-12)+2, kind: fixup_7_pcrel
 
-; INST: 23 f0   brvs .+8
-; INST: d0 f3   brlo .-12
-; INST: 59 f3   breq .-42
-; INST: 52 f3   brmi .-44
-; INST: 4c f3   brlt .-46
-; INST: 77 f0   brie .+28
+; INST-LABEL: <foo>:
+; INST-NEXT: 23 f0   brvs .+8
+; INST-NEXT: d0 f3   brlo .-12
+; INST-NEXT: 59 f3   breq .-42
+; INST-NEXT: 52 f3   brmi .-44
+; INST-NEXT: 4c f3   brlt .-46
+; INST-NEXT: 77 f0   brie .+28
diff --git a/llvm/test/MC/AVR/inst-brcc.s b/llvm/test/MC/AVR/inst-brcc.s
new file mode 100644
index 00000000000000..d9218bc61e787f
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brcc.s
@@ -0,0 +1,28 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brcc .+66
+  brcc .-22
+  brbc 0, .+66
+  brbc 0, bar
+
+bar:
+
+; CHECK: brcc (.Ltmp0+66)+2  ; encoding: [0bAAAAA000,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp0+66)+2, kind: fixup_7_pcrel
+; CHECK: brcc (.Ltmp1-22)+2  ; encoding: [0bAAAAA000,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp1-22)+2, kind: fixup_7_pcrel
+; CHECK: brcc (.Ltmp2+66)+2  ; encoding: [0bAAAAA000,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp2+66)+2, kind: fixup_7_pcrel
+; CHECK: brcc bar            ; encoding: [0bAAAAA000,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: 08 f5      brsh .+66
+; INST-NEXT: a8 f7      brsh .-22
+; INST-NEXT: 08 f5      brsh .+66
+; INST-NEXT: 00 f4      brsh .+0
diff --git a/llvm/test/MC/AVR/inst-brcs.s b/llvm/test/MC/AVR/inst-brcs.s
new file mode 100644
index 00000000000000..0012cb31f61269
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brcs.s
@@ -0,0 +1,28 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brcs .+8
+  brcs .+4
+  brbs 0, .+8
+  brbs 0, bar
+
+bar:
+
+; CHECK: brcs (.Ltmp0+8)+2  ; encoding: [0bAAAAA000,0b111100AA]
+; CHECK-NEXT:               ;   fixup A - offset: 0, value: (.Ltmp0+8)+2, kind: fixup_7_pcrel
+; CHECK: brcs (.Ltmp1+4)+2  ; encoding: [0bAAAAA000,0b111100AA]
+; CHECK-NEXT:               ;   fixup A - offset: 0, value: (.Ltmp1+4)+2, kind: fixup_7_pcrel
+; CHECK: brcs (.Ltmp2+8)+2  ; encoding: [0bAAAAA000,0b111100AA]
+; CHECK-NEXT:               ;   fixup A - offset: 0, value: (.Ltmp2+8)+2, kind: fixup_7_pcrel
+; CHECK: brcs bar           ; encoding: [0bAAAAA000,0b111100AA]
+; CHECK-NEXT:               ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: 20 f0      brlo .+8
+; INST-NEXT: 10 f0      brlo .+4
+; INST-NEXT: 20 f0      brlo .+8
+; INST-NEXT: 00 f0      brlo .+0
diff --git a/llvm/test/MC/AVR/inst-break.s b/llvm/test/MC/AVR/inst-break.s
index a1bfde93c5a0d7..bb3abc651a1511 100644
--- a/llvm/test/MC/AVR/inst-break.s
+++ b/llvm/test/MC/AVR/inst-break.s
@@ -1,9 +1,7 @@
 ; RUN: llvm-mc -triple avr -mattr=break -show-encoding < %s | FileCheck %s
 ; RUN: llvm-mc -filetype=obj -triple avr -mattr=break < %s | llvm-objdump -d --mattr=break - | FileCheck --check-prefix=CHECK-INST %s
 
-
 foo:
-
   break
 
 ; CHECK: break                  ; encoding: [0x98,0x95]
diff --git a/llvm/test/MC/AVR/inst-breq.s b/llvm/test/MC/AVR/inst-breq.s
new file mode 100644
index 00000000000000..f82010f02ba617
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-breq.s
@@ -0,0 +1,28 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  breq .-18
+  breq .-12
+  brbs 1, .-18
+  brbs 1, bar
+
+bar:
+
+; CHECK: breq    (.Ltmp0-18)+2     ; encoding: [0bAAAAA001,0b111100AA]
+; CHECK-NEXT:                      ;   fixup A - offset: 0, value: (.Ltmp0-18)+2, kind: fixup_7_pcrel
+; CHECK: breq    (.Ltmp1-12)+2     ; encoding: [0bAAAAA001,0b111100AA]
+; CHECK-NEXT:                      ;   fixup A - offset: 0, value: (.Ltmp1-12)+2, kind: fixup_7_pcrel
+; CHECK: brbs    1, (.Ltmp2-18)+2  ; encoding: [0bAAAAA001,0b111100AA]
+; CHECK-NEXT:                      ;   fixup A - offset: 0, value: (.Ltmp2-18)+2, kind: fixup_7_pcrel
+; CHECK: brbs    1, bar            ; encoding: [0bAAAAA001,0b111100AA]
+; CHECK-NEXT:                      ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: b9 f3      breq .-18
+; INST-NEXT: d1 f3      breq .-12
+; INST-NEXT: b9 f3      breq .-18
+; INST-NEXT: 01 f0      breq .+0
diff --git a/llvm/test/MC/AVR/inst-brge.s b/llvm/test/MC/AVR/inst-brge.s
new file mode 100644
index 00000000000000..1121284a114689
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brge.s
@@ -0,0 +1,24 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brge .+50
+  brge .+42
+  brge bar
+
+bar:
+
+; CHECK: brge (.Ltmp0+50)+2  ; encoding: [0bAAAAA100,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp0+50)+2, kind: fixup_7_pcrel
+; CHECK: brge (.Ltmp1+42)+2  ; encoding: [0bAAAAA100,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp1+42)+2, kind: fixup_7_pcrel
+; CHECK: brge bar            ; encoding: [0bAAAAA100,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: cc f4      brge .+50
+; INST-NEXT: ac f4      brge .+42
+; INST-NEXT: 04 f4      brge .+0
diff --git a/llvm/test/MC/AVR/inst-brhc.s b/llvm/test/MC/AVR/inst-brhc.s
new file mode 100644
index 00000000000000..eb16ac2ef7a64e
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brhc.s
@@ -0,0 +1,24 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brhc .+12
+  brhc .+14
+  brhc bar
+
+bar:
+
+; CHECK: brhc (.Ltmp0+12)+2  ; encoding: [0bAAAAA101,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp0+12)+2, kind: fixup_7_pcrel
+; CHECK: brhc (.Ltmp1+14)+2  ; encoding: [0bAAAAA101,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp1+14)+2, kind: fixup_7_pcrel
+; CHECK: brhc bar            ; encoding: [0bAAAAA101,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: 35 f4      brhc .+12
+; INST-NEXT: 3d f4      brhc .+14
+; INST-NEXT: 05 f4      brhc .+0
diff --git a/llvm/test/MC/AVR/inst-brhs.s b/llvm/test/MC/AVR/inst-brhs.s
new file mode 100644
index 00000000000000..77c49596b3b0b8
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brhs.s
@@ -0,0 +1,24 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brhs .-66
+  brhs .+14
+  brhs bar
+
+bar:
+
+; CHECK: brhs (.Ltmp0-66)+2  ; encoding: [0bAAAAA101,0b111100AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp0-66)+2, kind: fixup_7_pcrel
+; CHECK: brhs (.Ltmp1+14)+2  ; encoding: [0bAAAAA101,0b111100AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp1+14)+2, kind: fixup_7_pcrel
+; CHECK: brhs bar            ; encoding: [0bAAAAA101,0b111100AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: fd f2      brhs .-66
+; INST-NEXT: 3d f0      brhs .+14
+; INST-NEXT: 05 f0      brhs .+0
diff --git a/llvm/test/MC/AVR/inst-brid.s b/llvm/test/MC/AVR/inst-brid.s
new file mode 100644
index 00000000000000..70d0ea83c49b2a
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brid.s
@@ -0,0 +1,24 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brid .+42
+  brid .+62
+  brid bar
+
+bar:
+
+; CHECK: brid (.Ltmp0+42)+2  ; encoding: [0bAAAAA111,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp0+42)+2, kind: fixup_7_pcrel
+; CHECK: brid (.Ltmp1+62)+2  ; encoding: [0bAAAAA111,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp1+62)+2, kind: fixup_7_pcrel
+; CHECK: brid bar            ; encoding: [0bAAAAA111,0b111101AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: af f4      brid .+42
+; INST-NEXT: ff f4      brid .+62
+; INST-NEXT: 07 f4      brid .+0
diff --git a/llvm/test/MC/AVR/inst-brie.s b/llvm/test/MC/AVR/inst-brie.s
new file mode 100644
index 00000000000000..717c686e2ed44e
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brie.s
@@ -0,0 +1,24 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
+  brie .+20
+  brie .+40
+  brie bar
+
+bar:
+
+; CHECK: brie (.Ltmp0+20)+2  ; encoding: [0bAAAAA111,0b111100AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp0+20)+2, kind: fixup_7_pcrel
+; CHECK: brie (.Ltmp1+40)+2  ; encoding: [0bAAAAA111,0b111100AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: (.Ltmp1+40)+2, kind: fixup_7_pcrel
+; CHECK: brie bar            ; encoding: [0bAAAAA111,0b111100AA]
+; CHECK-NEXT:                ;   fixup A - offset: 0, value: bar, kind: fixup_7_pcrel
+
+; INST-LABEL: <foo>:
+; INST-NEXT: 57 f0      brie .+20
+; INST-NEXT: a7 f0      brie .+40
+; INST-NEXT: 07 f0      brie .+0
diff --git a/llvm/test/MC/AVR/inst-brlo.s b/llvm/test/MC/AVR/inst-brlo.s
new file mode 100644
index 00000000000000..4b56d66ffdfe00
--- /dev/null
+++ b/llvm/test/MC/AVR/inst-brlo.s
@@ -0,0 +1,24 @@
+; RUN: llvm-mc -triple avr -show-encoding < %s | FileCheck %s
+;
+; RUN: llvm-mc -filetype=obj -triple avr < %s \
+; RUN:     | llvm-objdump -d - \
+; RUN:     | FileCheck --check-prefix=INST %s
+
+foo:
...
[truncated]

@Patryk27 Patryk27 changed the title [AVR] Offset relative jumps by -2 [AVR] Fix parsing & emitting relative jumps Aug 19, 2024
@Patryk27
Copy link
Contributor Author

cc @benshi001 🙂

@aykevl
Copy link
Contributor

aykevl commented Aug 26, 2024

Testing this change now. It does improve things, but it doesn't fix all issues. Still investigating...

@aykevl
Copy link
Contributor

aykevl commented Aug 26, 2024

Nevermind, that was a stale build cache (should really fix that some day).

I can confirm that with this fix, all my TinyGo AVR tests start working again on LLVM 19 (when cherry-picking the first commit on top of llvm19-rc3). I'll review the code more in-depth.

@aykevl
Copy link
Contributor

aykevl commented Aug 26, 2024

This patch makes it so that relative branches are typically resolved at compile time, with no emitted relocations.
That's not like avr-gcc which does seem to emit relocations every time.
For example, with this code:

void asdf(void);

void foobar(char x) {
        while (x != 0) {
                asdf();
                x--;
        }
}

I get the following output for avr-gcc 5.4.0 (Debian), which includes relocations:

00000000 <foobar>:
   0:   push    r28
   2:   mov     r28, r24
   4:   and     r28, r28
   6:   breq    .+0             ; 0x8 <foobar+0x8>
                        6: R_AVR_7_PCREL        .text+0xe
   8:   rcall   .+0             ; 0xa <foobar+0xa>
                        8: R_AVR_13_PCREL       asdf
   a:   subi    r28, 0x01       ; 1
   c:   rjmp    .+0             ; 0xe <foobar+0xe>
                        c: R_AVR_13_PCREL       .text+0x4
   e:   pop     r28
  10:   ret

And the following for avr-gcc 14.1.0 in Fedora 40, which also includes relocations:

00000000 <foobar>:
   0:   push    r28
   2:   mov     r28, r24

00000004 <.L2>:
   4:   cpse    r28, r1
   6:   rjmp    .+0             ; 0x8 <.L2+0x4>
                        6: R_AVR_13_PCREL       .L3
   8:   pop     r28
   a:   ret

0000000c <.L3>:
   c:   rcall   .+0             ; 0xe <.L3+0x2>
                        c: R_AVR_13_PCREL       asdf
   e:   subi    r28, 0x01       ; 1
  10:   rjmp    .+0             ; 0x12 <.L3+0x6>
                        10: R_AVR_13_PCREL      .L2

But Clang outputs the following, with no relocations:

00000000 <foobar>:
   0:   push    r17
   2:   mov     r17, r24
   4:   cpi     r17, 0x00       ; 0
   6:   breq    .+8             ; 0x10 <foobar+0x10>
   8:   call    0       ; 0x0 <foobar>
                        8: R_AVR_CALL   asdf
   c:   dec     r17
   e:   rjmp    .-12            ; 0x4 <foobar+0x4>
  10:   pop     r17
  12:   ret

(In all cases, I compiled to an object file and disassembled using avr-objdump -dr).

I don't know why this is, but my guess would be to support linker relaxation.

In any case:

... which causes most of the relative jumps to be actually resolved late, by the linker, which applies the offsetting logic on its own, hiding the issue within LLVM.

I think that would in fact be a reasonable solution: to let the linker deal with relative branches in all cases.

@aykevl
Copy link
Contributor

aykevl commented Aug 26, 2024

This small patch also gets all my AVR tests to pass:

--- a/llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp
+++ b/llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp
@@ -516,12 +516,6 @@ bool AVRAsmBackend::shouldForceRelocation(const MCAssembler &Asm,
   // Fixups which should always be recorded as relocations.
   case AVR::fixup_7_pcrel:
   case AVR::fixup_13_pcrel:
-    // Do not force relocation for PC relative branch like 'rjmp .',
-    // 'rcall . - off' and 'breq . + off'.
-    if (const auto *SymA = Target.getSymA())
-      if (SymA->getSymbol().getName().size() == 0)
-        return false;
-    [[fallthrough]];
   case AVR::fixup_call:
     return true;
   }

EDIT: nevermind, it doesn't result in the same assembly when checking with your inline assembly example. So it's only mostly a fix, not entirely.

Copy link
Contributor

@aykevl aykevl left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, with a few nits.

I hope this fix can be merged and backported into LLVM 19 before the release!

llvm/lib/Target/AVR/MCTargetDesc/AVRAsmBackend.cpp Outdated Show resolved Hide resolved
llvm/test/CodeGen/AVR/jmp.ll Outdated Show resolved Hide resolved
llvm/test/MC/AVR/inst-brbc.s Outdated Show resolved Hide resolved
@aykevl
Copy link
Contributor

aykevl commented Aug 26, 2024

Also note that you may want to create two separate PRs, one for each commit, unless you want to squash then ("rebase and merge" has been disabled).

@benshi001 benshi001 self-requested a review August 29, 2024 07:33
@benshi001
Copy link
Member

Sorry for my delay, I will take a look soon.

@Patryk27
Copy link
Contributor Author

Should we force relocations for calls and pc-relative jumps, then? 👀 (cc @aykevl)

I'm leaning towards resolving the relocations, because the current behavior seems to confuse people:

... but I don't have a strong opinion here - linker relaxation seems like a good argument (couldn't llvm just do it on its own, though?) and emiting relocations would certainly allow to remove some pieces of code, so there's pros and cons to both approaches.

@aykevl
Copy link
Contributor

aykevl commented Aug 30, 2024

I'm mainly in favor of merging any reasonable fix in time for the LLVM 19 deadline (coming Tuesday) so that the AVR won't be totally broken in there. Whichever fix it is, I'm less concerned about. The PR as-is seems reasonable to me.

I'm leaning towards resolving the relocations, because the current behavior seems to confuse people:

I'm not sure I agree with "it confuses people". Yes, it may confuse people, but that's just how relocations work.
For example, if I have this code:

void foo(void);
void bar(void) {
    foo();
}

and I compile it to an ARM object file, I get the following:

$ llvm-objdump -d tmp/test.o
[...]
00000000 <bar>:
       0: b580         	push	{r7, lr}
       2: af00         	add	r7, sp, #0x0
       4: f7ff fffe    	bl	0x4 <bar+0x4>           @ imm = #-0x4
       8: bd80         	pop	{r7, pc}

The same issue is present here: the bl appears to jump to itself.
It's only when you show relocations, that you see that it's actually a call to foo:

$ llvm-objdump -dr tmp/test.o
[...]
00000000 <bar>:
       0: b580         	push	{r7, lr}
       2: af00         	add	r7, sp, #0x0
       4: f7ff fffe    	bl	0x4 <bar+0x4>           @ imm = #-0x4
			00000004:  R_ARM_THM_CALL	foo
       8: bd80         	pop	{r7, pc}

@MaskRay also has a good explanation here: #104853 (comment)

We could also do both and have all branches point to the destination, and insert relocations.

Anyway, I hope this doesn't distract too much. I just hope we can get a fix in for LLVM 19 because having working code is better than having a totally broken AVR backend. We can think of the most proper fix afterwards.

@aykevl
Copy link
Contributor

aykevl commented Aug 30, 2024

linker relaxation seems like a good argument (couldn't llvm just do it on its own, though?)

Just to clarify: the whole point of linker relaxation is that it can only be done at link time. A typical example would be to convert a call instruction (4 bytes) into a rcall instruction (2 bytes) if the call target is close enough (within 4kB of code). This will change every branch in that function that crosses the call instruction, so they will need to be updated. Hence why every branch needs a relocation when using linker relaxation. (I'm pretty sure it also breaks DWARF debugging on AVR, so it's not without cost, see this post).

...but as long as we don't enable linker relaxation, there's nothing to worry about here. IIRC (but I'm not sure) there's a flag in the object file that tells the linker whether linker relaxation is supported, and I assume we don't set that flag.

@Patryk27
Copy link
Contributor Author

Makes sense, one more thing:

Also note that you may want to create two separate PRs, one for each commit, unless you want to squash then ("rebase and merge" has been disabled).

I'm not sure I follow - what does rebase and merge have to do with multi-commit pull requests? 🤔

@aykevl
Copy link
Contributor

aykevl commented Aug 30, 2024

GitHub has several ways to merge a PR from the web UI:

  • "Create a merge commit"
  • "Squash and merge"
  • "Rebase and merge"

Of these, only the 2nd is enabled, "squash and merge". That means that from the web UI, it's only possible to squash the two commits together into one and merge that. Which is probably not what you intended!

I've noticed that (apparently) I can squash+merge a PR from the web UI. Didn't know that.
In other words, if you make a new PR with just the first commit (for example), it seems I can merge that one and we can request a backport now.

@Patryk27
Copy link
Contributor Author

Okie, on it!

@Patryk27
Copy link
Contributor Author

Okie, after that other PR is merged, I'll rebase this one and adjust it to contain only the test-refactoring commit.

; CHECK: ; fixup A - offset: 0, value: .Ltmp0+8, kind: fixup_7_pcrel
; CHECK: brcc .Ltmp1-16 ; encoding: [0bAAAAA000,0b111101AA]
; CHECK: ; fixup A - offset: 0, value: .Ltmp1-16, kind: fixup_7_pcrel
; CHECK: brvc (.Ltmp0+8)+2 ; encoding: [0bAAAAA011,0b111101AA]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Though the generated binary is correct, which makes TinyGo fully work. The disassembly of llvm-objdump is not in accordance with avr-objdump. avr-objdump will output 1a: 23 f4 brvc .+8, without an extra +2.

So can we also fix llvm-objdump ? BTW, is it possible to fix this issue by modification in adjustRelativeBranch. (Sorry I have not enough time to investigate my suggestion)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see, yes - we probably have to approach the problem differently, since treating . as +2 leaks into the objdump's output.

We've already got this commit merged, but I'll prepare a better fix in the coming days.

@Patryk27
Copy link
Contributor Author

Patryk27 commented Sep 3, 2024

I'm cleaning up the pull requests - this change has already been merged, we're missing just the second commit, which I'll extract into a separate pull request now.

I'm also noting both your suggestions and I'll try to prepare a fix where +2 doesn't leak into objdump's output and where we generate relocations, to be in accordance with gcc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
mc Machine (object) code
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[AVR] Jumps are off by 2
4 participants