Ensure NoTrapAfterNoreturn is false for the wasm backend #65876

majaha · 2023-09-10T06:52:16Z

In the WebAssembly back end, the TrapUnreachable option is currently load-bearing for correctness, inserting wasm unreachable instructions where needed to create valid wasm. There is another option, NoTrapAfterNoreturn, that removes some of those traps and causes incorrect wasm to be emitted.

The first commit adds a command line flag for NoTrapAfterNoreturn so that it can be tested, and adds new tests (and fixes some typos).
The second commit turns off NoTrapAfterNoreturn for the Wasm backend.
The third commit adds a new peephole optimisation, to remove some common cases of unnecessary instructions around unreachable. Properly modelling how unreachable can be a sink and a source for operands would be better, but this will do for now.

Add the command line flag --no-trap-after-noreturn. Add and improve tests related to WebAssembly's unreachable instruction. Also fix various typos.

majaha · 2023-09-10T07:23:28Z

As an aside, here's a thought that occurred to me while reading through the code: It would be nice if LLVM IR's unreachable was instruction-selected to an ISD::UNDEFINED_BEHAVIOUR pseudo-instruction (when TrapUnreachable is off). Physical ISA backends would lower it to nothing, but wasm could lower it to wasm unreachable or nothing, depending on what was needed to fulfil the stack requirement. I didn't feel brave enough to attempt that though :)

The new peephole optimisation removed an unreachable in this test.

llvmbot · 2023-09-11T22:03:26Z

@llvm/pr-subscribers-lld-wasm

Changes

In the WebAssembly back end, the TrapUnreachable option is currently load-bearing for correctness, inserting wasm unreachable instructions where needed to create valid wasm. There is another option, NoTrapAfterNoreturn, that removes some of those traps and causes incorrect wasm to be emitted.

The first commit adds a command line flag for NoTrapAfterNoreturn so that it can be tested, and adds new tests (and fixes some typos).
The second commit turns off NoTrapAfterNoreturn for the Wasm backend.
The third commit adds a new peephole optimisation, to remove some common cases of unnecessary instructions around `unreachable`. Properly modelling how `unreachable` can be a sink and a source for operands would be better, but this will do for now.

Full diff: https://github.com/llvm/llvm-project/pull/65876.diff

10 Files Affected:

(modified) lld/test/wasm/init-fini.ll (+1-1)
(modified) llvm/include/llvm/CodeGen/MachineFunction.h (+1-1)
(modified) llvm/include/llvm/CodeGen/MachineInstr.h (+1-1)
(modified) llvm/lib/CodeGen/LLVMTargetMachine.cpp (+7)
(modified) llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp (+1-1)
(modified) llvm/lib/Target/WebAssembly/WebAssemblyDebugFixup.cpp (+11)
(modified) llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp (+51)
(modified) llvm/lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp (+1)
(modified) llvm/test/CodeGen/WebAssembly/unreachable.ll (+116-21)
(modified) llvm/test/MC/WebAssembly/global-ctor-dtor.ll (+6-6)

diff --git a/lld/test/wasm/init-fini.ll b/lld/test/wasm/init-fini.ll
index 14385f042efb7c2..35286bbcccc2f54 100644
--- a/lld/test/wasm/init-fini.ll
+++ b/lld/test/wasm/init-fini.ll
@@ -77,7 +77,7 @@ entry:
 ; CHECK-NEXT:         Body:            10031004100A100F1012100F10141003100C100F10161001100E0B
 ; CHECK:            - Index:           22
 ; CHECK-NEXT:         Locals:
-; CHECK-NEXT:         Body:            02404186808080004100418088808000108780808000450D0000000B0B
+; CHECK-NEXT:         Body:            02404186808080004100418088808000108780808000450D00000B0B
 ; CHECK-NEXT:   - Type:            CUSTOM
 ; CHECK-NEXT:     Name:            name
 ; CHECK-NEXT:     FunctionNames:
diff --git a/llvm/include/llvm/CodeGen/MachineFunction.h b/llvm/include/llvm/CodeGen/MachineFunction.h
index 6c2da626ea54b4d..8f1651c2958e591 100644
--- a/llvm/include/llvm/CodeGen/MachineFunction.h
+++ b/llvm/include/llvm/CodeGen/MachineFunction.h
@@ -266,7 +266,7 @@ class LLVM_EXTERNAL_VISIBILITY MachineFunction {
   // RegInfo - Information about each register in use in the function.
   MachineRegisterInfo *RegInfo;
 
-  // Used to keep track of target-specific per-machine function information for
+  // Used to keep track of target-specific per-machine-function information for
   // the target implementation.
   MachineFunctionInfo *MFInfo;
 
diff --git a/llvm/include/llvm/CodeGen/MachineInstr.h b/llvm/include/llvm/CodeGen/MachineInstr.h
index 03fb15f77c65cbb..8367f999fcc76d3 100644
--- a/llvm/include/llvm/CodeGen/MachineInstr.h
+++ b/llvm/include/llvm/CodeGen/MachineInstr.h
@@ -1276,7 +1276,7 @@ class MachineInstr
   /// eraseFromBundle() to erase individual bundled instructions.
   void eraseFromParent();
 
-  /// Unlink 'this' form its basic block and delete it.
+  /// Unlink 'this' from its basic block and delete it.
   ///
   /// If the instruction is part of a bundle, the other instructions in the
   /// bundle remain bundled.
diff --git a/llvm/lib/CodeGen/LLVMTargetMachine.cpp b/llvm/lib/CodeGen/LLVMTargetMachine.cpp
index d02ec1db1165d41..aadc3709b85bfb7 100644
--- a/llvm/lib/CodeGen/LLVMTargetMachine.cpp
+++ b/llvm/lib/CodeGen/LLVMTargetMachine.cpp
@@ -37,6 +37,11 @@ static cl::opt
     EnableTrapUnreachable("trap-unreachable", cl::Hidden,
                           cl::desc("Enable generating trap for unreachable"));
 
+static cl::opt
+    EnableNoTrapAfterNoreturn("no-trap-after-noreturn", cl::Hidden,
+                              cl::desc("Do not emit a trap instruction for 'unreachable' IR instructions "
+                              "after noreturn calls, even if --trap-unreachable is set."));
+
 void LLVMTargetMachine::initAsmInfo() {
   MRI.reset(TheTarget.createMCRegInfo(getTargetTriple().str()));
   assert(MRI && "Unable to create reg info");
@@ -95,6 +100,8 @@ LLVMTargetMachine::LLVMTargetMachine(const Target &T,
 
   if (EnableTrapUnreachable)
     this->Options.TrapUnreachable = true;
+  if (EnableNoTrapAfterNoreturn)
+    this->Options.NoTrapAfterNoreturn = true;
 }
 
 TargetTransformInfo
diff --git a/llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp b/llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp
index 131e99c66fa2e5a..d8cbddf74545da6 100644
--- a/llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp
+++ b/llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp
@@ -667,7 +667,7 @@ void WebAssemblyCFGStackify::removeUnnecessaryInstrs(MachineFunction &MF) {
 
   // When there is an unconditional branch right before a catch instruction and
   // it branches to the end of end_try marker, we don't need the branch, because
-  // it there is no exception, the control flow transfers to that point anyway.
+  // if there is no exception, the control flow transfers to that point anyway.
   // bb0:
   //   try
   //     ...
diff --git a/llvm/lib/Target/WebAssembly/WebAssemblyDebugFixup.cpp b/llvm/lib/Target/WebAssembly/WebAssemblyDebugFixup.cpp
index 4a75bab6b95ddcd..eb7d5b2cfd2a4b3 100644
--- a/llvm/lib/Target/WebAssembly/WebAssemblyDebugFixup.cpp
+++ b/llvm/lib/Target/WebAssembly/WebAssemblyDebugFixup.cpp
@@ -122,9 +122,20 @@ bool WebAssemblyDebugFixup::runOnMachineFunction(MachineFunction &MF) {
           // it will be culled later.
         }
       } else {
+        
+        // WebAssembly Peephole optimisation can remove instructions around wasm unreachable.
+        // This is valid for wasm, as unreachable is operand stack polymorphic. But this is not modeled
+        // in llvm at the moment, and so the stack may not seem to pop all that it pushes.
+        // Clear the stack so we don't violate the assert(Stack.empty()) later on.
+        if (MI.getOpcode() == WebAssembly::UNREACHABLE) {
+          Stack.clear();
+          break;
+        }
+
         // Track stack depth.
         for (MachineOperand &MO : reverse(MI.explicit_uses())) {
           if (MO.isReg() && MFI.isVRegStackified(MO.getReg())) {
+            assert(Stack.size() != 0 && "WebAssemblyDebugFixup: Pop: Operand stack empty!");
             auto Prev = Stack.back();
             Stack.pop_back();
             assert(Prev.Reg == MO.getReg() &&
diff --git a/llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp b/llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp
index 6e2d566d9b48630..a573f0d86436e58 100644
--- a/llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp
+++ b/llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp
@@ -20,6 +20,7 @@
 #include "llvm/CodeGen/MachineFunctionPass.h"
 #include "llvm/CodeGen/MachineInstrBuilder.h"
 #include "llvm/CodeGen/MachineRegisterInfo.h"
+#include 
 using namespace llvm;
 
 #define DEBUG_TYPE "wasm-peephole"
@@ -109,6 +110,53 @@ static bool maybeRewriteToFallthrough(MachineInstr &MI, MachineBasicBlock &MBB,
   return true;
 }
 
+static bool eraseDeadCodeAroundUnreachable(MachineInstr &UnreachbleMI, MachineBasicBlock &MBB) {
+  SmallVector ToDelete;
+
+  // Because wasm unreachable is stack polymorphic and unconditionally ends control,
+  // all instructions after it can be removed until the end of this block.
+  // We remove the common case of double unreachable.
+  auto ForwardsIterator = UnreachbleMI.getIterator();
+  for (ForwardsIterator++; !ForwardsIterator.isEnd(); ForwardsIterator++) {
+    MachineInstr& MI = *ForwardsIterator;
+    if (MI.getOpcode() == WebAssembly::UNREACHABLE) {
+      ToDelete.push_back(&MI);
+    } else {
+      break;
+    }
+  }
+
+  [&]() {
+    // For the same reasons as above, previous instructions that only affect
+    // local function state can be removed (e.g. local.set, drop, various reads).
+    // We remove the common case of "drop unreachable".
+    auto BackwardsIterator = UnreachbleMI.getReverseIterator();
+    for (BackwardsIterator++; !BackwardsIterator.isEnd(); BackwardsIterator++) {
+      MachineInstr& MI = *BackwardsIterator;
+      switch(MI.getOpcode()) {
+      case WebAssembly::DROP_I32:
+      case WebAssembly::DROP_I64:
+      case WebAssembly::DROP_F32:
+      case WebAssembly::DROP_F64:
+      case WebAssembly::DROP_EXTERNREF:
+      case WebAssembly::DROP_FUNCREF:
+      case WebAssembly::DROP_V128:
+        ToDelete.push_back(&MI);
+        continue;
+      default:
+        return;
+      }
+    }
+  }();
+
+  bool Changed = false;
+  for (MachineInstr* MI : ToDelete) {
+    MI->eraseFromParent();
+    Changed = true;
+  }
+  return Changed;
+}
+
 bool WebAssemblyPeephole::runOnMachineFunction(MachineFunction &MF) {
   LLVM_DEBUG({
     dbgs() << "********** Peephole **********\n"
@@ -159,6 +207,9 @@ bool WebAssemblyPeephole::runOnMachineFunction(MachineFunction &MF) {
       case WebAssembly::RETURN:
         Changed |= maybeRewriteToFallthrough(MI, MBB, MF, MFI, MRI, TII);
         break;
+      case WebAssembly::UNREACHABLE:
+        Changed |= eraseDeadCodeAroundUnreachable(MI, MBB);
+        break;
       }
 
   return Changed;
diff --git a/llvm/lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp b/llvm/lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp
index f8a4b95a95515e4..184910ae68f15e6 100644
--- a/llvm/lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp
+++ b/llvm/lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp
@@ -127,6 +127,7 @@ WebAssemblyTargetMachine::WebAssemblyTargetMachine(
   // LLVM 'unreachable' to ISD::TRAP and then lower that to WebAssembly's
   // 'unreachable' instructions which is meant for that case.
   this->Options.TrapUnreachable = true;
+  this->Options.NoTrapAfterNoreturn = false;
 
   // WebAssembly treats each function as an independent unit. Force
   // -ffunction-sections, effectively, so that we can emit them independently.
diff --git a/llvm/test/CodeGen/WebAssembly/unreachable.ll b/llvm/test/CodeGen/WebAssembly/unreachable.ll
index ad1c90090ac58bf..38b8f543dffd972 100644
--- a/llvm/test/CodeGen/WebAssembly/unreachable.ll
+++ b/llvm/test/CodeGen/WebAssembly/unreachable.ll
@@ -1,33 +1,128 @@
-; RUN: llc < %s -asm-verbose=false -verify-machineinstrs | FileCheck %s
-; RUN: llc < %s -asm-verbose=false -fast-isel -fast-isel-abort=1 -verify-machineinstrs | FileCheck %s
-
-; Test that LLVM unreachable instruction and trap intrinsic are lowered to
-; wasm unreachable
+; RUN: llc < %s -verify-machineinstrs | FileCheck %s
+; RUN: llc < %s -fast-isel -fast-isel-abort=1 -verify-machineinstrs | FileCheck %s
+; RUN: llc < %s -verify-machineinstrs --trap-unreachable | FileCheck %s
+; RUN: llc < %s -fast-isel -fast-isel-abort=1 -verify-machineinstrs --trap-unreachable | FileCheck %s
+; RUN: llc < %s -verify-machineinstrs --trap-unreachable --no-trap-after-noreturn | FileCheck %s
+; RUN: llc < %s -fast-isel -fast-isel-abort=1 -verify-machineinstrs --trap-unreachable --no-trap-after-noreturn | FileCheck %s
 
 target triple = "wasm32-unknown-unknown"
 
-declare void @llvm.trap()
-declare void @llvm.debugtrap()
-declare void @abort()
 
-; CHECK-LABEL: f1:
-; CHECK: call abort{{$}}
-; CHECK: unreachable
-define i32 @f1() {
-  call void @abort()
-  unreachable
-}
+; Test that the LLVM trap and debug trap intrinsics are lowered to wasm unreachable.
+
+declare void @llvm.trap() cold noreturn nounwind
+declare void @llvm.debugtrap() nounwind
 
-; CHECK-LABEL: f2:
-; CHECK: unreachable
-define void @f2() {
+define void @trap_ret_void() {
+; CHECK-LABEL: trap_ret_void:
+; CHECK:         .functype trap_ret_void () -> ()
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    # fallthrough-return
+; CHECK-NEXT:    end_function
   call void @llvm.trap()
   ret void
 }
 
-; CHECK-LABEL: f3:
-; CHECK: unreachable
-define void @f3() {
+define void @dtrap_ret_void() {
+; CHECK-LABEL: dtrap_ret_void:
+; CHECK:         .functype dtrap_ret_void () -> ()
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    # fallthrough-return
+; CHECK-NEXT:    end_function
   call void @llvm.debugtrap()
   ret void
 }
+
+; Test that LLVM trap followed by LLVM unreachable becomes exactly one wasm unreachable.
+define void @trap_unreach() {
+; CHECK-LABEL: trap_unreach:
+; CHECK:         .functype trap_unreach () -> ()
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    end_function
+  call void @llvm.trap()
+  unreachable
+}
+
+
+; Test that LLVM unreachable instruction is lowered to wasm unreachable when necessary
+; to fulfill the wasm operand stack requirements.
+
+declare void @ext_func()
+declare i32 @ext_func_i32()
+declare void @ext_never_return() noreturn
+
+; This test emits wasm unreachable to fill in for the missing i32 return value.
+define i32 @missing_ret_unreach() {
+; CHECK-LABEL: missing_ret_unreach:
+; CHECK:         .functype missing_ret_unreach () -> (i32)
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    call ext_func
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    end_function
+  call void @ext_func()
+  unreachable
+}
+
+; This is similar to the above test, but ensures wasm unreachable is emitted even
+; after a noreturn call.
+define i32 @missing_ret_noreturn_unreach() {
+; CHECK-LABEL: missing_ret_noreturn_unreach:
+; CHECK:         .functype missing_ret_noreturn_unreach () -> (i32)
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    call ext_never_return
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    end_function
+  call void @ext_never_return()
+  unreachable
+}
+
+; We could emit no instructions at all for the llvm unreachables in these next three tests, as the signatures match
+; and reaching llvm unreachable is undefined behaviour. But wasm unreachable is emitted for the time being.
+
+define void @void_sig_match_unreach() {
+; CHECK-LABEL: void_sig_match_unreach:
+; CHECK:         .functype void_sig_match_unreach () -> ()
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    call ext_func
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    end_function
+  call void @ext_func()
+  unreachable
+}
+
+define i32 @i32_sig_match_unreach() {
+; CHECK-LABEL: i32_sig_match_unreach:
+; CHECK:         .functype i32_sig_match_unreach () -> (i32)
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    call ext_func_i32
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    end_function
+  call i32 @ext_func_i32()
+  unreachable
+}
+
+define void @void_sig_match_noreturn_unreach() {
+; CHECK-LABEL: void_sig_match_noreturn_unreach:
+; CHECK:         .functype void_sig_match_noreturn_unreach () -> ()
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    call ext_never_return
+; CHECK-NEXT:    unreachable
+; CHECK-NEXT:    end_function
+  call void @ext_never_return()
+  unreachable
+}
+
+; This function currently doesn't emit unreachable.
+define void @void_sig_match_noreturn_ret() {
+; CHECK-LABEL: void_sig_match_noreturn_ret:
+; CHECK:         .functype void_sig_match_noreturn_ret () -> ()
+; CHECK-NEXT:  # %bb.0:
+; CHECK-NEXT:    call ext_never_return
+; CHECK-NEXT:    # fallthrough-return
+; CHECK-NEXT:    end_function
+  call void @ext_never_return()
+  ret void
+}
diff --git a/llvm/test/MC/WebAssembly/global-ctor-dtor.ll b/llvm/test/MC/WebAssembly/global-ctor-dtor.ll
index bc1be7931349697..f1ec71da1ebb641 100644
--- a/llvm/test/MC/WebAssembly/global-ctor-dtor.ll
+++ b/llvm/test/MC/WebAssembly/global-ctor-dtor.ll
@@ -80,29 +80,29 @@ declare void @func3()
 ; CHECK-NEXT:         Offset:          0x1D
 ; CHECK-NEXT:       - Type:            R_WASM_FUNCTION_INDEX_LEB
 ; CHECK-NEXT:         Index:           6
-; CHECK-NEXT:         Offset:          0x2C
+; CHECK-NEXT:         Offset:          0x2B
 ; CHECK-NEXT:       - Type:            R_WASM_TABLE_INDEX_SLEB
 ; CHECK-NEXT:         Index:           5
-; CHECK-NEXT:         Offset:          0x37
+; CHECK-NEXT:         Offset:          0x36
 ; CHECK-NEXT:       - Type:            R_WASM_MEMORY_ADDR_SLEB
 ; CHECK-NEXT:         Index:           3
-; CHECK-NEXT:         Offset:          0x3F
+; CHECK-NEXT:         Offset:          0x3E
 ; CHECK-NEXT:       - Type:            R_WASM_FUNCTION_INDEX_LEB
 ; CHECK-NEXT:         Index:           4
-; CHECK-NEXT:         Offset:          0x45
+; CHECK-NEXT:         Offset:          0x44
 ; CHECK-NEXT:     Functions:
 ; CHECK-NEXT:       - Index:           5
 ; CHECK-NEXT:         Locals:
 ; CHECK-NEXT:         Body:            1080808080000B
 ; CHECK-NEXT:       - Index:           6
 ; CHECK-NEXT:         Locals:
-; CHECK-NEXT:         Body:            02404181808080004100418080808000108180808000450D0000000B0B
+; CHECK-NEXT:         Body:            02404181808080004100418080808000108180808000450D00000B0B
 ; CHECK-NEXT:       - Index:           7
 ; CHECK-NEXT:         Locals:
 ; CHECK-NEXT:         Body:            1082808080000B
 ; CHECK-NEXT:       - Index:           8
 ; CHECK-NEXT:         Locals:
-; CHECK-NEXT:         Body:            02404182808080004100418080808000108180808000450D0000000B0B
+; CHECK-NEXT:         Body:            02404182808080004100418080808000108180808000450D00000B0B
 ; CHECK-NEXT:   - Type:            DATA
 ; CHECK-NEXT:     Segments:
 ; CHECK-NEXT:       - SectionOffset:   6

aheejin

Haven't read the code yet, but I wonder, why can't we turn NoTrapAfterNoreturn on? Do we need unreachable after noreturn calls?

llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp

aheejin · 2023-09-11T22:32:09Z

Also please run clang-format

majaha · 2023-09-11T22:42:35Z

Do we need unreachable after noreturn calls?

We do for most cases, because the body of a wasm function has to pass wasm validation. This means that the body has to have the same "type" as the function signature, even if the end of the function is never reached. Wasm unreachable is a special kind of instruction that can have any "type", and so it can be used as an easy way to fulfil the operand stack typing requirements. Essentially we're saying "Nothing can get past this point, so it doesn't matter that this functions body doesn't match its signature."

This is something that native-style architectures don't require, which is kinda what I was getting at with my earlier comment about an UB instruction, to better support these kinds of virtual ISAs.

This reverts commit 219c194.

This reverts commit c8ad813.

llvm/lib/CodeGen/LLVMTargetMachine.cpp

majaha · 2023-09-12T04:24:53Z

Hmm, I'm not sure how I could create another pull request that is stacked on top of this one for the peephole optimisation. Searching around in the LLVM discord, it seems like that's a bit of a pain point at the moment.

aheejin · 2023-09-12T06:33:42Z

Hmm, I'm not sure how I could create another pull request that is stacked on top of this one for the peephole optimisation. Searching around in the LLVM discord, it seems like that's a bit of a pain point at the moment.

I don't think the stacked PR feature exists in Github, which was a nice feature in Phabricator. You can either upload the next PR now that includes both PR's changes and rebase after this lands, or you can wait until this lands first and then create a new PR. Oh, I have seen my colleague using this plugin (https://app.graphite.dev/) for stacked PRs, but I haven't used it myself and don't know how to use it.

majaha · 2023-09-12T10:05:30Z

Here's the new pull request for the peephole optimisation: #66062

majaha · 2023-09-19T13:50:45Z

Are there any more thoughts, opinions, or questions about this PR?

nikic · 2023-09-19T14:04:40Z

This looks generally fine to me, but I think for the new -no-trap-after-noreturn option we should introduce that in a separate PR that actually tests the behavior in a case where it does something. It's weird to only test the option in a place where it doesn't actually do anything.

majaha · 2023-09-19T14:50:37Z

Do you mean that there should be a test that checks that the --no-trap-after-noreturn command line option actually enables the behaviour it's supposed to?

Would that be a unit test, or an llvm-lit style test? Where might that test live?

After a quick search the closest thing I could find for --trap-unreachable is this: https://github.com/llvm/llvm-project/blob/116f7a2dcb86a7a8812a60fb7101f90329dada19/llvm/test/CodeGen/ARM/trap-unreachable.ll
which tests the effect on the ARM backend. There's also one for the Hexagon backend. Is this the kind of test you had in mind?

aheejin

Sorry for the long delay; I was OOO for a while.

Sorry again; I'm still having a hard time understanding what this PR is trying to achieve. See inline comments.

llvm/test/CodeGen/WebAssembly/unreachable.ll

llvm/lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp

llvm/test/CodeGen/WebAssembly/unreachable.ll

llvm/lib/CodeGen/LLVMTargetMachine.cpp

majaha · 2023-09-21T10:12:13Z

@aheejin I suggest looking at each of the first three commits individually. I carefully crafted and stacked them so that the test passes after each commit. That way, the changes to the test declaratively describe the effect of each commit.
You can click on each commit in turn near the top of this page. Also, when looking at changes, there's a little pulldown near the top right that says Changes from * commits that you can use to flick through the different commits.

I probably should have mentioned that earlier. It seems like GitHub doesn't surface this stacked-commit style information as well as it could. I was following nikic's guide for the old pull request system, where I assume this style was better supported.

I'll give a quick overview of what each of the first three commits do:

Add no-trap-after-noreturn flag and wasm tests
This commit doesn't actually contain any real changes. It simply adds some new tests in unreachable.ll that describe the current state of affairs, and adds the command line option --no-trap-after-noreturn to support those tests. This option was previously usable from C++ projects (clang, rustc etc.), but not from the command line, making it untestable (and thus buggy!).
In particular, the function @missing_ret_noreturn_unreach() produces invalid Wasm code when NoTrapAfterNoreturn is turned on: it's missing a necessary unreachable.
Ensure NoTrapAfterNoreturn is false for wasm
This is the bug fix commit. I did the simplest possible thing to fix the bug: copy the behaviour for TargetOptions.TrapUnreachable and overwrite the value of TargetOptions.NoTrapAfterNoreturn with false*. Note that TargetOptions is part of the public API and that these options can be set to anything by llvm's embedders (i.e. clang, rustc).
This is a bit of a hack, but a hack that already existed.
As you can see in the tests, this makes all combinations of the two command line options output the same code, and fixes the invalid output. But there's still those pesky double unreachables, which leads to the next commit:
Add peephole optimisation
This commit adds a simple peephole optimisation that removes unnecessary drops and unreachables near other unreachables. It would be nice if the wasm backend was able to model these properly so as not to add them in the first place, but this fixes the most obvious cases well enough for now. This commit has been reverted and moved to a different PR.

* Incidently, is this already buggy? We are overwriting our user's TargetOptions, and so if they re-used that object for a different backend they might be surprised by it changing on them. Should we be copying the whole TargetOptions instead?

majaha · 2023-09-21T18:53:14Z

@nikic I've split off the --no-trap-after-noreturn flag change into a separate PR: #67051
Is this the kind of change you had in mind?

aheejin · 2023-09-27T18:56:43Z

There's no difference between "doing nothing with respect to it" and "silently rejecting", it's the same thing. The wasm backend understands neither --xcoff-traceback-table, --trap-unreachable, or --no-trap-after-noreturn (the fact that --no-trap-after-noreturn affects the wasm output despite being conceptually unsupported is the bug). None of those are supported by the wasm backend, and they are all equally silently ignored.

I don't think they are the same thing. As I said above, --xcoff-traceback-table doesn't apply to Wasm because Wasm does not use xcoff. But --trap-unreachable or --no-trap-after-noreturn are applicable to wasm and we override them. I don't understand the analogy.

I have been kind of confused by your characterization of this as "bugfix", given that this does not change anything functionality-wise.
...So I'm not sure if I agree that this PR is a bugfix. I consider this as an improvement adding a failsafe mechanism.

It does change things functionality wise. Look at the tests, the output has changed with this pull request from invalid to valid. If that's not a bug fix, I don't know what is.

But --no-trap-after-noreturn didn't exist before, so there was no way to specify that from the command line. You created it, originally in this PR, and then the split-off PR in #67051. If this is a bugfix, it sounds like you are fixing a bug of your own making.

The reason I said this might count as an improvement was, in a hypothetical scenario when the default value for NoTrapAfterNoreturn changes, or someone else (if you didn't create that option) wants to add the same command line option, this overriding can guard us from those changes. But I think these scenarios are basically hypothetical and unlikely. So while I'm not opposed to these changes, I am confused by the motivation of this PR (together with #67051) and then more about the claim this is a bugfix.

What's your distinction criteria for "real contradiction" and "hints or suggestions"?

A real contradiction would be asking for two incompatible target options, like -exception-model=wasm -enable-emscripten-cxx-exceptions. TrapUnreachable is a much more wishy-washy request, it's frequently ignored across the codebase, and using it doesn't even guarantee that unreachable will produce a trap.

This is a case where the unreachable was optimized away or transformed to something else in opt, even before reaching the backend, so I don't think this counts as "we don't guarantee thatunreachable will produce a trap". We (the Wasm backend) doesn't even see the unreachable. (Of course, it is entirely possible that we optimize out some code blocks containing unreachable and don't end up producing a trap corresponding to that within our Wasm backend. Anyway what I'm saying is optimizing away unreachable doesn't conflict the command-line option.)

Also I'm having a hard time understanding your criteria between wishy-washy request and real contradiction still. To my understanding in both cases they are not compatible.

To put it plainly: the --trap-unreachable and --no-trap-after-noreturn options are not supported on the wasm backend, and they are silently ignored just like every other unsupported option. This is consistent with the behaviour both in the wasm backend, and across the codebase: 1, 2, 3.

Yeah I get that we haven't been warning about a similar case (--trap-unreachable) before. But I think they are more of what ended up happen, and not the firm intention not to warn for conflicted requests.

For example, that --trap-unreachable command line option was added much later (6d9f8c9) than when we set it to false (ffa143c). When first we set it to false in 2015, that option didn't exist so we had nothing to warn against. And someone who added that option in 2018 didn't go through all the code to check if there were any conflicting assignments.

sbc100 · 2023-09-27T19:29:15Z

But --no-trap-after-noreturn didn't exist before, so there was no way to specify that from the command line. You created it, originally in this PR, and then the split-off PR in #67051. If this is a bugfix, it sounds like you are fixing a bug of your own making.

I think perhaps the misunderstanding here is that NoTrapAfterNoreturn was usable/settable by compilers such as rust via the LLVM API, even though --no-trap-after-noreturn didn't exist in llvm itself?

So adding this option allows us to test something that wasn't testable via the command line, but was exposed via the API? Is that right?

llvm/test/CodeGen/WebAssembly/unreachable.ll

aheejin · 2023-09-27T19:46:29Z

But --no-trap-after-noreturn didn't exist before, so there was no way to specify that from the command line. You created it, originally in this PR, and then the split-off PR in #67051. If this is a bugfix, it sounds like you are fixing a bug of your own making.

I think perhaps the misunderstanding here is that NoTrapAfterNoreturn was usable/settable by compilers such as rust via the LLVM API, even though --no-trap-after-noreturn didn't exist in llvm itself?

So adding this option allows us to test something that wasn't testable via the command line, but was exposed via the API? Is that right?

Ah, so it was already possible to set NoTrapAfterNoreturn=0 via some API. I couldn't find one in the LLVM codebase that allowed it but I guess I don't know about Rust API part. Thanks for point that out, and sorry If I missed that. @majaha

majaha · 2023-09-27T21:29:22Z

Yeah you've got it, that's the main point. It can be set via the C++ API, and when I tried doing so in the Rust compiler I found it broke the WebAssembly backend. Sorry if I didn't make that clear enough earlier, I'm glad we're on the same page now.

aheejin · 2023-09-27T23:44:13Z

Yeah you've got it, that's the main point. It can be set via the C++ API, and when I tried doing so in the Rust compiler I found it broke the WebAssembly backend. Sorry if I didn't make that clear enough earlier, I'm glad we're on the same page now.

The other points I tried to say in #65876 (comment) stay the same though.

majaha · 2023-09-28T00:09:01Z

Yeah I get that we haven't been warning about a similar case (--trap-unreachable) before. But I think they are more of what ended up happen, and not the firm intention not to warn for conflicted requests.

For example, that --trap-unreachable command line option was added much later (6d9f8c9) than when we set it to false (ffa143c). When first we set it to false in 2015, that option didn't exist so we had nothing to warn against. And someone who added that option in 2018 didn't go through all the code to check if there were any conflicting assignments.

You might be right, but my feeling is that the behaviour of --trap-unreachable and --no-trap-after-noreturn should be consistent across all backends, and probably with other options too. I think that's a big enough change to warrant discussion and testing separately.

This change is consistent with the current behaviour and fixes a real issue, so I think it makes sense for this change to go in first, before we start to think about changing any existing behaviour.

aheejin · 2023-09-29T07:16:56Z

Yeah I get that we haven't been warning about a similar case (--trap-unreachable) before. But I think they are more of what ended up happen, and not the firm intention not to warn for conflicted requests.
For example, that --trap-unreachable command line option was added much later (6d9f8c9) than when we set it to false (ffa143c). When first we set it to false in 2015, that option didn't exist so we had nothing to warn against. And someone who added that option in 2018 didn't go through all the code to check if there were any conflicting assignments.

You might be right, but my feeling is that the behaviour of --trap-unreachable and --no-trap-after-noreturn should be consistent across all backends, and probably with other options too. I think that's a big enough change to warrant discussion and testing separately.

This change is consistent with the current behaviour and fixes a real issue, so I think it makes sense for this change to go in first, before we start to think about changing any existing behaviour.

OK fair enough. Sorry that this PR is taking so long time. I guess we can land this after we fix some tests / add some comments. I still think the test file needs more comments; I would like that the readers of the test file find it easy enough to figure out what the purpose of the tests in that file is, without referring to git blames and PR discussions.

majaha · 2023-10-02T00:04:53Z

That's a good point, I'll add some more explanatory comments.

Co-authored-by: Heejin Ahn <[email protected]>

llvm/test/CodeGen/WebAssembly/unreachable.ll

aheejin · 2023-10-05T05:50:44Z

llvm/test/CodeGen/WebAssembly/unreachable.ll

-  call void @ext_never_return()
-  ret void
-}
+}


No newline at the end

llvm/test/CodeGen/WebAssembly/unreachable.ll

Co-authored-by: Heejin Ahn <[email protected]>

aheejin

Thanks for your patience! Will merge this.

In the WebAssembly back end, the TrapUnreachable option is currently load-bearing for correctness, inserting wasm `unreachable` instructions where needed to create valid wasm. There is another option, NoTrapAfterNoreturn, that removes some of those traps and causes incorrect wasm to be emitted. This turns off `NoTrapAfterNoreturn` for the Wasm backend and adds new tests.

Some textual editing errors got through this pull request that was merged a few weeks ago: #65876 This patch clears up the unintentional duplicated line, and white-space at the end of the lines.

majaha added 3 commits September 9, 2023 21:48

Add no-trap-after-noreturn flag and wasm tests

7c43c80

Add the command line flag --no-trap-after-noreturn. Add and improve tests related to WebAssembly's unreachable instruction. Also fix various typos.

Ensure NoTrapAfterNoreturn is false for wasm

7e83469

Add peephole optimisation

c8ad813

majaha requested a review from a team as a code owner September 10, 2023 06:52

llvmbot added backend:WebAssembly mc Machine (object) code labels Sep 10, 2023

Fix broken lld wasm test

219c194

The new peephole optimisation removed an unreachable in this test.

llvmbot added lld lld:wasm labels Sep 10, 2023

aheejin reviewed Sep 11, 2023

View reviewed changes

llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp Outdated Show resolved Hide resolved

sbc100 reviewed Sep 11, 2023

View reviewed changes

llvm/lib/Target/WebAssembly/WebAssemblyPeephole.cpp Outdated Show resolved Hide resolved

majaha added 3 commits September 12, 2023 00:38

Revert "Fix broken lld wasm test"

91d5986

This reverts commit 219c194.

Revert "Add peephole optimisation"

d2b5701

This reverts commit c8ad813.

Run clang-format

05d0c17

aheejin reviewed Sep 11, 2023

View reviewed changes

llvm/lib/CodeGen/LLVMTargetMachine.cpp Outdated Show resolved Hide resolved

majaha mentioned this pull request Sep 12, 2023

Add Wasm peephole optimisation to remove dead code around unreachables #66062

Open

aheejin reviewed Sep 21, 2023

View reviewed changes

majaha mentioned this pull request Sep 21, 2023

Add command line option --no-trap-after-noreturn #67051

Merged

aheejin reviewed Sep 27, 2023

View reviewed changes

majaha and others added 3 commits October 2, 2023 01:07

Apply formatting and naming changes from code review

3cb13e5

Co-authored-by: Heejin Ahn <[email protected]>

Improve explanation in unreachable.ll test

0393836

Remove unintended "CHECK-NEXT" line

c210e9a

aheejin reviewed Oct 4, 2023

View reviewed changes

majaha added 2 commits October 4, 2023 22:30

Expand "unreach" -> "unreachable"

f0b5809

Adresss comments

b2d9080

aheejin reviewed Oct 5, 2023

View reviewed changes

llvm/test/CodeGen/WebAssembly/unreachable.ll Outdated Show resolved Hide resolved

llvm/test/CodeGen/WebAssembly/unreachable.ll Outdated Show resolved Hide resolved

majaha and others added 2 commits October 5, 2023 07:43

Apply suggestions from code review

02a5d64

Co-authored-by: Heejin Ahn <[email protected]>

Add newline

fdc6b54

aheejin approved these changes Oct 5, 2023

View reviewed changes

aheejin merged commit bd7ca98 into llvm:main Oct 5, 2023
3 checks passed

stepthomas mentioned this pull request Oct 10, 2023

AMDGPU stepthomas atomic csub no rtn forms ver2 stepthomas/llvm-project#1

Closed

majaha mentioned this pull request Oct 26, 2023

Cherry-pick NoTrapAfterNoreturn fix for wasm backend rust-lang/llvm-project#155

Closed

majaha deleted the wasmntanr branch October 26, 2023 14:30

majaha mentioned this pull request Oct 26, 2023

Fix comment in wasm unreachable test #70340

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure NoTrapAfterNoreturn is false for the wasm backend #65876

Ensure NoTrapAfterNoreturn is false for the wasm backend #65876

majaha commented Sep 10, 2023

majaha commented Sep 10, 2023

llvmbot commented Sep 11, 2023

aheejin left a comment •

edited

Loading

aheejin commented Sep 11, 2023

majaha commented Sep 11, 2023 •

edited

Loading

majaha commented Sep 12, 2023

aheejin commented Sep 12, 2023 •

edited

Loading

majaha commented Sep 12, 2023

majaha commented Sep 19, 2023

nikic commented Sep 19, 2023

majaha commented Sep 19, 2023

aheejin left a comment

majaha commented Sep 21, 2023

majaha commented Sep 21, 2023

aheejin commented Sep 27, 2023 •

edited

Loading

sbc100 commented Sep 27, 2023 •

edited

Loading

aheejin commented Sep 27, 2023

majaha commented Sep 27, 2023

aheejin commented Sep 27, 2023

majaha commented Sep 28, 2023

aheejin commented Sep 29, 2023

majaha commented Oct 2, 2023

aheejin Oct 5, 2023

aheejin left a comment

Ensure NoTrapAfterNoreturn is false for the wasm backend #65876

Ensure NoTrapAfterNoreturn is false for the wasm backend #65876

Conversation

majaha commented Sep 10, 2023

majaha commented Sep 10, 2023

llvmbot commented Sep 11, 2023

aheejin left a comment • edited Loading

Choose a reason for hiding this comment

aheejin commented Sep 11, 2023

majaha commented Sep 11, 2023 • edited Loading

majaha commented Sep 12, 2023

aheejin commented Sep 12, 2023 • edited Loading

majaha commented Sep 12, 2023

majaha commented Sep 19, 2023

nikic commented Sep 19, 2023

majaha commented Sep 19, 2023

aheejin left a comment

Choose a reason for hiding this comment

majaha commented Sep 21, 2023

majaha commented Sep 21, 2023

aheejin commented Sep 27, 2023 • edited Loading

sbc100 commented Sep 27, 2023 • edited Loading

aheejin commented Sep 27, 2023

majaha commented Sep 27, 2023

aheejin commented Sep 27, 2023

majaha commented Sep 28, 2023

aheejin commented Sep 29, 2023

majaha commented Oct 2, 2023

aheejin Oct 5, 2023

Choose a reason for hiding this comment

aheejin left a comment

Choose a reason for hiding this comment

aheejin left a comment •

edited

Loading

majaha commented Sep 11, 2023 •

edited

Loading

aheejin commented Sep 12, 2023 •

edited

Loading

aheejin commented Sep 27, 2023 •

edited

Loading

sbc100 commented Sep 27, 2023 •

edited

Loading