unreachable code paths need to be excluded from having coverage instrumentation #20992

andrewrk · 2024-08-08T03:27:05Z

Looking at the "if tower" example:

test "if tower" {
    const input_bytes = std.testing.fuzzInput(.{});
    if (input_bytes.len < 10) return;
    std.time.sleep(std.time.ns_per_ms / 3); // otherwise it finds the bug too fast!
    if (input_bytes[0] == 'A') {
        if (input_bytes[1] == 'l') {
            if (input_bytes[2] == 'e') {
                if (input_bytes[3] == 'x') {
                    if (input_bytes[4] == 'a') {
                        if (input_bytes[5] == 'n') {
                            if (input_bytes[6] == 'd') {
                                if (input_bytes[7] == 'r') {
                                    if (input_bytes[8] == 'a') {
                                        @panic("found bug");
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}

We see edges for every safety check:

We can access the raw pointer of the slices to escape the safety check, demonstrating that those edges disappear:

test "if tower" {
    const input_bytes = std.testing.fuzzInput(.{});
    if (input_bytes.len < 10) return;
    std.time.sleep(std.time.ns_per_ms / 3); // otherwise it finds the bug too fast!
    if (input_bytes.ptr[0] == 'A') {
        if (input_bytes.ptr[1] == 'l') {
            if (input_bytes.ptr[2] == 'e') {
                if (input_bytes.ptr[3] == 'x') {
                    if (input_bytes.ptr[4] == 'a') {
                        if (input_bytes.ptr[5] == 'n') {
                            if (input_bytes.ptr[6] == 'd') {
                                if (input_bytes.ptr[7] == 'r') {
                                    if (input_bytes.ptr[8] == 'a') {
                                        @panic("found bug");
                                    }
                                }
                            }
                        }
                    }
                }
            }
        }
    }
}

Those safety check edges are not interesting for code coverage because they are unreachable. The fuzzer still wants to know about any comparisons used which may have led to those unreachable branches, but we are not expecting to have code coverage for unreachable paths!

Fortunately, LLVM has a !nosanitize metadata node. Here is an example of using it on a branch:

  br i1 %19, label %cont, label %trap, !dbg !39, !nosanitize !28

...
!22 = distinct !DISubprogram(name: "main", scope: !2, file: !2, line: 2, type: !23, scopeLine: 2, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition, unit: !12, retainedNodes: !28)
...
!28 = !{}

So, the cmp should still have the instrumentation because it provides the cmp operands to the fuzzer, and the fuzzer is trying to find inputs that cause the unreachable path to be reached, but there should not be a PC edge annotation on the branch, because we'll know it got hit when the process panics and crashes!

The text was updated successfully, but these errors were encountered:

work in progress commit that adds !nosanitize to all instructions. output is wrong: ``` opt: /home/andy/dev/llvm-project-18/llvm/include/llvm/AsmParser/LLParser.h:93: bool llvm::ValID::operator<(const llvm::ValID&) const: Assertion `Kind == RHS.Kind && "Comparing ValIDs of different kinds"' failed. () () ``` working on #20992

see #20992 Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com>

andrewrk added enhancement Solving this issue will likely involve adding new logic or components to the codebase. backend-llvm The LLVM backend outputs an LLVM IR Module. fuzzing labels Aug 8, 2024

andrewrk added this to the 0.14.0 milestone Aug 8, 2024

This was referenced Aug 8, 2024

introduce a fuzz testing web interface #20958

Merged

debug info audit - many virtual memory addresses have strange source locations #20989

Open

support code coverage when testing #352

Open

fix several debug info bugs #21075

Merged

andrewrk mentioned this issue Aug 19, 2024

llvm.Builder: add !nosanitize API #21141

Closed

andrewrk added a commit that referenced this issue Aug 21, 2024

llvm.Builder: add !nosanitize API

26d5c27

see #20992 Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com>

andrewrk added a commit that referenced this issue Aug 24, 2024

llvm.Builder: add !nosanitize API

ec2ea07

see #20992 Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com>

andrewrk added a commit that referenced this issue Aug 27, 2024

llvm.Builder: add !nosanitize API

e9a3ec7

see #20992 Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com>

andrewrk added a commit that referenced this issue Aug 29, 2024

llvm.Builder: add !nosanitize API

df52073

see #20992 Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com>

andrewrk mentioned this issue Aug 29, 2024

exclude unreachable code paths from having coverage instrumentation #21236

Merged

andrewrk closed this as completed in #21236 Aug 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unreachable code paths need to be excluded from having coverage instrumentation #20992

unreachable code paths need to be excluded from having coverage instrumentation #20992

andrewrk commented Aug 8, 2024

unreachable code paths need to be excluded from having coverage instrumentation #20992

unreachable code paths need to be excluded from having coverage instrumentation #20992

Comments

andrewrk commented Aug 8, 2024