Less basic fuzzer #21246

ProkopRandacek · 2024-08-29T22:38:46Z

This PR improves the current fuzzer implementation. It is not yet competitive with other fuzzers, but it works and is ready for a first round of review (which I assume is better than dropping a huge PR later). I plan to keep improving the fuzzer in the near future.

The fuzzer stores a corpus of (once) interesting inputs inside a pair of memory-mapped files, ready to be shared with other fuzzing processes (there are none currently, since I am not sure where the best place to spawn them is).

It randomly picks an input, mutates it, checks whether it hits any new features in the instrumented program, and if so, adds it to the corpus. The mutations are taken from LLVM's libFuzzer.
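The "hits any new features" check can be sketched roughly as follows. This is only an illustration, not the code in this PR: `FeatureSet`, `slot_count`, and `merge` are hypothetical names, and the fixed-size table is an assumption made to keep the example short.

```zig
const std = @import("std");

/// Hypothetical table size; feature ids are folded into this many slots.
const slot_count = 1024;

const FeatureSet = struct {
    seen: [slot_count]bool = [_]bool{false} ** slot_count,

    /// Marks every feature in `hit` as seen.
    /// Returns true if at least one of them had not been seen before,
    /// which is the signal to add the mutated input to the corpus.
    fn merge(self: *FeatureSet, hit: []const u32) bool {
        var found_new = false;
        for (hit) |feature| {
            const slot: usize = feature % slot_count;
            if (!self.seen[slot]) {
                self.seen[slot] = true;
                found_new = true;
            }
        }
        return found_new;
    }
};
```

In a loop of this shape, a mutated input would be kept only when `merge` returns true for the features its run produced.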

Next, I plan to:

track feature rarity
improve input selection based on rare features
implement more mutations
track values from cmp instrumentation and use in mutations
use in-place mutation
spawn more threads
improve feature capture
improve how crashing inputs are reported

closes #20814
closes #20803
wip on #20804

andrewrk · 2024-08-29T22:51:15Z

Exciting! This is great timing. I was about to embark on a similar branch, so instead I will cooperate with your efforts here.

Can you share your testing methodology?

lib/fuzzer/main.zig

lib/fuzzer/dump_corpus.zig

andrewrk · 2024-08-29T23:32:06Z

lib/fuzzer/InputPoolPosix.zig

+/// maximum 2GiB of input data should be enough. 32th bit is delete flag
+pub const Index = u31;


Suggested change

-/// maximum 2GiB of input data should be enough. 32th bit is delete flag
-pub const Index = u31;
+pub const Flags = packed struct (u32) {
+    index: Index,
+    delete: bool,
+    pub const Index = enum(u31) {
+        /// This tag is used below in the line with
+        /// `MemoryMappedList(u8).init(buffer_file, std.math.maxInt(Index))`
+        /// but I don't see what the special value means. Replace these doc comments
+        /// and field name with a more descriptive one.
+        /// Or, if you did not mean for this to be a special value, delete this field and
+        /// initialize `index` with `undefined`.
+        special_value = std.math.maxInt(u31),
+        _,
+    };
+    pub const empty: Flags = .{
+        .index = .special_value,
+        .delete = true,
+    };
+};

I promise you this type safety will come in handy.

Then you can elsewhere do this:

- const buffer = MemoryMappedList(u8).init(buffer_file, std.math.maxInt(Index));
+ const buffer = MemoryMappedList(u8).init(buffer_file, Flags.empty);

Instead of deleteMask, use flags.delete etc.

ProkopRandacek · 2024-08-30

> special_value = std.math.maxInt(u31),

That is not a special value! :D I actually want to allocate 2GiB of virtual memory. It simplifies the problem of reallocating our mmapped file when some other process has appended to the file.

This way, all a process needs to do is read len (at the start of the meta file) and access the already allocated buffer. The process that incremented len has already called ftruncate on the file, so all other processes can access their allocated buffer without doing anything at all.

Otherwise, other processes would need logic to store the allocation size and mremap the buffer when it grows too much, and so on.
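The reader side of this scheme might look roughly like the sketch below. It is not taken from the PR: `published` is a hypothetical helper, an ordinary in-process buffer stands in for the 2GiB mapping, the mmap and ftruncate calls are left out, and the lowercase atomic ordering names assume Zig 0.12 or later.

```zig
const std = @import("std");

/// Returns the part of the reservation that writers have published so far.
/// `reserved` stands in for the full 2GiB mapping and `len` for the counter
/// stored at the start of the meta file (both are assumptions of this sketch).
fn published(reserved: []const u8, len: *const u32) []const u8 {
    // Acquire pairs with a release store by the writer, so bytes written
    // before the writer bumped `len` are visible after this load.
    const n = @atomicLoad(u32, len, .acquire);
    return reserved[0..n];
}
```

Because the mapping never moves or grows from the reader's point of view, the reader needs no remapping logic; it only re-reads `len`.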

> Instead of deleteMask, use flags.delete etc

+1
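To illustrate what replacing a hand-written mask with a packed struct field buys, here is a minimal sketch. The names `Entry`, `delete_mask`, `decode`, and `encode` are hypothetical and the layout is only assumed to match the meta file described above; the single-argument `@bitCast` form assumes Zig 0.11 or later.

```zig
const std = @import("std");

/// Hypothetical meta-file entry: 31 bits of index plus a delete bit,
/// stored in a single u32. The first field occupies the least
/// significant bits, so `delete` ends up in bit 31.
const Entry = packed struct(u32) {
    index: u31,
    delete: bool,
};

/// The hand-written mask that the `delete` field replaces.
const delete_mask: u32 = 1 << 31;

/// Reinterprets a raw u32 from the meta file as an Entry.
fn decode(raw: u32) Entry {
    return @bitCast(raw);
}

/// Reinterprets an Entry as the raw u32 to store.
fn encode(e: Entry) u32 {
    return @bitCast(e);
}
```

With this shape, `decode(raw).delete` replaces `raw & delete_mask != 0`, and the compiler rejects an index that does not fit in 31 bits.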

andrewrk · 2024-08-29T23:32:48Z

lib/fuzzer/InputPoolPosix.zig

+    ip.meta.deinit();
+}
+
+// Primitive spin lock implementation. There is basically no contention on it.


Use std.Thread.Mutex instead of implementing a spin lock.

ProkopRandacek · 2024-08-30

The problem with that is that std.Thread.Mutex calls futex_wait with PRIVATE_FLAG, which tells the kernel that the futex is not shared with other processes. That is probably not true here, but it depends on how we are going to implement parallel fuzzing. I wanted to ask you for input on that.

I was thinking that the build system might spawn multiple separate fuzz processes. In that case this lock would be inter-process and PRIVATE_FLAG would be problematic. It would not be a problem if the fuzzer itself just spawned a couple of threads in a thread pool.

I don't know what the policy is on spawning threads manually vs. letting the build system handle parallelism.

andrewrk · 2024-08-30

I think it will be better to let the build system handle parallelism because:

I didn't get std.Thread.Pool: process tree cooperation #20274 working in my most recent effort and I lack the motivation to work on that problem currently because it involves sad workarounds for a disappointing lack of primitives offered by Unix-like operating systems. Since the thread pool is in the build runner process, the fuzzer process does not know how many threads it can spawn.

Eventually, we want the ability to coordinate multiple build runners fuzzing across multiple different machines. This means coordinating seeds and inputs at that higher abstraction layer.

If we need to add an inter-process mutex to the standard library, that's fine, let's do it.

However there is also the lock-free approach. I haven't closely inspected what the mutex in this PR is being used for yet. Maybe locking could be entirely avoided.

In any case, I am vetoing the use of a spinlock.

lib/fuzzer/util.zig

andrewrk · 2024-08-30T00:18:29Z

When I ran this locally, it seemed to find a test failure rather quickly:

[nix-shell:~/dev/zig/build-release]$ stage3/bin/zig build test-std  -Dtest-filter="tokenizer" -Dskip-release-small -Dskip-release-fast -Dskip-debug -Dskip-non-native -Dskip-libc -Dskip-single-threaded --fuzz  --port 41099
info: web interface listening at http://127.0.0.1:41099/
test-std
└─ run test std-x86_64-linux.6.10.2...6.10.2-gnu.2.39-znver4-ReleaseSafe failure
failed with error.TestUnexpectedResult
error: the following command exited with error code 1:
/home/andy/dev/zig/.zig-cache/o/29911edb78c1004a6c983079ca845abb/test --seed=0x4ca436b6 --cache-dir=/home/andy/dev/zig/.zig-cache --listen=- 
error: all fuzz workers crashed
info: source changes detected; rebuilt wasm component
^C

Total Runs: 57649
Unique Runs: 64 (0.1%)
Coverage: 451 / 11895 (3.8%)

However, when I used the dump_corpus tool to extract strings

--- a/lib/fuzzer/dump_corpus.zig
+++ b/lib/fuzzer/dump_corpus.zig
@@ -39,6 +39,6 @@ pub fn main() void {
         // volatile was trying to achieve in the first place
         const str2: []const u8 = @volatileCast(str);
 
-        std.log.info("\"{s}\"", .{str2});
+        std.log.info("\"{}\"", .{std.zig.fmtEscapes(str2)});
     }
 }

--- a/lib/std/zig/tokenizer.zig
+++ b/lib/std/zig/tokenizer.zig
@@ -1841,10 +1841,7 @@ fn testTokenize(source: [:0]const u8, expected_token_tags: []const Token.Tag) !v
     try std.testing.expectEqual(source.len, last_token.loc.end);
 }
 
-test "fuzzable properties upheld" {
-    const source = std.testing.fuzzInput(.{});
-    const source0 = try std.testing.allocator.dupeZ(u8, source);
-    defer std.testing.allocator.free(source0);
+fn testProperties(source0: [:0]const u8) anyerror!void {
     var tokenizer = Tokenizer.init(source0);
     var tokenization_failed = false;
     while (true) {
@@ -1885,3 +1882,339 @@ test "fuzzable properties upheld" {
         }
     };
 }
+
+test "fuzzable properties upheld" {
+    const source = std.testing.fuzzInput(.{});
+    const source0 = try std.testing.allocator.dupeZ(u8, source);
+    defer std.testing.allocator.free(source0);
+    return testProperties(source0);
+}
+
+test "fuzzable properties upheld - corpus" {
+    for (corpus) |one| {
+        try testProperties(one);
+    }
+}
+
+const corpus = [_][:0]const u8{

the crashing input didn't seem to be present:

[nix-shell:~/dev/zig/build-release]$ stage3/bin/zig test ../lib/std/std.zig --test-filter "fuzzable properties upheld - corpus"
All 61 tests passed.

ProkopRandacek · 2024-08-30T20:54:26Z

> the crashing input didn't seem to be present

haha that is because it just calls exit when a test fails. I totally forgot to report found bad inputs :D

ProkopRandacek · 2024-08-30T20:59:48Z

> Can you share your testing methodology?

I am ashamed to admit that my testing methodology was to just run it on my toy bencode parser and see how quickly the coverage number goes up. This PR is more focused on setting up a working fuzzer-shaped project and less focused on tuning the fuzzer to get better results.

andrewrk · 2024-08-30T21:52:35Z

> haha that is because it just calls exit when a test fails. I totally forgot to report found bad inputs :D

Ah yes, that is intentional, because another way for it to find a bad input is to segfault, abort, or crash in another unexpected manner. So that's why I thought it should write the input before running the user's test code. This way, a test failure and a crashing process look the same to the fuzzer.

> I am ashamed to admit that my testing methodology was to just run it on my toy bencode parser and see how quickly the coverage number goes up.

No shame in this! We're both starting with toy examples and working our way up.

ProkopRandacek · 2024-08-31T10:04:08Z

> > haha that is because it just calls exit when a test fails. I totally forgot to report found bad inputs :D
>
> Ah yes, that is intentional, because another way for it to find a bad input is to segfault, abort, or crash in another unexpected manner. So that's why I thought it should write the input before running the user's test code. This way, a test failure and a crashing process look the same to the fuzzer.

Writing every input into the corpus before evaluating it doesn't work, since deleting a string from the corpus is expensive in the current design and requires exclusive access to the entire corpus.

It should probably write the bad input from a panic handler (not into the corpus, just into some file) and somehow recover and keep fuzzing. The only way I can see to recover from a panic handler back into the fuzzer is longjmp, but that might be controversial. Maybe we can just start the fuzzer again. Crashing inputs should be rare.

ProkopRandacek · 2024-09-03T18:31:15Z

Thank you for the feedback! I appreciate it. I am currently unfortunately overwhelmed with other work so I'll get back to this in approximately 3 weeks. I hope that is OK. I look forward to getting this merged.

andrewrk · 2024-09-18T17:46:35Z

Welcome back. I have some local changes I expect to push today and you'll need to rebase on them. I did things a little differently than you.

ProkopRandacek · 2024-09-30T06:12:43Z

By local changes, do you mean that the fuzzer now gets a function pointer instead of being called from the test, or is there something more? I like this change, by the way.

I'm asking because I rebased on top of master and didn't get any conflicts.

andrewrk requested changes Aug 29, 2024

ProkopRandacek force-pushed the prokop-fuzzer branch 2 times, most recently from b1f3351 to a32bf7c Compare September 18, 2024 11:43

ProkopRandacek added 16 commits September 30, 2024 08:06

move web stuff into separate folder

ba71b94

input pool

bc8f741

feature capture

a3b902e

move things around

47d2075

mutate

03ddf7d

Switch on union tag instead of integer

16e978d

clean up existing fuzzer

66ff598

comparison of analysis

2232277

make fuzz web stats sticky

e746181

handle mutating empty string

146d60a

document fuzzInput

191a613

features are u32 now

d0067b4

fix compile erorrs

131fdf3

non transparent stats for light webui

87ff252

first working version

21c0b77

logging

d5810e2

ProkopRandacek and others added 25 commits September 30, 2024 08:06

input pool no longer global

5dd980d

simplify feature capture

c6fde83

Rewrite mutate without input grow limitation

c947995

single test input for all mutation tests

0f03ce9

corpus in shared memory

b0ba95b

add coverage suffix to coverage file

c32e7d2

dump corpus tool

6cfff79

dont hardcode test input data

b80cbb8

remove lowest_stack, cleanup web

a0c0b9c

deduplicate features, walk the corpus on boot

f9f3a4c

extract misc stuff to util.zig

64b8aac

merge the two MemoryMappedList implementations

3bf40d8

write some docs

ef93b1b

restore input pool repack implementation

d29fc59

expand fuzzInput documentation

48846da

Don't repack input pool at all

57ad4fb

Increase insert repeated byte upper limit

cceca71

Rationale being that max u64 in base 10 is 20 chars long so we want a chance to insert ascii number over the u64 range and crash parsers that don't expect parsing an ascii number can overflow usize.

Don't re-add initial corpus every time we start

c9603bb

seed with current time

d731d8c

Revert "seed with current time"

2a3a1a3

This reverts commit 12e25db.

better wording of argument for @volatileCast

69320ef

Co-authored-by: Andrew Kelley <andrew@ziglang.org>

type safe meta file access in InputPoolPosix.zig

de9ed7b

remove util.zig and politely call msync

504a657

escape corpus content when printing

217772c

write a little readme

bc18acc

ProkopRandacek force-pushed the prokop-fuzzer branch from a32bf7c to bc18acc Compare September 30, 2024 06:51
