[red-knot] Add fuzzer to catch panics for invalid syntax #14678

dhruvmanila · 2024-11-29T13:32:20Z

Summary

This PR adds a fuzzer harness for red knot that runs the type checker on source code that contains invalid syntax.

This PR also updates the init-fuzzer.sh script to enlarge the corpus so that it:

Includes various crates that contain Python source code
Uses the CPython 3.13 source code

It also removes any non-Python files from the final corpus so that, when the fuzzer minifies the corpus, it doesn't produce files that contain only documentation content, which is just noise.
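As a rough illustration of the clean-up step, the sketch below builds a throwaway directory (the file names are made up for the example), previews what would be removed with `-print`, and only then runs the same expression with `-delete`. Apart from the `find` expression itself, nothing here is taken from init-fuzzer.sh.

```shell
# Hypothetical scratch corpus; none of these paths come from the PR.
corpus=$(mktemp -d)
mkdir -p "$corpus/cpython/Lib" "$corpus/cpython/Doc"
touch "$corpus/cpython/Lib/ast.py" \
      "$corpus/cpython/Doc/index.rst" \
      "$corpus/cpython/LICENSE"

# Dry run: list the files the clean-up would remove, without deleting them.
to_delete=$(cd "$corpus" && find . -type f -not -name "*.py" -print | sort)
printf 'would delete:\n%s\n' "$to_delete"

# Same expression, now actually deleting.
(cd "$corpus" && find . -type f -not -name "*.py" -delete)

# List what is left; only Python sources should remain.
remaining=$(cd "$corpus" && find . -type f | sort)
printf 'remaining:\n%s\n' "$remaining"

# Remove the scratch directory again.
rm -rf "$corpus"
```

Because the expression is restricted to `-type f`, the now-empty directories are left in place; only regular files are deleted.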

Test Plan

Run ./fuzz/init-fuzzer.sh and say no to the large dataset.
Run the fuzzer with cargo +nightly fuzz run red_knot_check_invalid_syntax -- -timeout=5

github-actions · 2024-12-03T06:29:19Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

sharkdp · 2024-12-03T13:20:16Z

fuzz/init-fuzzer.sh

+    cp -r "../../../crates/ruff_python_parser/resources" "ruff_python_parser"
+
+    # Delete all non-Python files
+    find . -type f -not -name "*.py" -delete


Should we also keep stub files?

Using

Suggested change

-    find . -type f -not -name "*.py" -delete
+    find . -type f -not \( -name "*.py" -or -name "*.pyi" \) -delete

or

Suggested change

-    find . -type f -not -name "*.py" -delete
+    find . -type f -not -regex '.*\.pyi?' -delete

dhruvmanila · 2024-12-04

We could, but in the end we would still use a .py file even if the content comes from a stub file. The reason is that the fuzzer only gives us the raw bytes of the source code that it generates.

Hmm, maybe we should run red knot once on a .py file and then on a .pyi file with the same content, as we do in the corpus tests.
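To compare the two `find` expressions suggested in this thread, here is a small sketch that runs both in list-only mode against a throwaway directory. The file names are invented for the example, `-o` is used as the POSIX spelling of `-or`, and the `-regex` form assumes GNU find, where the pattern has to match the whole path.

```shell
# Hypothetical scratch directory with one file of each kind.
tmp=$(mktemp -d)
mkdir -p "$tmp/pkg"
touch "$tmp/pkg/mod.py" "$tmp/pkg/mod.pyi" "$tmp/pkg/mod.pyc" "$tmp/pkg/README.md"

# Variant 1: keep *.py and *.pyi by name. The escaped parentheses group the
# two -name tests so that -not applies to both of them.
by_name=$(cd "$tmp" && find . -type f -not \( -name "*.py" -o -name "*.pyi" \) | sort)

# Variant 2: keep paths ending in .py or .pyi via a regular expression
# (GNU find assumed; the pattern is matched against the entire path).
by_regex=$(cd "$tmp" && find . -type f -not -regex '.*\.pyi?' | sort)

printf 'by name:\n%s\n' "$by_name"
printf 'by regex:\n%s\n' "$by_regex"

# Remove the scratch directory again.
rm -rf "$tmp"
```

Both variants should list the same two files (README.md and mod.pyc), so either one could be followed by `-delete` to drop everything except Python sources and stubs.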

fuzz/fuzz_targets/red_knot_check_invalid_syntax.rs

* main:
  [red-knot] Test: Hashable/Sized => A/B (#14769)
  [`flake8-type-checking`] Expands TC006 docs to better explain itself (#14749)
  [`pycodestyle`] Handle f-strings properly for `invalid-escape-sequence (W605)` (#14748)
  [red-knot] Add fuzzer to catch panics for invalid syntax (#14678)
  Check `AIR001` from builtin or providers `operators` module (#14631)
  [airflow]: extend removed names (AIR302) (#14734)

dhruvmanila added the red-knot Multi-file analysis & type inference label Nov 29, 2024

dhruvmanila mentioned this pull request Dec 2, 2024

[red-knot] support invalid syntax without panics #13778

Closed

dhruvmanila force-pushed the dhruv/red-knot-fuzzer branch from aa40db7 to 94698f8 Compare December 3, 2024 06:09

[red-knot] Add fuzzer to catch panics for invalid syntax

4df8284

dhruvmanila force-pushed the dhruv/red-knot-fuzzer branch from 94698f8 to 4df8284 Compare December 3, 2024 06:14

Run fuzz build when code changes

929f14f

dhruvmanila marked this pull request as ready for review December 3, 2024 11:02

dhruvmanila requested review from MichaReiser and sharkdp December 3, 2024 11:58

sharkdp reviewed Dec 3, 2024


MichaReiser approved these changes Dec 3, 2024


fuzz/fuzz_targets/red_knot_check_invalid_syntax.rs (outdated, resolved)

dhruvmanila added 2 commits December 4, 2024 14:15

Cache the salsa database

3ee87d3

Remove file from the system

2710657

dhruvmanila merged commit 1685d95 into main Dec 4, 2024
21 checks passed

dhruvmanila deleted the dhruv/red-knot-fuzzer branch December 4, 2024 09:07
