Foundations of location-sensitive polonius #134268

lqd · 2024-12-13T16:24:18Z

I'd like to land the prototype I'm describing in the [polonius project goal](rust-lang/rust-project-goals#118). It is still incomplete and naive and terrible, but it's working "well enough" to consider landing.

I'd also like to make review easier by not opening a huge PR, and instead opening a couple of small-ish ones (the +/- line change summary of this PR looks big, but >80% is moving datalog to a single place).

This PR starts laying the foundation for that work:

- it refactors and collects 99% of the old datalog fact gen, which was spread around everywhere, into a single dedicated module. It's still present at 3 small places (one of which we should revert anyways) that are kinda deep within localized components and are not as easily extractable into the rest of fact gen, so it's fine for now.
- starts introducing the localized constraints, the building blocks of the naive way of implementing the location-sensitive analysis in-tree, which is roughly sketched out in https://smallcultfollowing.com/babysteps/blog/2023/09/22/polonius-part-1/ and https://smallcultfollowing.com/babysteps/blog/2023/09/29/polonius-part-2/ but with a different vibe than per-point environments described in these posts, just `r1@p: r2@q` constraints.
- sets up the skeleton of generating these localized constraints: converting NLL typeck constraints, and creating liveness constraints
- introduces the polonius dual to NLL MIR to help development and debugging. It doesn't do much currently but is a way to see these localized constraints: it's an NLL MIR dump + a dumb listing of the constraints, that can be dumped with `-Zdump-mir=polonius -Zpolonius=next`. Its current state is not intended to be a long-term thing, just for testing purposes -- I will replace its contents in the future with a different approach (an HTML+js file where we can more easily explore/filter/trace these constraints and loan reachability, have mermaid graphs of the usual graphviz dumps, etc).
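
To make the `r1@p: r2@q` shape concrete, here is a minimal sketch of what a localized constraint and the two conversions listed above could look like. It is an illustration under assumptions, not the code in this PR: `Region`, `Point`, `LocalizedConstraint`, `TypeckConstraint`, `localize_typeck` and `localize_liveness` are hypothetical names, and the liveness rule shown (a region live at the successor point is linked to itself across the CFG edge) is only one plausible reading of "creating liveness constraints".

```rust
// Hypothetical stand-ins for the compiler's region and CFG point indices.
type Region = u32;
type Point = u32;

/// A localized outlives constraint `source@from: target@to`.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct LocalizedConstraint {
    source: Region,
    from: Point,
    target: Region,
    to: Point,
}

/// Assumed shape of a typeck constraint `sup: sub` that holds at one point.
struct TypeckConstraint {
    sup: Region,
    sub: Region,
    at: Point,
}

/// Typeck constraints stay within their point: `sup@at: sub@at`.
fn localize_typeck(constraints: &[TypeckConstraint]) -> Vec<LocalizedConstraint> {
    constraints
        .iter()
        .map(|c| LocalizedConstraint { source: c.sup, from: c.at, target: c.sub, to: c.at })
        .collect()
}

/// Assumed liveness rule: for a CFG edge `p -> q`, every region live at `q`
/// yields `r@p: r@q`, linking a region to itself across points.
fn localize_liveness(
    cfg_edges: &[(Point, Point)],
    live_regions_at: impl Fn(Point) -> Vec<Region>,
) -> Vec<LocalizedConstraint> {
    let mut out = Vec::new();
    for &(p, q) in cfg_edges {
        for r in live_regions_at(q) {
            out.push(LocalizedConstraint { source: r, from: p, target: r, to: q });
        }
    }
    out
}
```

In this sketch, typeck constraints only relate different regions at the same point, while liveness constraints only relate the same region at different points; together they form the edges of a graph over `(region, point)` nodes.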

I've started documenting the approach in this PR, I'll add more in the future. It's quite simple, and should be very clear when more constraints are introduced anyways.

r? @matthewjasper

Best reviewed per commit so that the datalog move is less bothersome to read, but if you'd prefer we separate that into a different PR, I can do that (and michael has offered to review these more mechanical changes if it'd help).

lqd · 2024-12-13T16:25:31Z

This should all be gated away and deactivated by default, but let's check: @bors try @rust-timer queue

bors · 2024-12-13T16:26:42Z

⌛ Trying commit 1331ccb with merge 0b8e535...

bors · 2024-12-13T18:10:10Z

☀️ Try build successful - checks-actions
Build commit: 0b8e535 (0b8e535a30719ca2ebdd3c9ebb1ea3f99530bd26)

rust-timer · 2024-12-13T19:25:57Z

Finished benchmarking commit (0b8e535): comparison URL.

Overall result: no relevant changes - no action needed

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

@bors rollup=never
@rustbot label: -S-waiting-on-perf -perf-regression

Instruction count

This benchmark run did not return any relevant results for this metric.

Max RSS (memory usage)

Results (primary 4.6%, secondary 2.7%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 4.6% | [4.6%, 4.6%] | 1 |
| Regressions ❌ (secondary) | 2.7% | [2.7%, 2.7%] | 1 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | 4.6% | [4.6%, 4.6%] | 1 |

Cycles

This benchmark run did not return any relevant results for this metric.

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 770.238s -> 768.851s (-0.18%)
Artifact size: 330.40 MiB -> 330.38 MiB (-0.00%)

jackh726 · 2024-12-14T17:47:13Z

Started reviewing this, but need to step away. Thoughts:

First, do you have a branch with all the changes you've made so far? Could be good to link it if so for context.

Second, this is still a "big" PR, definitely worth splitting into a couple, I think.

That being said, reviewed the first 2 commits and r+ for me on them (if you want to split them out). I'll keep going through the commits as I get time.

lqd · 2024-12-14T18:05:59Z

> First, do you have a branch with all the changes you've made so far? Could be good to link it if so for context.

Yes and no, I have a messy older incomplete one and a bunch of local changes because I'm still working on it, so I'm basically cleaning them up as I go to open PRs.

> Second, this is still a "big" PR, definitely worth splitting into a couple, I think.

Ok. I'll split the first 2 commits out now, it should simplify this PR a bit. I can split the rest of the old polonius' fact gen later if you or michael want to look at it.

(I also rebased to fix conflicts)

Rollup merge of rust-lang#134315 - lqd:polonius-next-episode-1, r=jackh726

A couple of polonius fact generation cleanups

This PR is extracted from rust-lang#134268 for easier review and contains its first two commits. They have already been reviewed by `@jackh726`.

r? `@jackh726`

jackh726

Have reviewed first 8 commits (through "improve consistency..."). ~~A small nit, but otherwise~~ r=me if you want to split them out.

compiler/rustc_borrowck/src/polonius/legacy/mod.rs

jackh726

Cursory review of commits 9 onward.

Overall they look fine, but didn't think too deeply so not a full approval.

compiler/rustc_borrowck/src/polonius/mod.rs

compiler/rustc_borrowck/src/nll.rs

compiler/rustc_borrowck/src/polonius/mod.rs

…kh726 An octuple of polonius fact generation cleanups This PR is extracted from rust-lang#134268 for easier review and contains its first 8 commits. They have already been reviewed by `@jackh726` over there. r? `@jackh726`

jackh726 · 2024-12-17T15:07:12Z

I will come back to this for approval of the last few commits once the PR with the first 8 commits lands.


lqd · 2024-12-20T10:12:43Z

I rebased after #134378 landed.

jackh726 · 2024-12-21T16:47:37Z

r? jackh726

@bors r+

bors · 2024-12-21T16:47:39Z

📌 Commit aeb3d10 has been approved by jackh726

It is now in the queue for this repository.

bors · 2024-12-21T21:15:34Z

⌛ Testing commit aeb3d10 with merge 426d173...

bors · 2024-12-22T00:00:39Z

☀️ Test successful - checks-actions
Approved by: jackh726
Pushing 426d173 to master...

rust-timer · 2024-12-22T01:16:48Z

Finished benchmarking commit (426d173): comparison URL.

Overall result: ❌ regressions - no action needed

@rustbot label: -perf-regression

Instruction count

This is the most reliable metric that we have; it was used to determine the overall result at the top of this comment. However, even this metric can sometimes exhibit noise.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | 0.2% | [0.2%, 0.2%] | 1 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | - | - | 0 |

Max RSS (memory usage)

Results (primary 2.2%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 2.2% | [2.2%, 2.2%] | 1 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | 2.2% | [2.2%, 2.2%] | 1 |

Cycles

Results (primary -4.4%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | -4.4% | [-6.7%, -2.0%] | 2 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | -4.4% | [-6.7%, -2.0%] | 2 |

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 760.428s -> 762.028s (0.21%)
Artifact size: 330.62 MiB -> 330.62 MiB (0.00%)

Rollup merge of rust-lang#135290 - lqd:polonius-next-episode-8, r=jackh726

Encode constraints that hold at all points as logical edges in location-sensitive polonius

Currently, with the full setup in rust-lang#134980 (though it dates from rust-lang#134268), the polonius location-sensitive analysis converts `Locations::All` typeck constraints into edges at all points in the CFG. This was temporary. There's a FIXME about that already, and this PR implements it: we now use the constraints that hold at all points during traversal instead of eagerly materializing them as physical edges.

Another easy one `@jackh726`.

This fixes the slowness that was happening on the big CFG from the `saturating-float-casts` test (because of its 12M materialized edges) without, AFAICT, simply moving this overhead to traversal: materializing the logical edges is done on-demand.

r? `@jackh726` (no rush either)
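
The difference between physical and logical edges can be sketched roughly as follows. This is a toy model under assumptions, not the compiler's implementation: `ConstraintGraph`, its `physical` and `logical` fields, and the `successors` and `reachable` methods are hypothetical names. A constraint that holds at all points is stored once per region pair and only turned into an edge at the point of the node currently being visited.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

// Hypothetical stand-ins for region and CFG point indices.
type Region = u32;
type Point = u32;
type Node = (Region, Point);

/// Toy graph: physical edges are stored per node, while constraints that
/// hold at all points are stored once per region pair.
#[derive(Default)]
struct ConstraintGraph {
    physical: HashMap<Node, Vec<Node>>,
    logical: HashMap<Region, Vec<Region>>,
}

impl ConstraintGraph {
    /// Successors of a node: the stored physical edges, plus the logical
    /// edges instantiated at this node's own point, on demand.
    fn successors(&self, (region, point): Node) -> Vec<Node> {
        let mut next = self.physical.get(&(region, point)).cloned().unwrap_or_default();
        if let Some(targets) = self.logical.get(&region) {
            next.extend(targets.iter().map(|&t| (t, point)));
        }
        next
    }

    /// Breadth-first reachability from `start`.
    fn reachable(&self, start: Node) -> HashSet<Node> {
        let mut seen = HashSet::from([start]);
        let mut queue = VecDeque::from([start]);
        while let Some(node) = queue.pop_front() {
            for succ in self.successors(node) {
                if seen.insert(succ) {
                    queue.push_back(succ);
                }
            }
        }
        seen
    }
}
```

In this model the storage cost of an all-points constraint is independent of the number of CFG points, and a logical edge is only expanded for nodes the traversal actually reaches.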

rustbot assigned matthewjasper Dec 13, 2024

rustbot added the S-waiting-on-review and T-compiler labels Dec 13, 2024

rustbot added the S-waiting-on-perf label Dec 13, 2024

rustbot removed the S-waiting-on-perf label Dec 13, 2024

lqd force-pushed the polonius-next branch from 1331ccb to 1696689 December 14, 2024 18:06

lqd mentioned this pull request Dec 14, 2024

A couple of polonius fact generation cleanups #134315

Merged

lqd force-pushed the polonius-next branch from 1696689 to 7559edf December 15, 2024 14:51

jackh726 approved these changes Dec 15, 2024

jackh726 reviewed Dec 15, 2024

lqd mentioned this pull request Dec 16, 2024

An octuple of polonius fact generation cleanups #134378

Merged

lqd added 2 commits December 18, 2024 07:33

introduce localized outlives constraints

b70a915

these are the basic blocks of the naive polonius constraint graph implementation.

add general documentation on the polonius module

a5f0591

this describes the rough algorithm using the localized constraint graph

lqd added 4 commits December 18, 2024 07:33

set up skeleton for localized constraints conversion

e7fb93b

extract main NLL MIR dump function

ee93ce9

this will allow calling from polonius MIR

introduce beginnings of polonius MIR dump

c75c517

This is mostly for test purposes to show the localized constraints until the MIR debugger is set up.

address review comments

aeb3d10

- move constraints to an Option
- check `-Zpolonius=next` only once
- rewrite fixme comments to make the actionable part clear
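
As a rough illustration of the first two review items, the pattern of checking the mode once and carrying the result as an `Option` could look like the sketch below. All names here (`PoloniusMode`, `LocalizedConstraints`, `BorrowckState`, `record`) are hypothetical and chosen for the example; they are not the compiler's actual types.

```rust
/// Hypothetical mirror of whether `-Zpolonius=next` was passed.
#[derive(Clone, Copy, PartialEq, Eq)]
enum PoloniusMode {
    Off,
    Next,
}

/// Placeholder for the localized constraint set (source, from, target, to).
#[derive(Debug, Default, PartialEq)]
struct LocalizedConstraints(Vec<(u32, u32, u32, u32)>);

struct BorrowckState {
    /// `Some` only when the location-sensitive analysis is enabled.
    localized: Option<LocalizedConstraints>,
}

impl BorrowckState {
    /// The mode is inspected once, here; later code only looks at the `Option`.
    fn new(mode: PoloniusMode) -> Self {
        let localized = (mode == PoloniusMode::Next).then(LocalizedConstraints::default);
        BorrowckState { localized }
    }

    /// Records a constraint only if the analysis is enabled; otherwise a no-op.
    fn record(&mut self, constraint: (u32, u32, u32, u32)) {
        if let Some(set) = self.localized.as_mut() {
            set.0.push(constraint);
        }
    }
}
```

With this shape, the disabled path allocates nothing and no later code needs to consult the flag again.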

lqd force-pushed the polonius-next branch from 525887e to aeb3d10 December 18, 2024 07:33

jackh726 approved these changes Dec 21, 2024

rustbot assigned jackh726 and unassigned matthewjasper Dec 21, 2024

bors added the S-waiting-on-bors label and removed the S-waiting-on-review label Dec 21, 2024

bors added the merged-by-bors label Dec 22, 2024

bors merged commit 426d173 into rust-lang:master Dec 22, 2024
7 checks passed

rustbot added this to the 1.85.0 milestone Dec 22, 2024

lqd deleted the polonius-next branch December 22, 2024 05:50

lqd mentioned this pull request Dec 31, 2024

Scalable Polonius support on nightly rust-lang/rust-project-goals#118

Open

18 tasks

lqd mentioned this pull request Jan 9, 2025

Encode constraints that hold at all points as logical edges in location-sensitive polonius #135290

Merged
