perf: Merge done_cache and active_cache in ObligationForest #67892

Marwes · 2020-01-05T09:50:16Z

The first commit removes one hash lookup (of two) in `register_obligation_at`. It is followed by a couple of refactorings to clarify ObligationForest.
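To illustrate the idea of folding two caches into one, here is a minimal sketch, not the PR's actual code. It assumes a single map whose value is `Some(index)` for an active obligation and `None` for a done one, so one `entry` call answers both questions. The names `MergedCache`, `Registration`, and `mark_done` are hypothetical, and `&'static str` stands in for the predicate type.

```rust
use std::collections::hash_map::Entry;
use std::collections::HashMap;

/// Hypothetical stand-in for the forest's node index type.
pub type NodeIndex = usize;

/// Outcome of a registration attempt (illustrative only).
#[derive(Debug, PartialEq)]
pub enum Registration {
    New(NodeIndex),
    AlreadyActive(NodeIndex),
    AlreadyDone,
}

/// Assumed layout: one map instead of a done set plus an active map.
/// `Some(index)` means the predicate is active, `None` means it is done.
#[derive(Default)]
pub struct MergedCache {
    cache: HashMap<&'static str, Option<NodeIndex>>,
    next_index: NodeIndex,
}

impl MergedCache {
    /// A single hash lookup decides between "new", "active" and "done".
    pub fn register(&mut self, predicate: &'static str) -> Registration {
        match self.cache.entry(predicate) {
            Entry::Occupied(slot) => match *slot.get() {
                Some(index) => Registration::AlreadyActive(index),
                None => Registration::AlreadyDone,
            },
            Entry::Vacant(slot) => {
                let index = self.next_index;
                self.next_index += 1;
                slot.insert(Some(index));
                Registration::New(index)
            }
        }
    }

    /// Marks a predicate as done while keeping its entry in the map.
    pub fn mark_done(&mut self, predicate: &'static str) {
        if let Some(slot) = self.cache.get_mut(predicate) {
            *slot = None;
        }
    }
}
```

With two separate maps, the same check would probe the done set first and the active map second. The trade-off is that done entries now stay in the one map, so it only grows.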

rust-highfive · 2020-01-05T09:50:20Z

r? @davidtwco

(rust_highfive has picked a reviewer for you, use r? to override)

jonas-schievink · 2020-01-05T13:30:29Z

@bors try @rust-timer queue

rust-timer · 2020-01-05T13:30:30Z

Awaiting bors try build completion

bors · 2020-01-05T13:30:44Z

⌛ Trying commit ef83c93 with merge 00f61087aa85f4929c9cc4e71b161f462cfbea97...

petrochenkov · 2020-01-05T13:32:08Z

cc @nnethercote

bors · 2020-01-05T16:21:01Z

☀️ Try build successful - checks-azure
Build commit: 00f61087aa85f4929c9cc4e71b161f462cfbea97

rust-timer · 2020-01-05T16:21:03Z

Queued 00f61087aa85f4929c9cc4e71b161f462cfbea97 with parent 7785834, future comparison URL.

davidtwco · 2020-01-06T10:47:01Z

r? @nnethercote (though he's currently on PTO, so it might be worth re-assigning, but I'm not sure who else is a good choice)

Marwes · 2020-01-06T14:37:01Z

rust-timer didn't post the completion message and the regressions look rather excessive... The changes should really just make it do less work (modulo some added branches which should not affect things much).

bjorn3 · 2020-01-06T14:53:15Z

src/librustc_data_structures/obligation_forest/mod.rs

+        &mut self,
+        obligation: O,
+        parent: Option<NodeIndex>,
+    ) -> Result<(), ()> {
        match self.active_cache.entry(obligation.as_predicate().clone()) {


This now clones the predicate even when it is done. Maybe that is the problem?

Looked into the predicate type: it is always a `ty::Predicate` (or `&str` in tests), which is copyable and only takes up 32 bytes. That isn't tiny, but it shouldn't really affect things...

Ugh, ok I think it is because active_cache now gets really big, so the retain becomes really expensive as it fills up with None values...
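A rough sketch of why this gets expensive, under the assumption that done entries are stored as `None` and never dropped: `HashMap::retain` has to visit every entry, so its cost follows the total number of entries rather than the few active ones. The function `prune_active` and the `live` cutoff are hypothetical simplifications, not the forest's real pruning logic.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the forest's node index type.
pub type NodeIndex = usize;

/// Drops active entries whose index is no longer live and returns how many
/// entries `retain` had to visit to do so.
pub fn prune_active(cache: &mut HashMap<u32, Option<NodeIndex>>, live: usize) -> usize {
    let mut visited = 0;
    cache.retain(|_predicate, slot| {
        visited += 1;
        match slot {
            // Active entry: keep it only if its node still exists.
            Some(index) => *index < live,
            // Done entry: kept forever so it can still deduplicate.
            None => true,
        }
    });
    visited
}
```

With 1000 done entries and 2 active ones, each call visits all 1002 entries to remove at most 2 of them.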

The `alternative_predicates` is instead used to track when predicates change so that they still get removed/marked as done on completion. Unsure if this is the best way but it seems to work.

Marwes · 2020-01-06T22:09:56Z

I think I have a way to remove the retain on done_cache but it is not as clean as I'd like as the behavior around when the inner predicate changes is a bit hard to understand.

As a side effect, the need for rewriting node indices in compress was removed, which did at least remove some other messy code.
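For context on what "rewriting node indices" involves, here is a generic sketch of compacting a `Vec` of nodes that refer to each other by index. It is an illustration under assumed types (`Node`, a single `parent` link), not the forest's actual `compress`.

```rust
/// Hypothetical node type: nodes refer to their parent by index into the Vec.
#[derive(Debug, PartialEq)]
pub struct Node {
    pub name: &'static str,
    pub done: bool,
    pub parent: Option<usize>,
}

/// Removes done nodes and rewrites the surviving `parent` indices so they
/// point at the new positions.
pub fn compress(nodes: &mut Vec<Node>) {
    // rewrites[old] is Some(new) for kept nodes and None for removed ones.
    let mut rewrites: Vec<Option<usize>> = Vec::with_capacity(nodes.len());
    let mut kept = 0;
    for node in nodes.iter() {
        if node.done {
            rewrites.push(None);
        } else {
            rewrites.push(Some(kept));
            kept += 1;
        }
    }

    nodes.retain(|node| !node.done);

    // Every stored index must be translated; a removed parent becomes None.
    for node in nodes.iter_mut() {
        node.parent = node.parent.and_then(|old| rewrites[old]);
    }
}
```

Any other structure that stores node indices (such as a cache keyed by predicate) needs the same translation step, which is the kind of bookkeeping the comment above describes as messy.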

jonas-schievink · 2020-01-06T22:19:54Z

Let's give this another go then.

@bors try @rust-timer queue

rust-timer · 2020-01-06T22:19:56Z

Awaiting bors try build completion

bors · 2020-01-06T22:20:10Z

⌛ Trying commit f33df29 with merge 20e1c5fd4d787c557ee4c2b745ff727a5e3e87a5...

bors · 2020-01-07T01:02:37Z

☀️ Try build successful - checks-azure
Build commit: 20e1c5fd4d787c557ee4c2b745ff727a5e3e87a5

rust-timer · 2020-01-07T01:02:39Z

Queued 20e1c5fd4d787c557ee4c2b745ff727a5e3e87a5 with parent ebbb2bf, future comparison URL.

rust-timer · 2020-01-07T03:33:59Z

Finished benchmarking try commit 20e1c5fd4d787c557ee4c2b745ff727a5e3e87a5, comparison URL.

Marwes · 2020-01-09T10:43:43Z

Closing until I can figure out a way to avoid the regressions.

nnethercote · 2020-01-30T03:00:36Z

I'm late to the party here, due to PTO, but my main comment is that ObligationForest's code is really hot in some cases (esp. keccak and inflate) and has been highly optimized. Even small and seemingly innocuous changes can cause measurable regressions (I have given up on dozens of such changes for that reason), and this PR has a relatively large change in the data structures.

Marwes · 2020-01-30T13:38:42Z

I am aware that it can be really hot (that's why I looked at improving it!). While there are many micro-optimizations which help improve the status quo, I do believe the algorithm as a whole can be improved. Iterating and filtering through all obligations on each process_call can definitely balloon in runtime if it ends up going through them multiple times without being able to actually remove any obligations (I suspect this is part of the problem in #68666, though lack of memoization is probably the main problem).

Marwes · 2020-02-05T10:15:52Z

Looked into why inflate is so heavy on ObligationForest and the culprit appears to be that ~3k obligations end up as pending which are then iterated through on every call to process_obligations, checking if they can make progress.

This makes it clear why ObligationForest is so resistant to changes in data structures. The current way of representing the data as a plain Vec is more or less optimal when iterating through obligations to search for one that can make progress. Any change that adds indirection to that single process ends up getting heavily penalized even if there are wins elsewhere.
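The cost pattern described here can be sketched as a plain linear pass over a `Vec`. This is a simplified model with hypothetical names (`Node`, `waiting_on`, `process_pass`), assuming each pending obligation waits on a single variable id; the real forest tracks more state.

```rust
use std::collections::HashSet;

/// Hypothetical pending obligation that waits on one inference variable.
pub struct Node {
    pub waiting_on: u32,
    pub done: bool,
}

/// One pass over the forest: every pending node is checked, whether or not
/// anything it waits on has changed. Returns (nodes checked, nodes completed).
pub fn process_pass(nodes: &mut [Node], resolved: &HashSet<u32>) -> (usize, usize) {
    let mut checked = 0;
    let mut progressed = 0;
    for node in nodes.iter_mut().filter(|n| !n.done) {
        checked += 1;
        if resolved.contains(&node.waiting_on) {
            node.done = true;
            progressed += 1;
        }
    }
    (checked, progressed)
}
```

With 3000 stalled nodes, every pass performs 3000 checks even when nothing can progress. The loop itself is a tight scan over contiguous memory, which is why extra indirection inside it is so visible in benchmarks.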

So to actually get some wins here we need to do one or more of the following:

- Don't generate so many obligations that can't be solved (yet). Lazy normalization might help here, perhaps?
- Track which obligations can actually make progress. If we know which (if any) obligations can make progress, there is no need to search for them. I have a rough idea, though it may very well have more overhead than it is worth.
- Call process_obligations less often. This may give ObligationForest more information between each pass so that fewer full iterations are needed.
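One possible shape for the second option is a wake-up list. This is purely an illustrative assumption and not necessarily the rough idea mentioned above: stalled nodes are filed under the variable they wait on, and resolving a variable moves only its waiters to a ready queue. `Worklist`, `stall`, `resolve`, and `take_ready` are hypothetical names.

```rust
use std::collections::HashMap;

/// Hypothetical bookkeeping: which node indices wait on which variable id.
#[derive(Default)]
pub struct Worklist {
    waiters: HashMap<u32, Vec<usize>>,
    ready: Vec<usize>,
}

impl Worklist {
    /// Records that `node` cannot progress until `var` is resolved.
    pub fn stall(&mut self, node: usize, var: u32) {
        self.waiters.entry(var).or_default().push(node);
    }

    /// Called when `var` gets resolved: only its waiters become ready.
    pub fn resolve(&mut self, var: u32) {
        if let Some(nodes) = self.waiters.remove(&var) {
            self.ready.extend(nodes);
        }
    }

    /// Hands out the nodes worth re-checking and clears the ready queue.
    pub fn take_ready(&mut self) -> Vec<usize> {
        std::mem::take(&mut self.ready)
    }
}
```

A pass then touches only the nodes returned by `take_ready` instead of scanning every pending node. The price is a hash operation per stall and per resolution, which is exactly the kind of overhead that may outweigh the savings.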

Marwes added 3 commits January 5, 2020 10:24

perf: Merge done_cache and active_cache in ObligationForest

13a4bd4

Removes one hash lookup (of two) in `register_obligation_at`.

refactor: Remove unnecessary RefCell

18b1a6d

refactor: Add a NodeIndex alias to clarify ObligationForest

ef83c93

rust-highfive assigned davidtwco Jan 5, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jan 5, 2020

rust-highfive assigned nnethercote and unassigned davidtwco Jan 6, 2020

bjorn3 reviewed Jan 6, 2020


Avoid the need to call retain on active_cache

f33df29

The `alternative_predicates` is instead used to track when predicates change so that they still get removed/marked as done on completion. Unsure if this is the best way but it seems to work.

Marwes closed this Jan 9, 2020
