Remove procedure cache from the assembler #1411

bobbinth · 2024-07-23T07:28:24Z

This PR builds on #1409 and tries to further simplify the assembler by removing ProcedureCache struct.

The PR consists of 3 commits:

The first commit just removes the ProcedureCache struct and moves all relevant functionality into the MastForestBuilder struct.
The second commit removes ResolvedTarget::Cached variant. The motivation is that it doesn't really matter if the procedure is cached or not - MastForestBuilder will make sure that we cannot create nodes with duplicate MAST roots, and so the same procedure cannot be inserted twice into the forest.
The last commit completely removes root tracking from the ModuleGraph. This means that the name resolver does not try to resolve InvocationTarget::MastRoot but just maps it directly to ResolvedTarget::PhantomCall (which I renamed into ResolvedTarget::MastRoot). The idea here is similar to the previous point in that MastForestBuilder has enough information to resolve a MAST root to a correct node in the forest.

I'm pretty sure that the first commit does not break anything, but I'm less sure about the other two commits (especially the 3rd one). So, it is possible to that I've missed some important details (tests are passing, but it is possible that something is not being tested).

Update: the last commit was removed from this PR as we can't test its effects sufficiently yet.

bobbinth · 2024-07-23T08:32:36Z

assembly/src/assembler/module_graph/name_resolver.rs

                    // This is a phantom procedure - we know its root, but do not have its
                    // definition
                    break Err(AssemblyError::Failed {


It is not really clear to me why we have this error condition here.

Because the find method expects to resolve a FullyQualifiedProcedureName to a GlobalProcedureIndex or return an error if that isn't possible. This is one of the reasons why it is useful to represent basic information about compiled modules/procedures in the graph, since those procedures will get assigned a GlobalProcedureIndex, and can be used to resolve invocations throughout the compilation graph.

bitwalker · 2024-07-23T16:26:10Z

assembly/src/assembler/mast_forest_builder.rs

+
+        // We don't have a cache entry yet, but we do want to make sure we don't have a conflicting
+        // cache entry with the same MAST root:
+        if let Some(cached) = self.find_procedure(&proc_root) {


I think we need to discuss adding a new node type to MAST that represents procedure entry, specifically to ensure that the number of procedure locals is represented in the digest. This also lets us recover essential procedure metadata when loading a library from MAST (such as number of procedure locals).

bitwalker

I think all of these changes should be fine. However, without some tests to verify assembly of various combinations of compiled and source-form modules, it is very difficult to say for sure if there are any issues that fall out of this. Unfortunately, we can't really add all of the ones we need until after we have #1401 merged. We could probably add some tests around phantoms though.

One of the risks of conflating procedures we have information about and those we don't (phantoms), is that there are places where we handled phantoms differently (and now treat them like resolved MAST root), and places where we handled a resolved MAST root differently (and will now treat them like a phantom). This may result in things that would have previously compiled successfully, failing to compile; or vice versa. We'll definitely need a good suite of tests to cover the various ways (successfully and unsuccessfully) that you can assemble a mix of compiled and source-form modules, to ensure that the behavior is correct.

bitwalker · 2024-07-23T17:43:57Z

assembly/src/assembler/module_graph/rewrites/module.rs

-                    kind,
-                    target: target.clone(),
-                });
-            }
            Ok(ResolvedTarget::Phantom(_)) => (),


We should start populating self.invoked for ResolvedTarget::Phantom, since we use that information to construct the set of procedures referenced transitively from any given procedure in the module being visited. Previously, we only did that for ResolvedTarget::Cached, because we only cared about procedures we actually had on hand, but now that all targets resolved to a MAST root will use ResolvedTarget::Phantom, we should track all such references.

bitwalker · 2024-07-23T17:50:17Z

assembly/src/assembler/module_graph/name_resolver.rs

                    // This is a phantom procedure - we know its root, but do not have its
                    // definition
                    break Err(AssemblyError::Failed {


Because the find method expects to resolve a FullyQualifiedProcedureName to a GlobalProcedureIndex or return an error if that isn't possible. This is one of the reasons why it is useful to represent basic information about compiled modules/procedures in the graph, since those procedures will get assigned a GlobalProcedureIndex, and can be used to resolve invocations throughout the compilation graph.

bobbinth · 2024-07-23T19:20:09Z

I think all of these changes should be fine. However, without some tests to verify assembly of various combinations of compiled and source-form modules, it is very difficult to say for sure if there are any issues that fall out of this. Unfortunately, we can't really add all of the ones we need until after we have #1401 merged. We could probably add some tests around phantoms though.

One of the risks of conflating procedures we have information about and those we don't (phantoms), is that there are places where we handled phantoms differently (and now treat them like resolved MAST root), and places where we handled a resolved MAST root differently (and will now treat them like a phantom). This may result in things that would have previously compiled successfully, failing to compile; or vice versa. We'll definitely need a good suite of tests to cover the various ways (successfully and unsuccessfully) that you can assemble a mix of compiled and source-form modules, to ensure that the behavior is correct.

Agreed. What I'll do is roll-back the 3rd commit in this PR and then once we have a more complete implementation of different parts, we can re-apply it in a separate PR.

bobbinth requested review from bitwalker and plafer July 23, 2024 08:29

bobbinth commented Jul 23, 2024

View reviewed changes

bitwalker reviewed Jul 23, 2024

View reviewed changes

bitwalker approved these changes Jul 23, 2024

View reviewed changes

bobbinth force-pushed the bobbin-remove-proc-cache branch from f71665d to 6730fc3 Compare July 23, 2024 20:44

Base automatically changed from plafer-single-use-assembler to next July 23, 2024 20:49

plafer and others added 3 commits July 23, 2024 13:56

Remove all uses of AssemblyContext

07c43f2

chore: resolve merge conflict

27c1fc0

refactor: remove ProcedureCache from assembler

d04a0d6

bobbinth force-pushed the bobbin-remove-proc-cache branch from 6730fc3 to d04a0d6 Compare July 23, 2024 21:06

bobbinth merged commit 51ab7bb into next Jul 23, 2024
9 checks passed

bobbinth deleted the bobbin-remove-proc-cache branch July 23, 2024 21:21

bobbinth mentioned this pull request Aug 5, 2024

Implement extensible subsystem for on-demand storage/provisioning of MAST objects #1226

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove procedure cache from the assembler #1411

Remove procedure cache from the assembler #1411

bobbinth commented Jul 23, 2024 •

edited

Loading

bobbinth Jul 23, 2024

bitwalker Jul 23, 2024

bitwalker Jul 23, 2024

bitwalker left a comment

bitwalker Jul 23, 2024

bitwalker Jul 23, 2024

bobbinth commented Jul 23, 2024

Remove procedure cache from the assembler #1411

Remove procedure cache from the assembler #1411

Conversation

bobbinth commented Jul 23, 2024 • edited Loading

bobbinth Jul 23, 2024

Choose a reason for hiding this comment

bitwalker Jul 23, 2024

Choose a reason for hiding this comment

bitwalker Jul 23, 2024

Choose a reason for hiding this comment

bitwalker left a comment

Choose a reason for hiding this comment

bitwalker Jul 23, 2024

Choose a reason for hiding this comment

bitwalker Jul 23, 2024

Choose a reason for hiding this comment

bobbinth commented Jul 23, 2024

bobbinth commented Jul 23, 2024 •

edited

Loading