feat(map): Allow chaining lists before mapping #136

jan-ferdinand · 2024-09-19T13:50:22Z

Previously, the higher-order assembly function `map` took all elements of an input list, mapped them according to some supplied function, and generated an output list. Written in Rust, this was equivalent to `list.into_iter().map(f).collect_vec()`.

Now, `map` is generalized over the number of input lists, which allows chaining multiple input lists while still producing only one output list. In Rust: `list_0.into_iter().chain(list_1).map(f).collect_vec()`.

Between 0 and 15 (both inclusive) input lists are supported.

Note: the ABI for `map` has changed. Before, an inner function could access runtime parameters on the stack at a distance of 3. Now, this distance has increased to `3 + num_input_lists`. For a “traditional” map, this is now 4.
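The chaining semantics described above can be sketched in plain Rust. This is only an illustration of the behavior, not the assembly implementation: the function names `chain_map` and `chain_map_two` are made up for this example, and std's `collect` is used in place of itertools' `collect_vec`.

```rust
/// Chains all input lists in order, then maps `f` over every element.
/// `lists` may be empty, in which case the output list is empty too.
/// (Hypothetical helper, named for this illustration only.)
fn chain_map<T, U>(lists: Vec<Vec<T>>, f: impl Fn(T) -> U) -> Vec<U> {
    lists.into_iter().flatten().map(f).collect()
}

/// The two-list case, written like the one-liner in the description.
/// (Hypothetical helper, named for this illustration only.)
fn chain_map_two<T, U>(list_0: Vec<T>, list_1: Vec<T>, f: impl Fn(T) -> U) -> Vec<U> {
    list_0.into_iter().chain(list_1).map(f).collect()
}
```

Both helpers agree on two input lists; `chain_map` additionally covers the edge case of zero input lists.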

Sword-Smith · 2024-09-19T14:12:34Z

Looks very good. Two observations:

1. There are some benchmark results that show surprising changes.
2. Instead of restricting values generated for environment_args in pseudorandom_initial_state, would it make sense to rewrite the test containing an assert that there was no overflow (on a u128 addition) to not use the .test() method but let it set up its environment in a more manual way?

jan-ferdinand · 2024-09-19T14:43:50Z

> There are some benchmark results that show surprising changes.

True. I guess that pertains mainly to tasmlib_list_higher_order_u32_map_test_hash_xfield_element. Could it be explained by different test input being generated? Since the input state generator has to accommodate multiple lists, I rewrote it, re-ordering the operations and arguments to the various rng::gen() calls.

> [W]ould it make sense to rewrite the test [to] not use the .test() method but let it set up its environment in a more manual way?

Potentially yes. The one thing I tried was accessing ShadowedFunction::test_initial_state(), but that's a private method (and I'm fine with that). What would be a good way to set up a test in this way?

tasm-lib/src/list/higher_order/map.rs

Sword-Smith · 2024-09-19T22:02:58Z

Please note that I rebased this branch onto master so that I could call the test_rust_equivalence_given_execution_state test helper function. With it, I rewrote the test case that forced you to do shady things in the pseudorandom_initial_state trait function.

jan-ferdinand · 2024-09-20T11:45:22Z

Apart from your suggestions, I also made one more addition that might be worth reviewing: the type ChainMap now declares an associated constant NUM_INTERNAL_REGISTERS, with which the required offset for any mapping function requiring runtime parameters from deeper on the stack can be accessed programmatically. The intention is to

- centralize knowledge: other snippets do not need to be aware of chain_map internals, and
- future-proof our code: should we choose to optimize ChainMap::<1>, we can now do so without having to touch all snippets that use Map.

Let me know if you have any suggestions to improve this design.
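To make the design concrete, here is a minimal sketch of an associated constant on a const-generic type. `ChainMapSketch` and `MapSketch` are hypothetical stand-ins, not the real tasm-lib types, and the value `3 + NUM_INPUT_LISTS` is an assumption taken from the stack distance stated in the PR description.

```rust
/// Hypothetical stand-in for `ChainMap`, generic over the number of input lists.
struct ChainMapSketch<const NUM_INPUT_LISTS: usize>;

impl<const NUM_INPUT_LISTS: usize> ChainMapSketch<NUM_INPUT_LISTS> {
    /// Assumed to match the stack distance from the PR description.
    const NUM_INTERNAL_REGISTERS: usize = 3 + NUM_INPUT_LISTS;
}

/// A "traditional" map is the single-list case.
type MapSketch = ChainMapSketch<1>;

/// A dependent snippet reads the offset from the constant instead of
/// hard-coding it, so a later change to the internals needs no edit here.
fn inner_fn_parameter_offset() -> usize {
    MapSketch::NUM_INTERNAL_REGISTERS
}
```

Under these assumptions, `inner_fn_parameter_offset()` yields 4 for the single-list case, and a snippet using it would not need to change if the constant's definition did.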

jan-ferdinand · 2024-09-20T11:48:31Z

> some benchmark results that show surprising changes

Is the change to the initial state generator a likely explanation? If so, are we fine with the benchmark changing more dramatically than the changes to the assembly alone would suggest?

Sword-Smith · 2024-09-20T12:13:04Z

> some benchmark results that show surprising changes
>
> Is the change to the initial state generator a likely explanation? If so, are we fine with the benchmark changing more dramatically than the changes to the assembly alone would suggest?

Yes, we are fine with that. But with the suggested change of using the bench input parameter to set the list length (a suggestion you already implemented), we can trust the benchmarks of Map going forward. That's enough for me.

Other benchmarks reveal the performance changes to Map resulting from this PR, and those changes look positive (positive in the value sense, i.e., a decrease in the row counts).

Sword-Smith · 2024-09-20T12:16:14Z

tasm-lib/src/list/higher_order/map.rs

                assert_eq!(1, sn.input_types().len(), "{INNER_FN_INCORRECT_NUM_INPUTS}");
                let fn_body = sn.function_code(library);
                let (_, instructions) = tokenize(&fn_body).unwrap();
                let labelled_instructions = isa::parser::to_labelled_instructions(&instructions);
-                let snippet_name =
-                    library.explicit_import(&sn.entrypoint_name(), &labelled_instructions);
-                (triton_asm!(call { snippet_name }), String::default())
+                let label = library.explicit_import(&sn.entrypoint_name(), &labelled_instructions);


For some reason, we are using explicit_import here. I'm not 100 % sure that's necessary. Maybe we can just use the import method directly on the snippet instead? Or maybe there's some borrowing check that fails if you do that? Unsure. Feel free to investigate, or to just ignore this comment, as the current approach works just fine.

I tried that, but could not get it to work. The main culprit seems to be the trait object snippet, which is of type &Box<dyn BasicSnippet>. Library::import() requires a Box<dyn BasicSnippet>. Dereferencing snippet does not work because that would constitute a move, but the method only takes &self. Similar problems arise when changing the match statement to match self.f instead of &self.f. Cloning the snippet also does not work, because the Snippet trait does not have a trait bound on Clone. When trying to introduce such a trait bound, all hell breaks loose.

Long story short: it might be possible, but I decided to give up after a few minutes of trying.

Oh, my comment is about the match arm one further down. The place where you put the comment has the additional complication that the type of sn is &Box<dyn DeprecatedSnippet>, which would have to be transformed into a Box<dyn BasicSnippet> first.
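The ownership problem described in this thread can be reproduced in a small, self-contained sketch. All names here (`Snippet`, `Named`, `ToyLibrary`, `ToyMap`, `import_by_name`) are hypothetical stand-ins with simplified signatures; they only mimic the shape of the situation and are not the real tasm-lib API.

```rust
/// Hypothetical snippet trait; note that it has no `Clone` bound.
trait Snippet {
    fn entrypoint(&self) -> String;
}

/// A trivial snippet used only for this illustration.
struct Named(&'static str);

impl Snippet for Named {
    fn entrypoint(&self) -> String {
        self.0.to_string()
    }
}

/// Hypothetical library that records imported entrypoints.
#[derive(Default)]
struct ToyLibrary {
    imported: Vec<String>,
}

impl ToyLibrary {
    /// Takes the boxed snippet by value, so the caller must own it.
    fn import(&mut self, snippet: Box<dyn Snippet>) -> String {
        self.import_by_name(&snippet.entrypoint())
    }

    /// Needs only borrowed data, so it also works from behind `&self`.
    fn import_by_name(&mut self, name: &str) -> String {
        self.imported.push(name.to_string());
        name.to_string()
    }
}

struct ToyMap {
    f: Box<dyn Snippet>,
}

impl ToyMap {
    fn call_inner(&self, library: &mut ToyLibrary) -> String {
        let snippet: &Box<dyn Snippet> = &self.f;
        // library.import(*snippet);        // rejected: cannot move out of a shared reference
        // library.import(snippet.clone()); // rejected: `Box<dyn Snippet>` is not `Clone`
        let label = library.import_by_name(&snippet.entrypoint());
        format!("call {label}")
    }
}
```

In this toy setting, the by-name import plays the role that explicit_import plays in the diff: it avoids taking ownership of the trait object, which is why it works from a method that only has `&self`.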

jan-ferdinand requested a review from Sword-Smith September 19, 2024 13:50

jan-ferdinand force-pushed the chain_map branch from a292324 to 91f80d8 Compare September 19, 2024 14:07

Sword-Smith force-pushed the chain_map branch from 91f80d8 to 67d1292 Compare September 19, 2024 21:48

Sword-Smith requested changes Sep 19, 2024

Sword-Smith force-pushed the master branch from a2a1b6a to 8d5ffc9 Compare September 19, 2024 22:31

Sword-Smith self-requested a review September 20, 2024 12:13

Sword-Smith approved these changes Sep 20, 2024

Sword-Smith reviewed Sep 20, 2024

jan-ferdinand force-pushed the chain_map branch from cd5ef3d to f25b154 Compare September 20, 2024 12:46

jan-ferdinand force-pushed the chain_map branch from f25b154 to 031927a Compare September 20, 2024 12:47

jan-ferdinand merged commit 031927a into master Sep 20, 2024
3 checks passed

jan-ferdinand deleted the chain_map branch September 20, 2024 12:48
