Add NodeHandle enum for node references within Node #35

jimpo · 2019-11-11T18:25:11Z

Currently, branch and extension nodes hold only a byte-slice reference to each child node, and callers use the NodeCodec::try_decode_hash method to determine whether that reference is a hash or an inline node. Instead, we make this an explicit part of the decoded Node structure to simplify the code and make the codec more flexible. For example, we can stop encoding the length of the hashes inside branch nodes and instead use another bitfield inside branch nodes to indicate whether each child is a hash or an inline reference, which should save space.

This is another breaking change to NodeCodec building on #34.
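To illustrate the bitfield idea from the description, here is a minimal sketch. It is not the code in this PR: the `NodeHandle` definition is simplified, and `decode_children` and its `is_hash` bitmap layout are hypothetical names and assumptions made only for this example.

```rust
/// Simplified stand-in for a decoded child reference: the variant says
/// explicitly whether the bytes are a hash or an inline encoded node.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum NodeHandle<'a> {
	Hash(&'a [u8]),
	Inline(&'a [u8]),
}

/// Hypothetical helper. Assumed layout: bit `i` of `is_hash` is set when
/// child `i` is a hash reference and clear when it is an inline node.
/// Absent children stay `None` regardless of their bit.
pub fn decode_children<'a>(
	is_hash: u16,
	children: [Option<&'a [u8]>; 16],
) -> [Option<NodeHandle<'a>>; 16] {
	let mut out = [None; 16];
	for (i, child) in children.iter().enumerate() {
		out[i] = child.map(|data| {
			if (is_hash >> i) & 1 == 1 {
				NodeHandle::Hash(data)
			} else {
				NodeHandle::Inline(data)
			}
		});
	}
	out
}
```

With a layout like this, the reader never has to infer the kind of reference from the length of the byte slice; the cost is one extra 16-bit field per branch node.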

cheme

Running the same bench as in #34, the perf from #34 degrades (8%). I am not sure about the origin of it. It could be related to the fact that children now have to decode their inline nodes, but the bench is on the iterator, which should visit every inline node anyway, so it is probably not that (unless there are multiple calls to it).

Regarding the PR, I agree that try_decode_hash should not be a codec method. Maybe there is a way to put it in another trait like Layout or somewhere else, to avoid those static rules (size == length(hash)) in code.

cheme · 2019-11-12T14:13:56Z

trie-db/src/triedbmut.rs

@@ -100,72 +100,84 @@ where
 {
 	// load an inline node into memory or get the hash to do the lookup later.
 	fn inline_or_hash<C, H>(
-		node: &[u8],
+		parent_hash: H::Out,


Pass 'parent_hash' by reference here?

cheme · 2019-11-12T14:14:06Z

trie-db/src/triedbmut.rs

+				NodeHandle::Hash(hash)
+			},
+			EncodedNodeHandle::Inline(data) => {
+				let child = Node::from_encoded::<C, H>(parent_hash, data, db, storage)?;


Is it possible that the small perf hit comes from getting a decoded data handle when previously it was encoded data? I don't really think so (except if, from a branch, this method is called on a child we do not need, but it should not be), but I am not sure where it comes from.

cheme · 2019-11-12T14:18:37Z

trie-db/src/triedbmut.rs

-					child(4), child(5), child(6), child(7),
-					child(8), child(9), child(10), child(11),
-					child(12), child(13), child(14), child(15),
+					child(0)?, child(1)?, child(2)?, child(3)?,


OK, we are doing it here.

I believe the performance hit may come from the fact that, before this PR, the call to child here only went into try_decode_hash, which does not do inline node decoding.
After this PR, we run a decoding step for every inline node (even if it is not needed).

But it shouldn't matter for iteration, since all children are queried. For other operations it can matter, but the effect is probably minor.
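The difference between the two behaviours can be sketched as follows. This is a toy model, not the trie-db code: `EncodedHandle`, `Decoder`, `load_eager` and `load_one` are hypothetical names, and "decoding" is replaced by a call counter so the amount of work is visible.

```rust
use std::cell::Cell;

/// Hypothetical encoded child reference, loosely modelled on the
/// EncodedNodeHandle seen in the diff.
pub enum EncodedHandle<'a> {
	Hash(&'a [u8]),
	Inline(&'a [u8]),
}

/// Stand-in for node decoding: it only counts how often it is called
/// and returns the length of the encoded data.
#[derive(Default)]
pub struct Decoder {
	pub calls: Cell<usize>,
}

impl Decoder {
	pub fn decode(&self, data: &[u8]) -> usize {
		self.calls.set(self.calls.get() + 1);
		data.len()
	}
}

/// Eager strategy: every inline child is decoded when the branch is loaded.
pub fn load_eager(d: &Decoder, children: &[EncodedHandle<'_>]) -> Vec<Option<usize>> {
	children
		.iter()
		.map(|c| match c {
			EncodedHandle::Inline(data) => Some(d.decode(data)),
			EncodedHandle::Hash(_) => None,
		})
		.collect()
}

/// Lazy strategy: only the child that is actually requested is decoded.
pub fn load_one(d: &Decoder, children: &[EncodedHandle<'_>], index: usize) -> Option<usize> {
	match children.get(index)? {
		EncodedHandle::Inline(data) => Some(d.decode(data)),
		EncodedHandle::Hash(_) => None,
	}
}
```

For a full iteration both strategies end up decoding the same set of inline children, which matches the observation above; they differ only for operations that touch a single child of a branch.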

cheme · 2019-11-12T14:59:08Z

No, sorry, I misread the code; there are no such static rules.

arkpar · 2019-11-18T14:17:22Z

Needs a rebase

arkpar · 2019-11-19T11:56:05Z

Performance hit might come from from_encoded returning Result now. I think proper error handling is still worth it though.
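For context, the pattern in question looks roughly like the sketch below. It is a simplified illustration with hypothetical types (`DecodeError`, a `u8` standing in for a decoded node, four children instead of sixteen), not the actual `Node::from_encoded` signature, and it says nothing about the measured cost.

```rust
/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
pub enum DecodeError {
	Empty,
}

/// Stand-in for a fallible decode: succeeds with the first byte,
/// fails on empty input.
pub fn from_encoded(data: &[u8]) -> Result<u8, DecodeError> {
	data.first().copied().ok_or(DecodeError::Empty)
}

/// Builds the child array with `?` on each element, as in the
/// `child(0)?, child(1)?, ...` lines of the diff. The first failing
/// child aborts the whole load and its error is returned to the caller.
pub fn load_children(encoded: [&[u8]; 4]) -> Result<[u8; 4], DecodeError> {
	let child = |i: usize| from_encoded(encoded[i]);
	Ok([child(0)?, child(1)?, child(2)?, child(3)?])
}
```

Each `?` adds a branch on the error path, but in exchange malformed child data surfaces as an error value rather than being handled implicitly.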

jimpo requested review from cheme and arkpar November 11, 2019 18:25

cheme reviewed Nov 12, 2019

jimpo added 3 commits November 18, 2019 17:06

Introduce explicit NodeHandle in node instead of using try_decode_hash.

ee49cef

Remove try_decode_hash from NodeCodec trait.

e873471

Refactor NodeCodec to have HashOut associated type.

42574e6

jimpo force-pushed the jimpo/decode-hash branch from 41a51f8 to 42574e6 Compare November 18, 2019 16:07

arkpar approved these changes Nov 19, 2019

arkpar merged commit c5c64a4 into master Nov 19, 2019

arkpar deleted the jimpo/decode-hash branch November 19, 2019 11:56
