Use `MAX_PREALLOCATION` consistently #605

serban300 · 2024-06-19T07:12:07Z

Related to #609

Use MAX_PREALLOCATION both when reading a vec from bytes and when decoding each element.

serban300 · 2024-07-17T12:16:21Z

@bkchr @koute could you PTAL on this PR since it's somewhat related to #609 ? Even though it's not a prerequisite, I think it would be nice to have.

Use `MAX_PREALLOCATION` both when reading a vec from bytes and when decoding each element.

Increase MAX_PREALLOCATION in order to avoid calling realloc too often

koute · 2024-07-22T04:28:45Z

src/codec.rs

-	// If there is input len and it cannot be pre-allocated then return directly.
-	if input_len.map(|l| l < byte_len).unwrap_or(false) {
-		return Err("Not enough data to decode vector".into());
+		num_undecoded_items = num_undecoded_items.saturating_sub(chunk_len);


This saturating_sub's completely unnecessary here since impossible to have chunk_len > num_undecoded_items due to the min.

Addressed in #615

koute · 2024-07-22T04:33:36Z

src/codec.rs

+	if let Some(input_len) = input.remaining_len()? {
+		if input_len < len {
+			return Err("Not enough data to decode vector".into());
+		}
+	}


This isn't correct as deserializing T might take any number of bytes (including even zero bytes, e.g. ()).

What we should do here is to have a serialized_size_hint() method (or, more specifically, probably an associated const so that it can be checked statically to fit within MAX_PREALLOCATION) or something like that on T which would return a value that could allow this check. (We already have encoded_fixed_size there, but that returns an exact number of bytes; it could be used here, but technically that's too strict and we can do better here by using the minimum.)

We should just drop this check.

Yeah, alternatively we can drop it. Although having it here can have one benefit - if we end up not having enough data then this will return an early error instead of wasting time trying to deserialize it. Nice to have, but not strictly necessary.

To implement this, we would need to write quite a lot of code. For example for an enum we would need to know the variant that requires the least amount of bytes. However, it could then still fail at decoding because we try to decode always the enum variant that uses much more bytes etc.

Hm, well, would it be that much code? I implemented this in my serialization crate and it's mostly fine; with enums you essentially just autogenerate a (min(variant1, variant2, ..), max(variant1, variant2, ..)) in your impl. Of course this is just an optimization (in some cases it would make incomplete deserializations fail early, and in some cases it would allow the compiler to remove per-element size checks), and as you've said it can still fail at decoding depending on what you're decoding.

Anyway, I'm fine with going with your suggestion to just delete the check.

Removed it for the moment: #615

koute · 2024-07-22T04:45:16Z

src/codec.rs

 where
-	I: Input,
-	T: ToMutByteSlice + Default + Clone,
+	F: FnMut(&mut Vec<T>, usize) -> Result<(), Error>,
 {
 	debug_assert!(MAX_PREALLOCATION >= mem::size_of::<T>(), "Invalid precondition");


We should make this into a static assert and check it at compile time.

I couldn't manage to do this so far. I tried something like

const _: () = { assert!(MAX_PREALLOCATION >= mem::size_of::<T>()) }

inside decode_vec_chunked()

But I'm getting an error: can't use generic parameters from outer item.

Any suggestion would be helpful

You don't need to define a constant; since Rust 1.79 you can use a const {} block to force const evaluation of an expression.

Yes, this works, thanks ! PTAL on #615

But the CI fails, because the CI image uses rust 1.73.0 . We can try to release a paritytech/ci-unified:bullseye-1.79.0 image. Will check how this can be done.

serban300 · 2024-07-22T06:59:24Z

Thanks for the review ! Will address the comments in a new PR today.

LE: Here is the PR: #615

* Address #605 code review comments * Check MAX_PREALLOCATION >= mem::size_of::<T> statically * Update CI image to paritytech/ci-unified:bullseye-1.79.0 This reverts commit c54689d.

serban300 self-assigned this Jun 19, 2024

serban300 marked this pull request as draft June 19, 2024 07:15

serban300 changed the title ~~Use MAX_PREALLOCATION consistently~~ [WIP] Use MAX_PREALLOCATION consistently Jun 19, 2024

serban300 force-pushed the small-fixes branch from 744c41c to a7ad4db Compare June 19, 2024 07:17

serban300 changed the title ~~[WIP] Use MAX_PREALLOCATION consistently~~ Use MAX_PREALLOCATION consistently Jun 19, 2024

serban300 marked this pull request as ready for review June 19, 2024 07:34

serban300 force-pushed the small-fixes branch from a7ad4db to 05f06f2 Compare July 17, 2024 10:18

serban300 requested review from bkchr and koute July 17, 2024 12:15

serban300 added 3 commits July 17, 2024 18:56

Use MAX_PREALLOCATION consistently

c52cb31

Use `MAX_PREALLOCATION` both when reading a vec from bytes and when decoding each element.

Simplify VecDeque::encode_to()

31d6c23

Increase MAX_PREALLOCATION

247c9e0

Increase MAX_PREALLOCATION in order to avoid calling realloc too often

serban300 force-pushed the small-fixes branch from 152e4ad to 247c9e0 Compare July 17, 2024 15:56

bkchr approved these changes Jul 18, 2024

View reviewed changes

serban300 merged commit 36baa4f into paritytech:master Jul 19, 2024
16 of 17 checks passed

koute reviewed Jul 22, 2024

View reviewed changes

serban300 mentioned this pull request Jul 22, 2024

Follow-up on #605 #615

Merged

serban300 added a commit that referenced this pull request Jul 23, 2024

Follow-up on #605 (#615)

a388fa9

* Address #605 code review comments * Check MAX_PREALLOCATION >= mem::size_of::<T> statically * Update CI image to paritytech/ci-unified:bullseye-1.79.0 This reverts commit c54689d.

niklasad1 mentioned this pull request Oct 24, 2024

Prep to release 3.7.1 #645

Closed

jsdw mentioned this pull request Oct 24, 2024

Prep to release 3.7.0 #646

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `MAX_PREALLOCATION` consistently #605

Use `MAX_PREALLOCATION` consistently #605

serban300 commented Jun 19, 2024 •

edited

Loading

serban300 commented Jul 17, 2024

koute Jul 22, 2024

serban300 Jul 22, 2024

koute Jul 22, 2024

bkchr Jul 22, 2024

koute Jul 22, 2024

bkchr Jul 22, 2024

koute Jul 22, 2024

serban300 Jul 22, 2024 •

edited

Loading

koute Jul 22, 2024

serban300 Jul 22, 2024

koute Jul 22, 2024

serban300 Jul 22, 2024 •

edited

Loading

serban300 commented Jul 22, 2024 •

edited

Loading

Use MAX_PREALLOCATION consistently #605

Use MAX_PREALLOCATION consistently #605

Conversation

serban300 commented Jun 19, 2024 • edited Loading

serban300 commented Jul 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

serban300 Jul 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

serban300 Jul 22, 2024 • edited Loading

Choose a reason for hiding this comment

serban300 commented Jul 22, 2024 • edited Loading

Use `MAX_PREALLOCATION` consistently #605

Use `MAX_PREALLOCATION` consistently #605

serban300 commented Jun 19, 2024 •

edited

Loading

serban300 Jul 22, 2024 •

edited

Loading

serban300 Jul 22, 2024 •

edited

Loading

serban300 commented Jul 22, 2024 •

edited

Loading