This repository has been archived by the owner on Nov 15, 2023. It is now read-only.

Implementer's Guide: Incorporate HRMP to TransientValidationData #1588

Merged
17 commits merged into master from ser-hrmp-after-validation-data
Aug 31, 2020

Conversation

pepyakin
Contributor

@pepyakin pepyakin commented Aug 14, 2020

Updates TransientValidationData according to my comment here #1539 (comment)

Related to #1576
Follow-up #1629

What to look for

It is important to ensure that TransientValidationData contains all the information needed to perform checks equivalent to those in the Router's acceptance criteria.

@pepyakin pepyakin added A0-please_review Pull request needs code review. B0-silent Changes should not be mentioned in any release notes C1-low PR touches the given topic and has a low impact on builders. labels Aug 14, 2020
@rphmeier
Contributor

Will there be a follow-up for PersistedValidationData? For instance, to store the incoming (downward) messages?

hrmp_digest: Vec<BlockNumber>,
/// The watermark of the HRMP. That is, the block number up to which (inclusive) all HRMP messages
/// sent to the parachain are processed.
hrmp_watermark: BlockNumber,
Contributor

@rphmeier rphmeier Aug 18, 2020

@bkchr One thing to watch out for here is that because this is in the TransientValidationData, it is not a direct parameter to the validation function and instead will be provided via inherent. That means that a malicious collator can provide something different (e.g. 100 instead of 90) and the relay-chain will still accept the block. So it is possible for this value to move backwards from the parachain's perspective, and Cumulus needs to account for that. The same goes for the other values in this struct.

Contributor

Actually, after conversation in DM with Sergei, our conclusion was that this watermark should not be provided via inherent and instead Cumulus should be tracking the watermark internally.
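
For illustration, here is a minimal sketch of that idea with hypothetical names (not actual Cumulus code): the parachain keeps the watermark in its own state and only ever advances it, so nothing supplied by a collator can rewind it.

// Hypothetical sketch, not actual Cumulus code: the parachain tracks the
// watermark itself instead of taking it from the inherent.
type BlockNumber = u32;

#[derive(Default)]
struct ParachainHrmpState {
    /// Tracked internally and persisted across parachain blocks.
    hrmp_watermark: BlockNumber,
}

impl ParachainHrmpState {
    /// Advance the watermark after all HRMP messages sent up to (and including)
    /// `processed_up_to` have been processed. It never moves backwards, so a
    /// collator echoing a stale value cannot rewind it.
    fn advance_hrmp_watermark(&mut self, processed_up_to: BlockNumber) {
        if processed_up_to > self.hrmp_watermark {
            self.hrmp_watermark = processed_up_to;
        }
    }
}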

Member

Not sure I understand this here, why isn't the relay chain checking that the watermark is >= old_watermark?

Contributor Author

@pepyakin pepyakin Aug 24, 2020

The relay-chain actually does check that the watermark is > old_watermark.

See this excerpt from router.md:

check_hrmp_watermark(P: ParaId, new_hrmp_watermark):
- new_hrmp_watermark should be strictly greater than the value of HrmpWatermarks for P (if any).
- new_hrmp_watermark must not be greater than the context's block number.
- in HrmpChannelDigests for P an entry with the block number equal to new_hrmp_watermark should exist.
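
A minimal Rust sketch of those three checks, with hypothetical in-memory maps standing in for the Router module's HrmpWatermarks and HrmpChannelDigests storage (illustrative only, not the runtime implementation):

use std::collections::HashMap;

type ParaId = u32;
type BlockNumber = u32;

/// Stand-ins for the Router module's storage, simplified for the sketch.
struct Router {
    hrmp_watermarks: HashMap<ParaId, BlockNumber>,
    hrmp_channel_digests: HashMap<ParaId, Vec<BlockNumber>>,
}

impl Router {
    fn check_hrmp_watermark(
        &self,
        para: ParaId,
        context_block_number: BlockNumber,
        new_hrmp_watermark: BlockNumber,
    ) -> bool {
        // Strictly greater than the previous watermark, if there is one.
        if let Some(&old) = self.hrmp_watermarks.get(&para) {
            if new_hrmp_watermark <= old {
                return false;
            }
        }
        // Not greater than the context's block number.
        if new_hrmp_watermark > context_block_number {
            return false;
        }
        // The digest for this para must have an entry at exactly this block number.
        self.hrmp_channel_digests
            .get(&para)
            .map_or(false, |digest| digest.contains(&new_hrmp_watermark))
    }
}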

Member

What happens if the parachain cannot process all downward messages of a given relay chain block and thus cannot increase the value?

Contributor Author

FWIW, in this particular place we are discussing the HRMP watermark. For XCMP, and by extension HRMP, we have agreed on a simplifying assumption: we assume that a parachain can receive and process all the incoming messages within a block. I think that might be fine for HRMP: I don't think we will reach usage levels where this could be a problem, and we are going to put aggressive limits on HRMP anyway (I see that happening within the upcoming #1632).

For XCMP I think we might want to revisit this assumption. There are circumstances where the parachain is bombarded by a stream of messages over all its channels and processing all of them just doesn't fit into the PVF time or PoV space budget. Although that doesn't seem likely, I also feel that this simplifying assumption doesn't give us a lot - maybe it would not be that hard to handle that case from the start?

Then, regarding downward messages. In the current version that we are discussing here, downward messages are implemented the same way you (@bkchr) introduced them, i.e. instead of a watermark we just use a simple processed_downward_messages counter. I liked this approach because, well, I didn't see a lot of reasons to use watermarks, but then today I finally broke down and switched to watermarks in #1629 (see the PR description to find out why).

But now I think maybe it would be a good idea to revert it back to processed_downward_messages. I'll think about that.

Member

Thank you for the explanation :)

Although that doesn't seem likely, I also feel that this simplifying assumption doesn't give us a lot - maybe it would not be that hard to handle that case from the start?

I'm still not sure that this is true; IMHO we can reach this limit relatively easily. In the end the parachain wants to process some of its own transactions as well. Just think about a parathread that collects messages over quite some time - the parachain will probably not be able to process them all in one block.

Contributor Author

@pepyakin pepyakin Aug 25, 2020

I think it largely depends on the parametrization, and there are two parameters in play: weight and PoV size. I don't think it is productive to speak about weight here, because it is highly dependent on the exact code of the PVF at hand, and primarily it is the PVF that will be designed not to violate any constraints put onto it by Polkadot (although we need to come up with a less constraining system overall - more on this later). Therefore, let's assume for now that received messages will always fit into the weight constraints.

Regarding the PoV size, this could be enforced simply.

Since:

  1. there can be only so many paras, N, executed at the same time (I think currently we aim for N = 100 "cores"), and
  2. every parachain can target the same single parachain, and
  3. every parachain can only send one XCMP/HRMP message per block per recipient

we arrive at a worst case of N messages received at the same time. We can then say that we want to devote at most half of the PoV to messages. Dividing the two gives us the maximum size of a message, which makes it impossible to overwhelm the parachain as long as its PVF doesn't devote more than half of its PoV to messages (and, once again, it should follow the constraints unless it accepts the possibility of getting into the aforementioned unfortunate situation).
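
To make the arithmetic concrete, here is a sketch with illustrative numbers; the PoV budget below is an assumption, while N = 100 and the one-message-per-recipient-per-block rule come from the points above:

// Illustrative figures only: the PoV budget is assumed here, and the real
// limits are configuration parameters on the relay chain.
const N_CORES: u64 = 100; // worst case: one message from every core's para
const ASSUMED_POV_BUDGET: u64 = 5 * 1024 * 1024; // assumed 5 MiB PoV budget
const MESSAGE_SHARE: u64 = 2; // devote at most 1/2 of the PoV to messages

/// With one message per sender per block, N_CORES messages in the worst case
/// must fit into the message share of the PoV.
fn max_message_size() -> u64 {
    (ASSUMED_POV_BUDGET / MESSAGE_SHARE) / N_CORES
}

fn main() {
    // 5 MiB / 2 / 100 = 26214 bytes (~25.6 KiB) per message under these assumptions.
    println!("max XCMP/HRMP message size: {} bytes", max_message_size());
}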

Also, to clarify, I say it is an unlikely situation because I don't think getting 100 slots and spamming another chain is an easy task. I feel that just going to that chain directly and spamming it might be cheaper. (That is not to say we shouldn't keep this vector in mind!)

Regarding parathreads, I don't think they are too special here. The alternative, as far as I understand your concern, is to allow processing messages not just a block's worth at a time but message-by-message. In that case, it would be approximately the same.

As an aside: there is also a point related to one of our recent discussions in the parachains team, i.e. parathreads should be able to process all their messages before those messages slide beyond the availability pruning window, which should be less than a day.

Member

3. every parachain can only send one XCMP/HRMP message per block

Okay, I never heard of this before. However, this clearly lowers the number of messages :)

Contributor Author

I beg your pardon! It is actually

every parachain can only send one XCMP/HRMP message per block per recipient

@pepyakin pepyakin force-pushed the ser-hrmp-after-validation-data branch from 4bbb9a9 to af3ce69 on August 18, 2020 14:38
@pepyakin
Contributor Author

Will there be a follow-up for PersistedValidationData? For instance, to store the incoming (downward) messages?

FWIW, a follow-up is here: #1629

roadmap/implementers-guide/src/types/candidate.md (outdated, resolved)

Co-authored-by: Bastian Köcher <bkchr@users.noreply.github.com>
@pepyakin pepyakin added A3-in_progress Pull request is in progress. No review needed at this stage. and removed A0-please_review Pull request needs code review. labels Aug 24, 2020
@pepyakin pepyakin added A0-please_review Pull request needs code review. and removed A3-in_progress Pull request is in progress. No review needed at this stage. labels Aug 24, 2020
@pepyakin
Contributor Author

pepyakin commented Aug 24, 2020

I grouped all data related to HRMP and split it out into a new struct, HrmpTransientValidationData. I also added some data that was missing to check the acceptance criteria for a candidate, making this PR good to go.

I used quite a bit of git-fu (between this and a not-yet-published PR), so there might be some rebase artifacts.

Note that some of the data there is superfluous. If we had #1594, it wouldn't have needed to live in TransientValidationData, because this data does not inform the collator.

Take open_requests/close_requests for example. The decision to open or close a channel is made by the PVF, not by a collator. If the PVF decides to open another channel beyond the allowed number of channels anyway, then all bets are off and the collator won't be able to do anything about it.

One piece that is still missing is how the messages get inside; the follow-up #1629 doesn't directly address this yet. Another is the maximum number of channels, but I will file that as a follow-up with other limiting and tightening. (UPD: follow-up: #1632)
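
For orientation, here is a rough sketch of that struct assembled from the fields quoted in the review threads below; the placeholder type definitions are assumptions and the field set is abridged, so this is not necessarily the exact shape that landed in the guide:

// Placeholder types, assumed for illustration only.
type BlockNumber = u32;
type ParaId = u32;
/// A channel is identified by its (sender, recipient) pair.
type HrmpChannelId = (ParaId, ParaId);
struct HrmpAbridgedOpenChannelRequest {
    confirmed: bool,
    // ...other fields elided
}

/// HRMP-related transient data, all collected before the candidate execution.
struct HrmpTransientValidationData {
    /// HRMP digest entries relevant to this para, ordered by ascending block
    /// number, without duplicates.
    digest: Vec<BlockNumber>,
    /// The block number up to which (inclusive) all HRMP messages sent to this
    /// parachain were processed, i.e. the watermark as of the context block.
    watermark: BlockNumber,
    /// Open requests in which the para participates either as sender or
    /// recipient, ordered ascending by `HrmpChannelId`, without duplicate ids.
    open_requests: Vec<(HrmpChannelId, HrmpAbridgedOpenChannelRequest)>,
    /// Close requests in which the para participates either as sender or
    /// recipient, without duplicate `HrmpChannelId`s.
    close_requests: Vec<HrmpChannelId>,
}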

/// elements are ordered by ascending the block number. The vector doesn't contain duplicates.
digest: Vec<BlockNumber>,
/// The watermark of the HRMP. That is, the block number up to which (inclusive) all HRMP messages
/// sent to the parachain are processed.
Contributor

The docs are a bit unclear. Is this the previous watermark of the para or the new watermark?

Contributor Author

Yeah, it is the previous one. I didn't want to repeat for each field that all of those are collected before the execution of the PVF, so I left a note above:

It's worth noting that all the data is collected before the candidate execution.

Do you think it's better to clarify explicitly that they are from the current state?

open_requests: Vec<(HrmpChannelId, HrmpAbridgedOpenChannelRequest)>,
/// A vector of close requests in which the para participates either as sender or recipient.
/// The vector doesn't contain two entries with the same `HrmpChannelId`.
close_requests: Vec<HrmpChannelId>,
Contributor

Does a channel appear in this vector only during one parablock, or does it appear in this vector as long as the channel has not yet been closed?

Contributor Author

Those are channel close requests, and these two fields contain all the requests registered at the context block before the execution of the PVF. You can think of them basically as a copy of the storage lists found in the router, but filtered down to what is relevant to the parachain at hand.
Like the open/close requests found in the router module storage, they live up until they are handled. For close requests, that is the first session boundary.

/// A vector of open requests in which the para participates either as sender or recipient. The
/// items are ordered ascending by `HrmpChannelId`. The vector doesn't contain two entries
/// with the same `HrmpChannelId`.
open_requests: Vec<(HrmpChannelId, HrmpAbridgedOpenChannelRequest)>,
Contributor

I wonder if this and close requests need to be part of PersistedValidationData. We don't want the collator to be able to forge this data.

Or maybe the hash of that data should be in the PersistedValidationData.

Contributor Author

Right, the PVF needs a way to authenticate the following events:

  1. there is a new inbound open request
  2. an outbound open request was confirmed by the recipient
  3. an outbound open request timed out. This one could theoretically be tracked by the PVF, but only assuming that the configuration doesn't change, which can happen, even if rarely.

(I believe other cases can be tracked by the PVF itself. E.g. it should know that it shouldn't send two open requests to the same recipient, confirm the same inbound open request twice, etc)

A collator should not be able to conceal any of these events.

I see the following solutions:

  1. put raw requests into PersistedValidationData. That inflates the persisted data, but not too badly: the numbers of open and close requests are bounded (the former by the sum of max ingress and egress channels, the latter by the number of already opened channels).
    • Then we can further compress the data by leveraging the fact that either the sender or the recipient is this para.
    • Then, we can remove confirmed: bool for all inbound open requests.
    • This data will be duplicated for successive blocks in the same session though.
  2. put a hash into PersistedValidationData (see the sketch after this list).
    • Essentially, allows us to move this data into the PVF's state and not duplicate it.
  3. send a DM for each of these events.
    • The same as previous but less complex since it relies on an existing mechanism
    • A caveat: the para can be a bit slow to process DMQ and potentially miss a notification or receive it when it's too late.
  4. punt off on the relay chain storage root proofs.
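
A minimal sketch of option 2 (the hash commitment), under assumptions: the types are placeholders, and a real implementation would hash the SCALE encoding with a cryptographic hash such as BLAKE2 rather than the DefaultHasher used here for brevity:

use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Placeholder types, assumed for illustration only.
type ParaId = u32;
type HrmpChannelId = (ParaId, ParaId);
#[derive(Hash)]
struct HrmpAbridgedOpenChannelRequest {
    confirmed: bool,
    // ...other fields elided
}

/// The commitment that would be stored in `PersistedValidationData`.
fn hrmp_requests_hash(
    open_requests: &[(HrmpChannelId, HrmpAbridgedOpenChannelRequest)],
    close_requests: &[HrmpChannelId],
) -> u64 {
    let mut hasher = DefaultHasher::new();
    open_requests.len().hash(&mut hasher);
    for (id, req) in open_requests {
        id.hash(&mut hasher);
        req.hash(&mut hasher);
    }
    close_requests.hash(&mut hasher);
    hasher.finish()
}

/// The PVF recomputes the hash over the requests supplied by the collator and
/// refuses to proceed if it doesn't match the committed value.
fn verify_requests(
    committed: u64,
    open_requests: &[(HrmpChannelId, HrmpAbridgedOpenChannelRequest)],
    close_requests: &[HrmpChannelId],
) -> bool {
    hrmp_requests_hash(open_requests, close_requests) == committed
}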

Contributor Author

If we assume that with #1642 we limit the total number of channels to a two-digit number and then retire HRMP completely, maybe it makes sense to actually go with option 1 for the time being.

Contributor Author

I created #1663 to address this

pepyakin and others added 2 commits August 31, 2020 12:25
@pepyakin
Contributor Author

I think all points were either addressed or have follow-up issues filed, so I am going ahead and merging this to make my life less painful with the rebases of the next PRs.

That said, feel free to leave any comments and I will address them in the follow-up PRs.

@pepyakin
Contributor Author

bot merge

@ghost

ghost commented Aug 31, 2020

Trying merge.

@ghost ghost merged commit f125cbe into master Aug 31, 2020
@ghost ghost deleted the ser-hrmp-after-validation-data branch August 31, 2020 12:01