
[Networking] Gossipsub content-addressed messages #1528

Closed
AgeManning opened this issue Dec 16, 2019 · 9 comments

@AgeManning
Contributor

AgeManning commented Dec 16, 2019

Currently, gossipsub uses source_peer_id + sequence_number to address the messages sent across the gossipsub network. If a client re-publishes a seen message, it will look like a new message to all other peers. This can lead to duplicate messages on the network, which then need to be filtered at the application layer.

Rust and Go (libp2p/go-libp2p-pubsub#248) now have the ability to customise the message id of gossipsub messages. I propose we set the gossipsub message id to base64(sha256(data)), where data is the gossipsub protobuf data field, which typically contains our ssz-encoded or snappy-compressed data.

This way, gossipsub will filter out duplicate messages before notifying the application layer. In principle, we then just need to verify the hash at the application layer to ensure duplicates aren't sent/received.
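
A minimal sketch of the proposed id function (Python, standard library only; `compute_message_id` and the example payload are illustrative names, not from any spec):

```python
import base64
import hashlib

def compute_message_id(data: bytes) -> str:
    """base64(sha256(data)), where data is the gossipsub protobuf data
    field, i.e. the ssz-encoded or snappy-compressed payload."""
    return base64.b64encode(hashlib.sha256(data).digest()).decode("ascii")

# The id depends only on the content, not on source peer id or sequence
# number, so a re-published message keeps the same id:
payload = b"example-ssz-bytes"
assert compute_message_id(payload) == compute_message_id(payload)
```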

@Nashatyrev
Member

@AgeManning

If a client re-publishes a seen message,

Is this considered malicious client behavior, or do you mean some real use case?

If this is to protect against malicious clients, then I don't understand why it's a problem to filter this at the application layer. Duplicates would be filtered at the app layer and would not be propagated, the same way any other invalid payload (e.g. an invalid validator signature) would be filtered at the app layer and not propagated to the network.

@AgeManning
Contributor Author

AgeManning commented Dec 17, 2019

As a real use case, Prysm re-publishes all gossip messages, which changes the source id and sequence number and makes old messages look like new ones. As I understand it, this is because the Go API for validating messages doesn't simply provide a method for validation followed by re-propagation.

Currently there is nothing in the specification that prevents clients from re-publishing seen messages, so I don't think this is strictly malicious behaviour.

For our use, we don't particularly care about the source of a message, mainly just its contents, so it makes sense (at least to me) to content-address messages to prevent duplication at the gossip layer.

If a client fails to filter at the application layer (for whatever reason) and re-propagates a message to a client that re-publishes it, I think it's possible to get into propagation loops of the same message, which spams the gossip channel. This would be one extra step to prevent such scenarios (even though I agree this could also be solved at the application layer).

@AgeManning
Contributor Author

As a fun side-effect, messages will also be a few bytes smaller :)

@Nashatyrev
Member

As I understand it, this is because the Go API for validating messages doesn't simply provide a method for validation followed by re-propagation.

Oh, this sounds more like a Go implementation issue.
IMHO, validating and re-publishing doesn't look like the way gossipsub is intended to be used :(

Otherwise, content-addressing looks OK, but we may need to consider the hash computation overhead and possible related DoS attacks.

@vyzo

vyzo commented Dec 17, 2019

This isn't true; the Go implementation has validators, and messages are only propagated if they pass validation.

@vyzo

vyzo commented Dec 17, 2019

Note that the validators don't have to return at once; they can take a while to complete, and that's fine.

@djrtwo
Contributor

djrtwo commented Dec 17, 2019

I support this, and base64(sha256(data)) looks good. Want to put together a PR, @AgeManning?

In the general case, it seems that libp2p pubsub should handle this for us as long as we don't re-publish messages after validation. That said, there are cases in which messages might very well be published by two independent nodes, and in that case it would be advantageous to see them as the same message within pubsub.

The example I'm thinking of is one ValidatorClient (VC) connected to multiple BeaconNodes (BN). In this type of setup a VC might simultaneously publish signed messages to more than one BN to add some redundancy and potentially some speed gains in propagation.

It should be noted that we still need application-layer validation of non-duplicates for AggregateAndProof messages on the beacon_aggregate_and_proof pubsub topic. Two "aggregators" might compute the same aggregate attestation but have a different "proof" signature in the wrapper container. A naive base64(sha256(data)) would not catch the duplicate attestation in this case, as the sketch below shows. This validation is already in the networking spec and should remain there.
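
A toy sketch of that caveat (the byte strings are hypothetical stand-ins for the SSZ-encoded AggregateAndProof fields): two wrappers around an identical aggregate that differ only in the proof signature get different content-addressed ids, so gossipsub would let both through.

```python
import base64
import hashlib

def message_id(data: bytes) -> str:
    return base64.b64encode(hashlib.sha256(data).digest()).decode("ascii")

# Stand-in for SSZ serialization: wrapper = aggregate bytes + proof signature.
aggregate = b"identical-aggregate-attestation"
wrapper_a = aggregate + b"selection-proof-from-aggregator-A"
wrapper_b = aggregate + b"selection-proof-from-aggregator-B"

# Same underlying attestation, different wrappers -> different message ids,
# so the duplicate aggregate must still be caught at the application layer.
assert message_id(wrapper_a) != message_id(wrapper_b)
```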

@vyzo

vyzo commented Dec 17, 2019

If you are using the hash as the message ID, then they won't get republished (as long as they are within the time cache window).
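
Roughly the behaviour being described, as a sketch (a hypothetical `SeenCache`; the real go-libp2p-pubsub time cache is a different, sliding-window structure):

```python
import time

class SeenCache:
    """Toy 'seen messages' cache keyed by content-addressed message id."""

    def __init__(self, ttl_seconds: float = 120.0):
        self.ttl = ttl_seconds
        self.first_seen_at: dict[str, float] = {}

    def accept(self, msg_id: str) -> bool:
        """Return True only the first time an id is seen within the window."""
        now = time.monotonic()
        # Evict entries that have aged out of the window.
        self.first_seen_at = {
            m: t for m, t in self.first_seen_at.items() if now - t < self.ttl
        }
        if msg_id in self.first_seen_at:
            return False  # duplicate within the window: drop, don't re-propagate
        self.first_seen_at[msg_id] = now
        return True
```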

@djrtwo
Contributor

djrtwo commented Jan 3, 2020

closed via #1538
