GossipSub: Limit flood publishing #911
Conversation
Codecov Report
Additional details and impacted files

@@            Coverage Diff             @@
##           unstable     #911      +/-   ##
============================================
- Coverage     83.51%   83.46%   -0.05%
============================================
  Files            91       91
  Lines         15144    15145       +1
============================================
- Hits          12647    12641       -6
- Misses         2497     2504       +7
msgSize = data.len
bandwidth = 25_000_000 #TODO replace with bandwidth estimate
msToTransmit = max(msgSize div (bandwidth div 1000), 1)
maxFloodPublish =
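The last assignment is cut off in this view. As a rough sketch of how maxFloodPublish could plausibly be derived from the values above (assuming a heartbeat interval in milliseconds, e.g. the 700 ms mentioned in the description - this is not necessarily the exact merged code):

```nim
# Sketch only: limit flood publishing to roughly one heartbeat of transmission.
let
  data = newSeq[byte](1_000_000)   # example: a 1 MB message
  heartbeatMs = 700                # assumed heartbeat interval, in ms
  msgSize = data.len
  bandwidth = 25_000_000           # bytes/s, hardcoded until a real estimate exists
  msToTransmit = max(msgSize div (bandwidth div 1000), 1)   # ~40 ms for 1 MB
  maxFloodPublish = heartbeatMs div msToTransmit            # 700 div 40 = 17 peers
echo maxFloodPublish
```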
no need for a min?
If the messages are that big (2 MB), it will effectively disable flood publish.
The idea of this PR is that flood publishing shouldn't last longer than one heartbeat, since after that we will be busy responding to IWANT requests.
consider the case where we're not subscribed to the topic - sending to 0 peers seems a bit harsh then...
Right, that won't happen before >50 MB, but it's still good to cover it
Thought a bit more about this, and I think we want to put the minimum at dmin at least.
It feels very risky to send to only one peer - if that peer is slow, it'll delay the message for all peers by a heartbeat, and it relies a bit too heavily on the IHAVE/IWANT mechanism to recover, ie there's no redundancy...
I know there's a cost here for sending big messages from slow peers, but I think the risk for normal/high-bandwidth peers is more real and costly.
The minimum is currently dLow (it will be caught by the `if peers.len < g.parameters.dLow:` below)
aren't fanout peers populated only when we're not subscribed? also as such, there might not be enough of them in the fanout table either?
We also use them when we can't publish to enough peers (ie bad mesh), and they are replenished when < dLow
> We also use them when we can't publish to enough peers (ie bad mesh), and they are replenished when < dLow

So sending to `g.gossipsub[topic]` as a last resort wouldn't be appropriate?
This should be the same as using the fanout - the fanout is basically only a small cache of gossipsub, to have more stable diffusion routes.
After 1 heartbeat, should we send an IHAVE to all peers we didn't flood it to?
Not sure that's wise; most likely they will all reply with IWANTs, and we will be back to square one.
Hm, what might make sense here is an exponential increase per heartbeat: if we send the message to 4 peers, we send an IHAVE to 4 peers at the next heartbeat, then 8, and so on for each heartbeat that passes - that should help avoid eclipsing while at the same time allowing the message to spread before.
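A tiny sketch of that exponential-spread suggestion (the proc name and the doubling schedule are illustrative only, not an implemented mechanism):

```nim
# Sketch of the suggestion above: after flood-publishing to `initialPeers`,
# announce via IHAVE to exponentially more peers on each subsequent heartbeat.
proc ihavesAtHeartbeat(initialPeers, heartbeatsElapsed: int): int =
  ## Peers to send an IHAVE to at a given heartbeat (1-based) after publishing.
  initialPeers * (1 shl (heartbeatsElapsed - 1))

when isMainModule:
  for h in 1 .. 3:
    echo ihavesAtHeartbeat(4, h)   # 4, 8, 16
```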
if g.parameters.floodPublish:
  let
    msgSize = data.len
    bandwidth = 25_000_000 #TODO replace with bandwidth estimate
I suspect we need to make this a (debug) parameter until we have something better - ie in nimbus-eth2, this would be a hidden command-line parameter that we pull out in case of weird behavior - hidden, because obviously it's something that should go away at some point
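For illustration, a minimal sketch of what such a (debug) parameter could look like; the field name `bandwidthEstimateBps` and its exposure as a hidden CLI flag are assumptions, not an existing nim-libp2p or nimbus-eth2 option:

```nim
# Sketch only: make the hardcoded bandwidth a tunable (debug) parameter until a
# real estimate exists. `bandwidthEstimateBps` is an assumed name, not the API.
type DebugGossipParams = object
  bandwidthEstimateBps: int   # estimated uplink bandwidth, in bytes per second

proc effectiveBandwidth(p: DebugGossipParams): int =
  # fall back to the current hardcoded value when the parameter is unset
  if p.bandwidthEstimateBps > 0: p.bandwidthEstimateBps
  else: 25_000_000

when isMainModule:
  echo effectiveBandwidth(DebugGossipParams())                                   # 25000000
  echo effectiveBandwidth(DebugGossipParams(bandwidthEstimateBps: 12_500_000))   # 12500000
```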
@diegomrsantos not sure I like the refactor you did, the splits seem a bit random, and the flow became harder to follow IMO. Not sure; something like:

if g.parameters.floodPublish:
  addFloodPublishPeers(g, topic, data)
if peers.len < g.parameters.dLow:
  # not subscribed, or bad mesh, send to fanout peers
  addFanoutPeers(g, topic, peers)
@Menduist the original code was confusing to me, but feel free to remove or modify the changes.
Part of #854
This PR adds a limit to flood publishing, so that we spend at most 1 heartbeat flood publishing.
Of course, this requires knowing our bandwidth, which is currently hardcoded to 100 mbps, but this will be replaced by an actual estimate later.
With a 700 ms heartbeat and a 1 MB message size, we will send it to a maximum of 17 peers (including the mesh).
The "1 heartbeat" period is used because after that, we will probably be responding to IWANTs.
The next step for flood publishing will be to integrate #850 (staggered sending), which will not only give us the bandwidth estimate, but also send first to the mesh and then to the flood-publish peers.
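A rough, self-contained sketch of the flow described above (not the actual nim-libp2p code; the proc name, string peer IDs, and fallback details are assumptions for illustration):

```nim
# Rough sketch of the publish flow described above, not the actual nim-libp2p
# code: flood-publish to at most one heartbeat's worth of peers, then top up
# from fanout/gossipsub peers if we are still below dLow.
proc selectPublishPeers(floodCandidates, fanoutPeers: seq[string];
                        msgSize, bandwidth, heartbeatMs, dLow: int): seq[string] =
  let
    msToTransmit = max(msgSize div (bandwidth div 1000), 1)
    maxFloodPublish = heartbeatMs div msToTransmit
  result = floodCandidates[0 ..< min(maxFloodPublish, floodCandidates.len)]
  if result.len < dLow:
    # not subscribed, or bad mesh: fall back to fanout peers
    for p in fanoutPeers:
      if result.len >= dLow: break
      if p notin result:
        result.add p

when isMainModule:
  # 1 MB message, 25_000_000 B/s, 700 ms heartbeat -> up to 17 flood targets
  echo selectPublishPeers(@["a", "b", "c"], @["d", "e"],
                          1_000_000, 25_000_000, 700, 4)   # @["a", "b", "c", "d"]
```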