approval-voting: Make tests deterministic #3899
Conversation
With random connectivity and latency it is hard to actually measure a delta in the benchmarking, so disable them in order to get fully deterministic behaviour when measuring performance. Signed-off-by: Alexandru Gheorghe <alexandru.gheorghe@parity.io>
This sounds like a good idea to try. It would also reduce the number of async tasks, since we currently spawn one per message to implement latency.
However, I'd expect us to generate deterministic latencies for the tests instead, so we get the same thing in all CI runs.
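As a sketch of what deterministic latencies could look like (assuming the `rand` crate; the helper name, seed, and range here are hypothetical, not the actual test code), seeding the RNG fixes the latency sequence across runs:

```rust
// Hypothetical helper: draw latencies from a seeded RNG so every CI run
// sees the exact same sequence. Names and ranges are illustrative.
use rand::{rngs::StdRng, Rng, SeedableRng};
use std::time::Duration;

fn deterministic_latencies(n: usize) -> Vec<Duration> {
    let mut rng = StdRng::seed_from_u64(42); // fixed seed => reproducible
    (0..n)
        .map(|_| Duration::from_millis(rng.gen_range(0u64..100)))
        .collect()
}
```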
@@ -347,7 +347,7 @@ impl TestEnvironment {
				break
			}
			// Check value every 50ms.
Suggested change:
- // Check value every 50ms.
+ // Check value every 1000ms.
But why?
Accidentally changed it :D, will revert it.
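For context, the loop touched by the suggestion above presumably looks something like this reconstruction (hypothetical; the actual `TestEnvironment` code may differ): it polls a condition and sleeps for the commented interval between checks.

```rust
// Hedged reconstruction of the polling loop around the diffed lines; only
// the `break` and the interval comment appear in the hunk above.
use std::time::Duration;

async fn wait_until(mut done: impl FnMut() -> bool) {
    loop {
        if done() {
            break
        }
        // Check value every 50ms.
        tokio::time::sleep(Duration::from_millis(50)).await;
    }
}
```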
I noticed that sending messages in a spawned task, even when the latency is zero, gives more stable results. Without it, with zero latency, results can spike up to 40%.

```rust
pub async fn send_message(&mut self, message: NetworkMessage) {
	self.tx_limiter.reap(message.size()).await;
	let to_node = self.to_node.clone();
	let latency = std::time::Duration::from_millis(self.latency_ms as u64);
	// Emulate RTT latency: always send from a spawned task, sleeping first
	// if a non-zero latency is configured.
	self.spawn_handle
		.spawn("peer-latency-emulator", "test-environment", async move {
			if !latency.is_zero() {
				tokio::time::sleep(latency).await;
			}
			to_node.unbounded_send(message).expect("Sending to the node never fails");
		});
}
```
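For reference, a minimal self-contained sketch of the same pattern, with hypothetical names and tokio channels standing in for the unbounded futures channel above; it illustrates the decoupling, not the project's actual code. Because the send always runs on a spawned task, the caller's timing stays independent of the receiver even at zero latency.

```rust
// Illustrative sketch: always hand the send to a spawned task, even with
// zero latency, so the sender's timing is decoupled from the receiver.
use std::time::Duration;
use tokio::sync::mpsc;

fn send_via_spawn(tx: mpsc::UnboundedSender<u64>, msg: u64, latency: Duration) {
    tokio::spawn(async move {
        if !latency.is_zero() {
            tokio::time::sleep(latency).await;
        }
        tx.send(msg).expect("receiver alive");
    });
}

#[tokio::main]
async fn main() {
    let (tx, mut rx) = mpsc::unbounded_channel();
    send_via_spawn(tx, 7, Duration::ZERO);
    assert_eq!(rx.recv().await, Some(7));
}
```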
I tried running the availability benchmarks with zero latency, but I can't say I see a meaningful difference; it doesn't even make sense to post the results here. However, I would try applying this to our availability benchmarks and watching the charts of the results over time.
Implements the idea from #3899
- Removed latencies
- Reduced the number of runs from 50 to 5; according to local runs that is quite enough
- Network messages are always sent in a spawned task, even if latency is zero. Without it, CPU time sometimes spikes.
- Removed the `testnet` profile, because we probably don't need those debug additions.

After the local tests I can't say that it brings a significant improvement in the stability of the results. However, I believe it is worth trying and looking at the results over time. The sketch after this list shows the shape of the change.
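To make the change concrete, here is a hypothetical sketch of the resulting setup; field names and types are illustrative, not the actual subsystem-bench configuration:

```rust
// Illustrative only: what "deterministic" means for the benchmark setup.
struct BenchSettings {
    /// Random per-message latency range; `None` disables latency emulation.
    latency: Option<std::ops::Range<u64>>,
    /// Percentage of connected peers; 100 removes connectivity randomness.
    connectivity: u8,
    /// Benchmark repetitions; reduced from 50 to 5.
    num_runs: usize,
}

fn deterministic_settings() -> BenchSettings {
    BenchSettings { latency: None, connectivity: 100, num_runs: 5 }
}
```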
With random connectivity and latency it is hard to actually measure a delta in the benchmarking, so disable them in order to get fully deterministic behaviour when measuring performance. At least on my machine, with this configuration the results for approval-throughput are really similar between subsequent runs:

```
CPU usage, seconds     total     per block
approval-distribution  36.9025   3.6902
approval-distribution  36.7579   3.6758
approval-distribution  37.0418   3.7042
approval-distribution  37.0339   3.7034
approval-distribution  36.9342   3.6934
approval-distribution  36.7177   3.6718
approval-voting        52.7756   5.2776
approval-voting        52.5999   5.2600
approval-voting        53.2158   5.3216
approval-voting        53.2493   5.3249
approval-voting        52.8524   5.2852
approval-voting        52.8611   5.2861
approval-voting        52.8210   5.2821
```

---------

Signed-off-by: Alexandru Gheorghe <alexandru.gheorghe@parity.io>