Use same amount of messages for all consumer benchmarks #1382
Conversation
```diff
@@ -21,7 +21,7 @@ trait ComparisonBenchmark extends ZioBenchmark[Env] {
   protected final val nrPartitions: Int = 6
   protected final val topicPartitions: List[TopicPartition] =
     (0 until nrPartitions).map(TopicPartition(topic1, _)).toList
-  protected final val numberOfMessages: Int = 1000000
+  protected final val numberOfMessages: Int = 50000
```
Is it still enough to get a stable benchmark result..?
Most probably. We deemed it sufficient for the zio-kafka based benchmarks. Historically there is variation in the results, but I suspect it's fine.
On the other hand, it might be better to raise both to 100_000. I suspect the benchmark duration will not be hugely affected.
100k messages does affect total runtime, while the ratio between the different benches does not change. So back to 50k.
Force-pushed 2cd592d to 1859be6
This is quite amazing. Doubling the number of messages also almost doubles the throughput! (Both for zio-kafka and the direct consumers.) That means that the benchmarks are not really representative. We probably need to increase the number of messages further, and maybe also make them larger.
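A plausible explanation for this effect (my assumption, not stated in the thread) is that a fixed startup/warm-up cost dominates the measured time, so throughput grows almost linearly with the record count until the per-record work takes over. A minimal Scala sketch of that toy model, with made-up overhead and per-record times:

```scala
// Toy model (assumed, not from the PR): total time = fixed startup overhead
// plus per-record processing time. When the fixed overhead dominates,
// measured throughput grows almost linearly with the record count.
def measuredThroughput(records: Long, fixedOverheadSec: Double, perRecordSec: Double): Double =
  records / (fixedOverheadSec + records * perRecordSec)

val t50k  = measuredThroughput(50000L, 10.0, 0.00002)  // 50k records
val t100k = measuredThroughput(100000L, 10.0, 0.00002) // 100k records
// t100k / t50k = 11/6 ≈ 1.83: close to 2, so at these (invented) parameters
// the fixed overhead still dominates even at 100k records.
```

Only once the record count is large enough that per-record time dwarfs the overhead does measured throughput flatten out to the steady-state rate, which is what a representative benchmark should report.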
We could also look at latency metrics for various RPS loads.
Also: extract common test parameters
Force-pushed 1859be6 to 00f8855
Benches take very long with large messages.
Going from 50k 512-byte records to 100k 1024-byte records makes the benches 3 times slower (not 4 times, as one might expect). This is true for all consumer benchmarks. The benchmarks in total go from ~20 min to ~17 min. With 50k 512-byte records the benchmark workflow takes about as long as before (which had 50k 10-byte records). I would like to merge this change and then rewrite the bench history a bit so that it all stays (roughly) comparable.
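For reference, the naive "4 times" expectation comes from the byte arithmetic: doubling both the record count and the record size quadruples the total bytes moved. A tiny sketch (the ~3x figure is the observed slowdown reported above; the rest is just illustration):

```scala
// Total payload bytes for the two configurations discussed in the comment.
val bytesSmall = 50000L * 512L    // 50k records of 512 bytes
val bytesLarge = 100000L * 1024L  // 100k records of 1024 bytes

// Naive expectation: runtime scales with bytes, i.e. a 4x slowdown.
val expectedFactor = bytesLarge.toDouble / bytesSmall
// Observed in the PR: only ~3x, so runtime is not purely byte-bound.
val observedFactor = 3.0
```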
zio-kafka-bench/src/main/scala/zio/kafka/bench/ZioBenchmark.scala
Also: use `record` instead of `message` for consistency with all other zio-kafka and Kafka documentation.
In this change, common benchmark code is moved into `ZioBenchmark`, `ConsumerZioBenchmark` and `ProducerZioBenchmark`, so that it becomes easier to make the different benchmarks comparable.

After running the consumer benchmarks with different numbers of records and different record sizes per run, this PR settled on 50k records of ~512 bytes per run for all consumer benchmarks. With these amounts, the zio-kafka based benchmarks and the 'comparison' benchmarks have roughly the same scaling elasticity (where 'scaling elasticity' is defined as the throughput growth factor divided by the record-count growth factor).
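The 'scaling elasticity' definition above can be written down directly. The throughput and record-count numbers below are hypothetical, purely to illustrate the formula (the PR does not publish exact figures):

```scala
// Scaling elasticity as defined in this PR: the factor by which throughput
// grew, divided by the factor by which the record count grew. A value near
// 1.0 means throughput still scales with input size (overhead-dominated);
// a value near 0 means throughput is already at its steady-state plateau.
def scalingElasticity(
    throughputBefore: Double,
    throughputAfter: Double,
    recordsBefore: Double,
    recordsAfter: Double
): Double = {
  val throughputGrowth = throughputAfter / throughputBefore
  val recordsGrowth    = recordsAfter / recordsBefore
  throughputGrowth / recordsGrowth
}

// Hypothetical example: doubling records from 50k to 100k while throughput
// grows from 10k to 19k records/s gives an elasticity of 1.9 / 2.0 = 0.95.
val elasticity = scalingElasticity(10000.0, 19000.0, 50000.0, 100000.0)
```

Comparing this number between the zio-kafka benchmarks and the direct-consumer 'comparison' benchmarks is what makes their results comparable even while neither has reached steady state.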
After this PR is merged, the benchmark history will be rewritten with linear scaling so that we can compare historic runs against new runs.