Improve RadixNode add a few more benchmarks #352

johnynek · 2022-01-16T00:18:54Z

part of #350

This may be marginally faster, but the main value it adds is that RadixNode can now return the matching string without allocating. Also, I think the implementation is somewhat simpler.

I expect this should be faster if the set of strings gets larger (the benchmarks only have a few different prefixes. This uses a hash based approach so it is always O(1) to check a character (not log(N) worst case).

Before:

[info] Benchmark                         Mode  Cnt   Score   Error  Units
[info] StringInBenchmarks.linearMatchIn  avgt    3  77.603 ± 0.529  ns/op
[info] StringInBenchmarks.radixMatchIn   avgt    3  59.675 ± 2.126  ns/op

This PR:

[info] Benchmark                         Mode  Cnt   Score   Error  Units
[info] StringInBenchmarks.linearMatchIn  avgt    3  77.427 ± 4.083  ns/op
[info] StringInBenchmarks.radixMatchIn   avgt    3  59.067 ± 4.797  ns/op

Current benchmark values: [info] Benchmark Mode Cnt Score Error Units [info] StringInBenchmarks.linearMatchIn avgt 3 77.603 ± 0.529 ns/op [info] StringInBenchmarks.radixMatchIn avgt 3 59.675 ± 2.126 ns/op

codecov-commenter · 2022-01-16T01:10:30Z

Codecov Report

Merging #352 (d79bcfa) into main (3acc967) will increase coverage by 0.15%.
The diff coverage is 97.33%.

@@            Coverage Diff             @@
##             main     #352      +/-   ##
==========================================
+ Coverage   96.56%   96.71%   +0.15%     
==========================================
  Files           9        9              
  Lines        1049     1128      +79     
  Branches       94       99       +5     
==========================================
+ Hits         1013     1091      +78     
- Misses         36       37       +1

Impacted Files	Coverage Δ
...e/shared/src/main/scala/cats/parse/RadixNode.scala	`95.31% <95.08%> (-4.69%)`	⬇️
core/shared/src/main/scala/cats/parse/Parser.scala	`96.69% <98.87%> (+0.27%)`	⬆️
...shared/src/main/scala/cats/parse/Accumulator.scala	`100.00% <0.00%> (+2.43%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7b524da...d79bcfa. Read the comment docs.

results: [info] Benchmark (test) Mode Cnt Score Error Units [info] StringInBenchmarks.linearMatchIn foo avgt 3 77.718 ± 3.173 ns/op [info] StringInBenchmarks.linearMatchIn broad avgt 3 92973.234 ± 2156.151 ns/op [info] StringInBenchmarks.oneOfParse foo avgt 3 91.240 ± 4.854 ns/op [info] StringInBenchmarks.oneOfParse broad avgt 3 806.046 ± 37.539 ns/op [info] StringInBenchmarks.radixMatchIn foo avgt 3 59.750 ± 1.315 ns/op [info] StringInBenchmarks.radixMatchIn broad avgt 3 769.000 ± 25.609 ns/op [info] StringInBenchmarks.stringInParse foo avgt 3 91.551 ± 0.600 ns/op [info] StringInBenchmarks.stringInParse broad avgt 3 809.561 ± 54.042 ns/op

johnynek · 2022-01-17T20:18:46Z

I added a benchmark that is close to ideal for the radix tree to have a kind of upper bound comparison. In that example this PR gives performance about 27% faster than main (and 90x faster than a naive linear search).

So, I think it is fair to say the worst case performance of the new code is slightly worse than a linear search (17% comparing the favorable linear search benchmark with the tree). Note, linear search can be very good with only a few alternatives. But when there are many alternatives, the tree can be far, far better.

// With new benchmarks:
[info] Benchmark                         (test)  Mode  Cnt      Score      Error  Units
[info] StringInBenchmarks.linearMatchIn     foo  avgt    3     77.718 ±    3.173  ns/op
[info] StringInBenchmarks.linearMatchIn   broad  avgt    3  92973.234 ± 2156.151  ns/op
[info] StringInBenchmarks.oneOfParse        foo  avgt    3     91.240 ±    4.854  ns/op
[info] StringInBenchmarks.oneOfParse      broad  avgt    3    806.046 ±   37.539  ns/op
[info] StringInBenchmarks.radixMatchIn      foo  avgt    3     59.750 ±    1.315  ns/op
[info] StringInBenchmarks.radixMatchIn    broad  avgt    3    769.000 ±   25.609  ns/op
[info] StringInBenchmarks.stringInParse     foo  avgt    3     91.551 ±    0.600  ns/op
[info] StringInBenchmarks.stringInParse   broad  avgt    3    809.561 ±   54.042  ns/op

// new benchmarks on main
[info] Benchmark                         (test)  Mode  Cnt      Score      Error  Units
[info] StringInBenchmarks.linearMatchIn     foo  avgt    3     78.476 ±    2.503  ns/op
[info] StringInBenchmarks.linearMatchIn   broad  avgt    3  93335.231 ± 2911.905  ns/op
[info] StringInBenchmarks.oneOfParse        foo  avgt    3     96.497 ±    1.285  ns/op
[info] StringInBenchmarks.oneOfParse      broad  avgt    3   1107.582 ±    1.940  ns/op
[info] StringInBenchmarks.stringInParse     foo  avgt    3     96.489 ±    0.397  ns/op
[info] StringInBenchmarks.stringInParse   broad  avgt    3   1106.763 ±    3.840  ns/op

johnynek · 2022-01-18T18:15:29Z

@regadas if you have time, I'd love your review (or anyone).

I think this is ready.

regadas · 2022-01-25T17:18:43Z

Hi, @johnynek this one slipped under my radar. I'll take a look at it tmr.

johnynek · 2022-01-25T17:40:13Z

Thank you! I know this OSS stuff can be a grind. Don't worry at all. I appreciate your help.

regadas

This looks great! Sorry for the delay @johnynek

johnynek · 2022-01-27T08:25:34Z

core/shared/src/main/scala/cats/parse/RadixNode.scala

+  def matchAt(str: String, off: Int): Int =
+    matchAtOrNull(str, off) match {
+      case null => -1
+      case nonNull => nonNull.length


Actually I think this is wrong. It has to be offset + NonNull.length, or the comment is wrong. Maybe we should just fix the comment, but also the Parser is assuming it returns the new offset not the length.

I'll send a PR with a failing test and then fix.

I think this is happening because we are only testing at offset 0.

Nice catch!

Being offset + nonNull.length makes more sense and the consistency with matchAtOrNull reflects it (which needs a fix).

These make more sense now

assert(len <= targ.length, s"len = $len, off = $off") assertEquals(left, targ.substring(off, len), s"len = $len, left = $left")

johnynek added 5 commits January 15, 2022 08:16

Add tests, improve benchmarks

b4a2573

Current benchmark values: [info] Benchmark Mode Cnt Score Error Units [info] StringInBenchmarks.linearMatchIn avgt 3 77.603 ± 0.529 ns/op [info] StringInBenchmarks.radixMatchIn avgt 3 59.675 ± 2.126 ns/op

use a hashing rather than sorting stategy

92adcf8

improve tests

9d814f4

fix a corner case

daab41e

fix 2.11 compilation

58fa16f

johnynek added 2 commits January 17, 2022 09:32

run format

d174703

johnynek added 4 commits January 17, 2022 10:21

fix warning on StringInBenchmarks

b066967

add another test, cleanup

05d274e

code polish

62c7da8

one more test...

d79bcfa

johnynek requested a review from regadas January 18, 2022 18:15

regadas approved these changes Jan 27, 2022

View reviewed changes

regadas merged commit 3dda8ab into main Jan 27, 2022

regadas deleted the oscar/improve_radix_bench branch January 27, 2022 07:56

johnynek commented Jan 27, 2022

View reviewed changes

armanbilge mentioned this pull request Jan 27, 2022

Use scalafmt scala213 runner dialect #359

Merged

johnynek mentioned this pull request Jan 28, 2022

Fix RadixNode.matchAt #362

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve RadixNode add a few more benchmarks #352

Improve RadixNode add a few more benchmarks #352

johnynek commented Jan 16, 2022

codecov-commenter commented Jan 16, 2022 •

edited

Loading

johnynek commented Jan 17, 2022

johnynek commented Jan 18, 2022

regadas commented Jan 25, 2022

johnynek commented Jan 25, 2022

regadas left a comment

johnynek Jan 27, 2022

regadas Jan 27, 2022

Improve RadixNode add a few more benchmarks #352

Improve RadixNode add a few more benchmarks #352

Conversation

johnynek commented Jan 16, 2022

codecov-commenter commented Jan 16, 2022 • edited Loading

Codecov Report

johnynek commented Jan 17, 2022

johnynek commented Jan 18, 2022

regadas commented Jan 25, 2022

johnynek commented Jan 25, 2022

regadas left a comment

Choose a reason for hiding this comment

johnynek Jan 27, 2022

Choose a reason for hiding this comment

regadas Jan 27, 2022

Choose a reason for hiding this comment

codecov-commenter commented Jan 16, 2022 •

edited

Loading