-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix last cache, and make multithreaded benchmarks actually run multithreaded. #15
Conversation
Codecov Report
@@ Coverage Diff @@
## master #15 +/- ##
==========================================
+ Coverage 90.54% 94.63% +4.09%
==========================================
Files 3 3
Lines 148 149 +1
==========================================
+ Hits 134 141 +7
+ Misses 14 8 -6
Continue to review full report at Codecov.
|
Last fix seems to have broken CI. |
For the tested default vector length, 10 was much too short. |
Test pass locally for me as well (macOS). I guess we could
|
An upper bound could be reasonable, but it seems we're capable of hitting pretty extreme lower bounds in these benchmarks. |
Basic setup on master, see https://git.uni-paderborn.de/pc2-ci/julia/STREAMBenchmark-jl/-/jobs. |
https://github.com/JuliaPerf/STREAMBenchmark.jl/runs/4076840066?check_suite_focus=true#step:6:126
Maybe we should add a small tolerance here. |
Hope you don't mind that I'm taking the liberty to push here (merge the latest changes on master etc.) I've introduced a |
On the PC² CI runner I now get
with this PR. This looks good. The expected bandwidth for 20 threads is ~100 GB/s, see https://github.com/JuliaPerf/BandwidthBenchmark.jl/blob/main/benchmarks/noctua_pc2/bwbench/bwbench.out#L185 |
I'll merge this right away as it fixes an important bug ( |
FYI, I noticed that the ultra-low memory bandwidth values also occurred for the PC² CI runners (on our cluster). Investingating this, I noticed that only the run with |
|
The
last_cachesize
method from my previous PR accidentally returned the first cache size instead.