Skip to content

my-vegetable-has-exploded/dot-bench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Benchmark

  • install rust
  • change to nightly
rustup override set nightly
  • run benchmark
RUSTFLAGS='-C target-cpu=native' cargo bench

some result

avx512+vnni

ubuntu@ip-172-31-65-183:~/rust/dot-bench$ cargo bench
    Finished `bench` profile [optimized] target(s) in 0.01s
     Running unittests src/lib.rs (target/release/deps/dot_bench-787c47d968a41971)

running 6 tests
test tests::test_dot ... ignored
test tests::bench_dot_avx512     ... bench:         321 ns/iter (+/- 9)
test tests::bench_dot_f32        ... bench:       6,079 ns/iter (+/- 26)
test tests::bench_dot_fallback   ... bench:         430 ns/iter (+/- 8)
test tests::bench_dot_vnni       ... bench:         229 ns/iter (+/- 3)
test tests::bench_dot_vnni_fault ... bench:         185 ns/iter (+/- 3)

test result: ok. 0 passed; 0 failed; 1 ignored; 5 measured; 0 filtered out; finished in 4.94s

armv8+sve

running 12 tests
test tests::bench_dot_f16_auto_vectorization     ... bench:         458 ns/iter (+/- 7)
test tests::bench_dot_f16_fallback               ... bench:      26,929 ns/iter (+/- 465)
test tests::bench_dot_f16_sve                    ... bench:         556 ns/iter (+/- 9)
test tests::bench_dot_f32_fallback               ... bench:       1,178 ns/iter (+/- 4)
test tests::bench_dot_f32_simsimd                ... bench:       1,922 ns/iter (+/- 4)
test tests::bench_dot_f32_sve                    ... bench:       1,246 ns/iter (+/- 3)
test tests::bench_dot_f32_sve_auto_vectorization ... bench:       1,178 ns/iter (+/- 6)
test tests::bench_dot_i8_fallback                ... bench:         544 ns/iter (+/- 13)
test tests::bench_dot_i8_simsimd                 ... bench:         450 ns/iter (+/- 7)
test tests::bench_dot_i8_sve                     ... bench:         242 ns/iter (+/- 0)
test tests::bench_dot_i8_sve_auto_vectorization  ... bench:         543 ns/iter (+/- 18)

show assembly code

install cargo-show-asm

cargo install cargo-show-asm

show assembly code of certain function

cargo asm --lib dot_f16_fallback

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published