perf tests cleanup #613

brian-kelley · 2020-02-21T18:44:39Z

Lots of cleanup/improvement to perf tests

Remove MyCrsMatrix (instead, using KokkosSparse::CrsMatrix)
Remove spmv/matrix_market.hpp (instead, using the IO utils)
Make spgemm perf test not crash if --xyz is the last command line arg, but a value is expected after it (e.g. --openmp $numthreads). Instead, print error message and exit
Make all perf tests use whatever scalar/ordinal/offset/layout types are enabled by ETI, rather than hardcoding double/int/int/left.
Cleaned up lots of unused parameters in the various raw OpenMP spmv versions
Added Ordinal/Offset (aka lno_t vs. size_type) distinction to the raw spmv versions

Also fixed an issue with Blas1 dot ETI: when scalar=float, the rank-1 version that returns a scalar should use double to accumulate the sum. This case was not being ETI'd, which broke the perf test build. This is now ETI'd correctly.

This PR resolves #583, #493, and #492.

I was able to build and run the unit and perf tests cleanly with serial, openmp and cuda enabled, and with double, float, complex_double and complex_float all enabled. I was also able to build without scalar=double, meaning float was used in all the tests.

RIDE spot check (with double and complex_double):
#######################################################
PASSED TESTS
#######################################################
cuda-10.1.105-Cuda_OpenMP-release build_time=578 run_time=418
cuda-10.1.105-Cuda_Serial-release build_time=515 run_time=521
cuda-9.2.88-Cuda_OpenMP-release build_time=504 run_time=425
cuda-9.2.88-Cuda_Serial-release build_time=553 run_time=526
gcc-6.4.0-OpenMP_Serial-release build_time=244 run_time=394
gcc-7.2.0-OpenMP-release build_time=127 run_time=128
gcc-7.2.0-OpenMP_Serial-release build_time=210 run_time=362
gcc-7.2.0-Serial-release build_time=119 run_time=227
ibm-16.1.0-Serial-release build_time=480 run_time=398

Bowman spot check (with double and complex_double):
#######################################################
PASSED TESTS
#######################################################
intel-16.4.258-Pthread-release build_time=733 run_time=1029
intel-16.4.258-Pthread_Serial-release build_time=1065 run_time=1993
intel-16.4.258-Serial-release build_time=715 run_time=949
intel-17.2.174-OpenMP-release build_time=881 run_time=569
intel-17.2.174-OpenMP_Serial-release build_time=1248 run_time=1468
intel-17.2.174-Pthread-release build_time=813 run_time=893
intel-17.2.174-Pthread_Serial-release build_time=1151 run_time=1784
intel-17.2.174-Serial-release build_time=791 run_time=876

- Cleaning up duplicated MatrixMarket code from perf_test/spmv that exists in IOUtils (kokkos#493) - Changing the scalar/lno_t/size_type/layout to tolerate any ETI combination (previously, only double/int/int/left was really supported)

When doing dot(View<float*>, View<float*>) -> float (or with complex<float>*), dot uses double (or complex<double>) to actually sum the values. This wasn't getting ETI'd correctly, but now perf_test and unit_test both build correctly with float enabled as a scalar.

in InnerProductSpaceTraits, when casting complex<double> to complex<float>

instead of the test_common versions of those which are now gone

Last resort if nothing else is enabled, or nothing else was selected at runtime with argc/argv

If "--flag" expects another argument after, check that there is actually another arg before trying to read it.

brian-kelley · 2020-02-21T18:48:03Z

I also made the SPGEMM perf test run with serial device, if it's enabled and neither CUDA nor OpenMP ran. Before, if you didn't give --openmp or --cuda, it would just print out a bunch of machine info and exit without doing a multiply, but it also didn't give a warning that nothing happened.

brian-kelley · 2020-02-25T23:54:02Z

2 TODOs for this PR:

decide how to handle the fact that the default for each ETI type is unconditionally enabled. This means that it's currently impossible for the user to get a perf test to use scalar=float, without changing source code (see Can I run KokkosKernels spgemm with float or int32 type? #583)
re run spot checks to make sure PRs since last week (and ideally, other pending PRs) didn't break this

ndellingwood · 2020-02-26T18:04:49Z

@brian-kelley are any modifications related to your first TODO needed before merging this PR, or can that be handled in a separate PR (in which case I'll review for merge so you don't have to wait longer for this to go in)?

brian-kelley · 2020-02-26T18:13:49Z

@ndellingwood No, the first TODO is not blocking this PR. It just means that users can't use float in the perf tests (yet) without changing source code themselves. This is fine as @jjwilke is already working on a solution in #620.

ndellingwood

Thanks @brian-kelley !

brian-kelley · 2020-02-27T21:23:47Z

@ndellingwood Thanks for reviewing, I'll re-run the checks to make sure nothing here broke.

brian-kelley · 2020-02-27T21:35:18Z

OK, rebasing this on develop built cleanly with Serial,Cuda,OpenMP on my machine so it's probably fine, but I'm still going to let the spot check frun.

brian-kelley · 2020-02-28T00:51:17Z

Spot checks still passed, so I'm merging.

brian-kelley and others added 14 commits February 19, 2020 13:50

WIP: sparse perf-test cleanup and ETI fixes

f8d5d42

- Cleaning up duplicated MatrixMarket code from perf_test/spmv that exists in IOUtils (kokkos#493) - Changing the scalar/lno_t/size_type/layout to tolerate any ETI combination (previously, only double/int/int/left was really supported)

Misc fixes and cleanup in perftests

f6c0f60

Finish cleanup of spmv perf tests

eae9f14

Fix -Wnarrowing warning

00aee41

in InnerProductSpaceTraits, when casting complex<double> to complex<float>

WIP: cleaning out MyCRSMatrix

0b9466f

Removed unused file ...spgemm_impl_common.hpp

cef4cee

Make perf tests use StaticCrsGraph/CrsMatrix

73acc1a

instead of the test_common versions of those which are now gone

Fixed typo "K" -> "k" in spiluk perf test

88b284d

Removed git conflict stuff from sptrsv perftest

6d60296

Made the spgemm perf test run with Serial

16208de

Last resort if nothing else is enabled, or nothing else was selected at runtime with argc/argv

Fixed SPGEMM ref output type, perftest warnings

9359cde

Fixed more warnings

ae14746

Made SPGEMM driver not crash in arg parsing

6a32a49

If "--flag" expects another argument after, check that there is actually another arg before trying to read it.

brian-kelley added bug enhancement labels Feb 21, 2020

brian-kelley requested review from srajama1, ndellingwood, vqd8a and lucbv February 21, 2020 18:44

brian-kelley self-assigned this Feb 21, 2020

ndellingwood approved these changes Feb 27, 2020

View reviewed changes

brian-kelley merged commit a5a7337 into kokkos:develop Feb 28, 2020

brian-kelley mentioned this pull request Mar 6, 2020

Some tests don't use Kokkos Kernels :) #647

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf tests cleanup #613

perf tests cleanup #613

brian-kelley commented Feb 21, 2020

brian-kelley commented Feb 21, 2020

brian-kelley commented Feb 25, 2020

ndellingwood commented Feb 26, 2020

brian-kelley commented Feb 26, 2020

ndellingwood left a comment

brian-kelley commented Feb 27, 2020

brian-kelley commented Feb 27, 2020

brian-kelley commented Feb 28, 2020

perf tests cleanup #613

perf tests cleanup #613

Conversation

brian-kelley commented Feb 21, 2020

brian-kelley commented Feb 21, 2020

brian-kelley commented Feb 25, 2020

ndellingwood commented Feb 26, 2020

brian-kelley commented Feb 26, 2020

ndellingwood left a comment

Choose a reason for hiding this comment

brian-kelley commented Feb 27, 2020

brian-kelley commented Feb 27, 2020

brian-kelley commented Feb 28, 2020