Reduce BatchedGemm test coverage #1737

Merged 1 commit on Mar 21, 2023

e10harvey
Contributor

Related to #1730.

@e10harvey
Contributor Author

be9310d reduces the runtime of each permutation of Cuda.batched_scalar_batched_gemm from ~8000 ms to ~2000 ms (roughly 2100 to 2900 ms in the run below):

$ time ./batched/dense/unit_test/KokkosKernels_batched_gemm_cuda
Kokkos::OpenMP::initialize WARNING: OMP_PROC_BIND environment variable not set
  In general, for best performance with OpenMP 4.0 or better set OMP_PROC_BIND=spread and OMP_PLACES=threads
  For best performance with OpenMP 3.1 set OMP_PROC_BIND=true
  For unit testing set OMP_PROC_BIND=false

[==========] Running 88 tests from 1 test case.
[----------] Global test environment set-up.
[----------] 88 tests from Cuda
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_nt_bhalf_bhalf
[       OK ] Cuda.batched_scalar_serial_gemm_nt_nt_bhalf_bhalf (487 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_nt_bhalf_bhalf
[       OK ] Cuda.batched_scalar_serial_gemm_t_nt_bhalf_bhalf (337 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_t_bhalf_bhalf
[       OK ] Cuda.batched_scalar_serial_gemm_nt_t_bhalf_bhalf (336 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_t_bhalf_bhalf
[       OK ] Cuda.batched_scalar_serial_gemm_t_t_bhalf_bhalf (339 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_nt_half_half
[       OK ] Cuda.batched_scalar_serial_gemm_nt_nt_half_half (391 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_nt_half_half
[       OK ] Cuda.batched_scalar_serial_gemm_t_nt_half_half (372 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_t_half_half
[       OK ] Cuda.batched_scalar_serial_gemm_nt_t_half_half (369 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_t_half_half
[       OK ] Cuda.batched_scalar_serial_gemm_t_t_half_half (373 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_nt_double_double
[       OK ] Cuda.batched_scalar_serial_gemm_nt_nt_double_double (130 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_nt_double_double
[       OK ] Cuda.batched_scalar_serial_gemm_t_nt_double_double (130 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_t_double_double
[       OK ] Cuda.batched_scalar_serial_gemm_nt_t_double_double (129 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_t_double_double
[       OK ] Cuda.batched_scalar_serial_gemm_t_t_double_double (130 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_nt_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_serial_gemm_nt_nt_dcomplex_dcomplex (201 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_nt_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_serial_gemm_t_nt_dcomplex_dcomplex (201 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_t_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_serial_gemm_nt_t_dcomplex_dcomplex (204 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_t_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_serial_gemm_t_t_dcomplex_dcomplex (201 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_nt_dcomplex_double
[       OK ] Cuda.batched_scalar_serial_gemm_nt_nt_dcomplex_double (249 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_nt_dcomplex_double
[       OK ] Cuda.batched_scalar_serial_gemm_t_nt_dcomplex_double (201 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_nt_t_dcomplex_double
[       OK ] Cuda.batched_scalar_serial_gemm_nt_t_dcomplex_double (200 ms)
[ RUN      ] Cuda.batched_scalar_serial_gemm_t_t_dcomplex_double
[       OK ] Cuda.batched_scalar_serial_gemm_t_t_dcomplex_double (200 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_bhalf_bhalf_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_bhalf_bhalf_left (2188 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_bhalf_bhalf_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_bhalf_bhalf_left (2185 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_bhalf_bhalf_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_bhalf_bhalf_left (2201 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_bhalf_bhalf_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_bhalf_bhalf_left (2183 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_bhalf_bhalf_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_bhalf_bhalf_right (2791 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_bhalf_bhalf_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_bhalf_bhalf_right (2791 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_bhalf_bhalf_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_bhalf_bhalf_right (2761 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_bhalf_bhalf_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_bhalf_bhalf_right (2795 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_half_half_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_half_half_left (2170 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_half_half_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_half_half_left (2160 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_half_half_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_half_half_left (2160 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_half_half_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_half_half_left (2158 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_half_half_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_half_half_right (2797 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_half_half_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_half_half_right (2773 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_half_half_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_half_half_right (2748 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_half_half_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_half_half_right (2838 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_double_double_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_double_double_left (2107 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_double_double_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_double_double_left (2103 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_double_double_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_double_double_left (2104 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_double_double_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_double_double_left (2102 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_double_double_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_double_double_right (2236 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_double_double_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_double_double_right (2224 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_double_double_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_double_double_right (2210 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_double_double_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_double_double_right (2211 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_dcomplex_dcomplex_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_dcomplex_dcomplex_left (2221 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_dcomplex_dcomplex_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_dcomplex_dcomplex_left (2223 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_dcomplex_dcomplex_left
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_dcomplex_dcomplex_left (2220 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_dcomplex_dcomplex_left
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_dcomplex_dcomplex_left (2221 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_nt_dcomplex_dcomplex_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_dcomplex_dcomplex_right (2852 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_nt_dcomplex_dcomplex_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_nt_dcomplex_dcomplex_right (2883 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_nt_t_dcomplex_dcomplex_right
[       OK ] Cuda.batched_scalar_batched_gemm_nt_t_dcomplex_dcomplex_right (2824 ms)
[ RUN      ] Cuda.batched_scalar_batched_gemm_t_t_dcomplex_dcomplex_right
[       OK ] Cuda.batched_scalar_batched_gemm_t_t_dcomplex_dcomplex_right (2864 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_nt_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_gemm_nt_nt_bhalf_bhalf (332 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_nt_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_gemm_t_nt_bhalf_bhalf (332 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_t_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_gemm_nt_t_bhalf_bhalf (335 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_t_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_gemm_t_t_bhalf_bhalf (343 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_nt_half_half
[       OK ] Cuda.batched_scalar_team_gemm_nt_nt_half_half (369 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_nt_half_half
[       OK ] Cuda.batched_scalar_team_gemm_t_nt_half_half (381 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_t_half_half
[       OK ] Cuda.batched_scalar_team_gemm_nt_t_half_half (360 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_t_half_half
[       OK ] Cuda.batched_scalar_team_gemm_t_t_half_half (378 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_nt_double_double
[       OK ] Cuda.batched_scalar_team_gemm_nt_nt_double_double (128 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_nt_double_double
[       OK ] Cuda.batched_scalar_team_gemm_t_nt_double_double (142 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_t_double_double
[       OK ] Cuda.batched_scalar_team_gemm_nt_t_double_double (129 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_t_double_double
[       OK ] Cuda.batched_scalar_team_gemm_t_t_double_double (129 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_nt_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_gemm_nt_nt_dcomplex_dcomplex (217 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_nt_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_gemm_t_nt_dcomplex_dcomplex (209 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_t_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_gemm_nt_t_dcomplex_dcomplex (202 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_t_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_gemm_t_t_dcomplex_dcomplex (202 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_nt_dcomplex_double
[       OK ] Cuda.batched_scalar_team_gemm_nt_nt_dcomplex_double (205 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_nt_dcomplex_double
[       OK ] Cuda.batched_scalar_team_gemm_t_nt_dcomplex_double (203 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_nt_t_dcomplex_double
[       OK ] Cuda.batched_scalar_team_gemm_nt_t_dcomplex_double (206 ms)
[ RUN      ] Cuda.batched_scalar_team_gemm_t_t_dcomplex_double
[       OK ] Cuda.batched_scalar_team_gemm_t_t_dcomplex_double (203 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_nt_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_nt_bhalf_bhalf (163 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_nt_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_nt_bhalf_bhalf (167 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_t_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_t_bhalf_bhalf (166 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_t_bhalf_bhalf
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_t_bhalf_bhalf (169 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_nt_half_half
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_nt_half_half (181 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_nt_half_half
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_nt_half_half (184 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_t_half_half
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_t_half_half (180 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_t_half_half
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_t_half_half (184 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_nt_double_double
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_nt_double_double (129 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_nt_double_double
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_nt_double_double (130 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_t_double_double
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_t_double_double (129 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_t_double_double
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_t_double_double (129 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_nt_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_nt_dcomplex_dcomplex (211 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_nt_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_nt_dcomplex_dcomplex (203 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_nt_t_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_t_dcomplex_dcomplex (205 ms)
[ RUN      ] Cuda.batched_scalar_team_vector_gemm_t_t_dcomplex_dcomplex
[       OK ] Cuda.batched_scalar_team_vector_gemm_t_t_dcomplex_dcomplex (207 ms)
[----------] 88 tests from Cuda (90227 ms total)

[----------] Global test environment tear-down
[==========] 88 tests from 1 test case ran. (90227 ms total)
[  PASSED  ] 88 tests.

real	1m30.938s
user	47m25.875s
sys	2m9.609s
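
To check the per-permutation claim against a log like the one above, the passing-test lines can be grouped by kernel family. This is only a sketch: the helper name `family_stats` and the regular expression are illustrative, not part of KokkosKernels or GoogleTest, and `SAMPLE_LOG` holds just a handful of lines copied from the run rather than all 88.

```python
import re
from collections import defaultdict

# A few "[       OK ]" lines copied from the log above (not the full run).
SAMPLE_LOG = """\
[       OK ] Cuda.batched_scalar_serial_gemm_nt_nt_double_double (130 ms)
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_double_double_left (2107 ms)
[       OK ] Cuda.batched_scalar_batched_gemm_nt_nt_double_double_right (2236 ms)
[       OK ] Cuda.batched_scalar_team_gemm_nt_nt_double_double (128 ms)
[       OK ] Cuda.batched_scalar_team_vector_gemm_nt_nt_double_double (129 ms)
"""

# Matches a passing GoogleTest result line; captures the kernel family and
# the elapsed time in milliseconds. The family names are taken from the log.
OK_LINE = re.compile(
    r"\[\s+OK \] Cuda\."
    r"(batched_scalar_(?:serial|batched|team_vector|team)_gemm)_\S* "
    r"\((\d+) ms\)"
)


def family_stats(log_text):
    """Return {family: (test_count, total_ms)} for passing tests in log_text."""
    totals = defaultdict(lambda: [0, 0])
    for line in log_text.splitlines():
        match = OK_LINE.search(line)
        if match:
            family, ms = match.group(1), int(match.group(2))
            totals[family][0] += 1
            totals[family][1] += ms
    return {family: tuple(values) for family, values in totals.items()}


stats = family_stats(SAMPLE_LOG)
for family, (count, total_ms) in sorted(stats.items()):
    mean_ms = total_ms / count
    print(f"{family}: {count} tests, {total_ms} ms total, {mean_ms:.0f} ms mean")
```

Feeding the complete log to `family_stats` instead of the sample would give the mean time per batched_gemm permutation for the whole run.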

Before be9310d:

$ time ./batched/dense/unit_test/KokkosKernels_batched_gemm_cuda
<snip>
[----------] 88 tests from Cuda (282513 ms total)

[----------] Global test environment tear-down
[==========] 88 tests from 1 test case ran. (282514 ms total)
[  PASSED  ] 88 tests.

real	4m43.221s
user	171m23.001s
sys	7m5.391s
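
The overall effect of the commit can be read off the two totals reported above. The sketch below only does the arithmetic on numbers copied from those runs; `parse_time` is a hypothetical helper written for the `XmY.ZZZs` format that `time` printed here.

```python
def parse_time(text):
    """Convert a `time`-style duration such as '4m43.221s' to seconds."""
    minutes, seconds = text.rstrip("s").split("m")
    return int(minutes) * 60 + float(seconds)


# Values copied from the two runs reported in this comment.
before_ms, after_ms = 282513, 90227      # GoogleTest totals for the 88 tests
before_real = parse_time("4m43.221s")    # wall clock before be9310d
after_real = parse_time("1m30.938s")     # wall clock with be9310d

gtest_speedup = before_ms / after_ms
real_speedup = before_real / after_real

print(f"GoogleTest total: {gtest_speedup:.2f}x faster")
print(f"Wall clock:       {real_speedup:.2f}x faster")
```

Both ratios come out at a little over 3x for the whole executable, which is smaller than the ~4x per-permutation reduction because the serial, team, and team_vector tests are unchanged.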

@kokkos-devops-admin

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection Is Not Necessary for this Pull Request.

@kokkos-devops-admin

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight
  • Build Num: 391
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10
  • Build Num: 5
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_GCC1020
  • Build Num: 32
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_GCC1020_Light_LayoutRight
  • Build Num: 303
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_Tpls_GCC1020
  • Build Num: 266
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19
  • Build Num: 353
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_CLANG1001
  • Build Num: 405
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110
  • Build Num: 207
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_A64FX_GCC1020
  • Build Num: 204
  • Status: STARTED

Test Name: KokkosKernels_PullRequest_VEGA908_ROCM520
  • Build Num: 206
  • Status: STARTED

Jenkins Parameters (identical for all ten builds)

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH reduce_test_coverage
KOKKOSKERNELS_SOURCE_REPO https://github.com/e10harvey/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA be9310d
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA afd686e
PR_LABELS
PULLREQUESTNUM 1737
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (e10harvey/kokkos-kernels)
  • Branch: reduce_test_coverage
  • SHA: be9310d
  • Mode: TEST_REPO

Pull Request Author: e10harvey

@kokkos-devops-admin

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs 30 Mins. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight
  • Build Num: 391
  • Status: FAILED

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10
  • Build Num: 5
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_GCC1020
  • Build Num: 32
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_GCC1020_Light_LayoutRight
  • Build Num: 303
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_Tpls_GCC1020
  • Build Num: 266
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19
  • Build Num: 353
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_CLANG1001
  • Build Num: 405
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110
  • Build Num: 207
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_A64FX_GCC1020
  • Build Num: 204
  • Status: PASSED

Test Name: KokkosKernels_PullRequest_VEGA908_ROCM520
  • Build Num: 206
  • Status: PASSED

Jenkins Parameters (identical for all ten builds)

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH reduce_test_coverage
KOKKOSKERNELS_SOURCE_REPO https://github.com/e10harvey/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA be9310d
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA afd686e
PR_LABELS
PULLREQUESTNUM 1737
TEST_REPO_ALIAS KOKKOSKERNELS

Console Output (last 100 lines): KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight # 391

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on weaver (testbed) in workspace /home/jenkins/weaver-new/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight
The recommended git tool is: NONE
No credentials specified
 > /home/projects/ppc64le/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/weaver-new/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/ppc64le/git/2.10.1/bin/git config remote.origin.url https://github.com/e10harvey/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/e10harvey/kokkos-kernels
 > /home/projects/ppc64le/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/ppc64le/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/e10harvey/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/ppc64le/git/2.10.1/bin/git rev-parse refs/remotes/origin/reduce_test_coverage^{commit} # timeout=10
JENKINS-19022: warning: possible memory leak due to Git plugin usage; see: https://plugins.jenkins.io/git/#remove-git-plugin-buildsbybranch-builddata-script
Checking out Revision be9310d975035b7d9af75446f83285d33e96428e (refs/remotes/origin/reduce_test_coverage)
 > /home/projects/ppc64le/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/ppc64le/git/2.10.1/bin/git checkout -f be9310d975035b7d9af75446f83285d33e96428e # timeout=10
Commit message: "Reduce BatchedGemm test coverage"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/ppc64le/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/weaver-new/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/ppc64le/git/2.10.1/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/ppc64le/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/ppc64le/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/ppc64le/git/2.10.1/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > /home/projects/ppc64le/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/ppc64le/git/2.10.1/bin/git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > /home/projects/ppc64le/git/2.10.1/bin/git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight] $ /bin/bash -el /tmp/jenkins8760027483553457748.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/weaver-new/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight
***Forced exclusive execution
Job <53460> is submitted to queue .
<>
<>
Build step 'Execute shell' marked build as failure
Finished: FAILURE

Console Output (last 100 lines): KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10 # 5

gcc-9.3.0-OpenMP-release build_time=468 run_time=98
gcc-9.3.0-OpenMP_Serial-release build_time=635 run_time=239
gcc-9.3.0-Serial-release build_time=430 run_time=109
Running on machine: weaver
KokkosKernels Repository Status:  446d14be1f40a8670ebc68063ac4cad1bedee346 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: gcc/9.3.0
Testing compiler gcc/9.3.0
Unrecognized compiler gcc/9.3.0 when looking for Spack variants
Unrecognized compiler gcc/9.3.0 when looking for Spack variants
Unrecognized compiler gcc/9.3.0 when looking for Spack variants
Starting job gcc-9.3.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: Power9,Volta70
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-9.3.0-OpenMP-release
Unrecognized compiler gcc/9.3.0 when looking for Spack variants
Unrecognized compiler gcc/9.3.0 when looking for Spack variants
Unrecognized compiler gcc/9.3.0 when looking for Spack variants
Starting job gcc-9.3.0-Serial-release
kokkos devices: Serial
kokkos arch: Power9,Volta70
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-9.3.0-Serial-release
#######################################################
PASSED TESTS
#######################################################
gcc-9.3.0-OpenMP-release build_time=447 run_time=100
gcc-9.3.0-Serial-release build_time=411 run_time=115
Running on machine: weaver
KokkosKernels Repository Status: 446d14be1f40a8670ebc68063ac4cad1bedee346 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: clang/13.0.0
Testing compiler clang/13.0.0
Unrecognized compiler clang/13.0.0 when looking for Spack variants
Unrecognized compiler clang/13.0.0 when looking for Spack variants
Unrecognized compiler clang/13.0.0 when looking for Spack variants
Starting job clang-13.0.0-Cuda-release
kokkos devices: Cuda
kokkos arch: Power9,Volta70
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED clang-13.0.0-Cuda-release
#######################################################
PASSED TESTS
#######################################################
clang-13.0.0-Cuda-release build_time=1009 run_time=518
/home/jenkins/weaver-new/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC1020 # 32

 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_GCC1020] $ /bin/bash -el /tmp/jenkins199741869419289981.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_GCC1020
Running on machine: blake
Running on machine: blake
KokkosKernels Repository Status:  08a94d932eaa00c765798e57d9a0cb7154fded5d Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: gcc/10.2.0
Testing compiler gcc/10.2.0
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Starting job gcc-10.2.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args: --disable-test-eti-only
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
KokkosKernels Repository Status: 08a94d932eaa00c765798e57d9a0cb7154fded5d Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: gcc/10.2.0
Testing compiler gcc/10.2.0
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Starting job gcc-10.2.0-Threads_Serial-release
kokkos devices: Threads,Serial
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args: --disable-test-eti-only
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-10.2.0-OpenMP-release
#######################################################
PASSED TESTS
#######################################################
gcc-10.2.0-OpenMP-release build_time=4342 run_time=356
PASSED gcc-10.2.0-Threads_Serial-release
#######################################################
PASSED TESTS
#######################################################
gcc-10.2.0-OpenMP-release build_time=4342 run_time=356
gcc-10.2.0-Threads_Serial-release build_time=4293 run_time=690
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_GCC1020
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC1020_Light_LayoutRight # 303

Commit message: "Reduce BatchedGemm test coverage"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 4dbd51ecb1e9fe7301a58a39879218713c45b42f # timeout=10
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_GCC1020_Light_LayoutRight/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_GCC1020_Light_LayoutRight] $ /bin/bash -el /tmp/jenkins2385075001566479502.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_GCC1020_Light_LayoutRight
Running on machine: blake
KokkosKernels Repository Status:  271f9462c380f2ce6bb334528bcadb611bc16035 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: gcc/10.2.0
Testing compiler gcc/10.2.0
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Starting job gcc-10.2.0-Threads_Serial-release
kokkos devices: Threads,Serial
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args: --no-default-eti
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutRight
PASSED gcc-10.2.0-Threads_Serial-release
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Starting job gcc-10.2.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args: --no-default-eti
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutRight
PASSED gcc-10.2.0-OpenMP-release
#######################################################
PASSED TESTS
#######################################################
gcc-10.2.0-OpenMP-release build_time=383 run_time=79
gcc-10.2.0-Threads_Serial-release build_time=532 run_time=175
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_GCC1020_Light_LayoutRight
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_GCC1020 # 266

[EnvInject] - Loading node environment variables.
Building remotely on blake (Testbed skylake) in workspace /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC1020
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC1020/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/e10harvey/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/e10harvey/kokkos-kernels
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/e10harvey/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse be9310d975035b7d9af75446f83285d33e96428e^{commit} # timeout=10
Checking out Revision be9310d975035b7d9af75446f83285d33e96428e (detached)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f be9310d975035b7d9af75446f83285d33e96428e # timeout=10
Commit message: "Reduce BatchedGemm test coverage"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 4dbd51ecb1e9fe7301a58a39879218713c45b42f # timeout=10
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC1020/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_Tpls_GCC1020] $ /bin/bash -el /tmp/jenkins6123107066524176717.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC1020
Running on machine: blake
KokkosKernels Repository Status:  5f19ca744666e160e9c119c591d6da6c49c88ca9 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: gcc/10.2.0
Testing compiler gcc/10.2.0
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Starting job gcc-10.2.0-OpenMP_Serial-release
kokkos devices: OpenMP,Serial
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-10.2.0-OpenMP_Serial-release
#######################################################
PASSED TESTS
#######################################################
gcc-10.2.0-OpenMP_Serial-release build_time=609 run_time=186
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC1020
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_INTEL19 # 353

Commit message: "Reduce BatchedGemm test coverage"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 4dbd51ecb1e9fe7301a58a39879218713c45b42f # timeout=10
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_INTEL19/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_Tpls_INTEL19] $ /bin/bash -el /tmp/jenkins7324188390614974161.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_INTEL19
Running on machine: blake
KokkosKernels Repository Status:  e79315c74e18ae4925d4c5eaf78176bcfc4a2ab4 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: intel/19.5.281
Testing compiler intel/19.5.281
Unrecognized compiler intel/19.5.281 when looking for Spack variants
Unrecognized compiler intel/19.5.281 when looking for Spack variants
Unrecognized compiler intel/19.5.281 when looking for Spack variants
Starting job intel-19.5.281-OpenMP-release
kokkos devices: OpenMP
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized -diag-disable=1011 -diag-disable=869 -diag-disable=1011 -diag-disable=869
extra_args: --disable-perftests
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED intel-19.5.281-OpenMP-release
Unrecognized compiler intel/19.5.281 when looking for Spack variants
Unrecognized compiler intel/19.5.281 when looking for Spack variants
Unrecognized compiler intel/19.5.281 when looking for Spack variants
Starting job intel-19.5.281-Threads-release
kokkos devices: Threads
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized -diag-disable=1011 -diag-disable=869 -diag-disable=1011 -diag-disable=869
extra_args: --disable-perftests
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED intel-19.5.281-Threads-release
#######################################################
PASSED TESTS
#######################################################
intel-19.5.281-OpenMP-release build_time=1228 run_time=93
intel-19.5.281-Threads-release build_time=817 run_time=104
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_INTEL19
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_CLANG1001 # 405

[EnvInject] - Loading node environment variables.
Building remotely on blake (Testbed skylake) in workspace /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_CLANG1001
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_CLANG1001/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/e10harvey/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/e10harvey/kokkos-kernels
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/e10harvey/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse be9310d975035b7d9af75446f83285d33e96428e^{commit} # timeout=10
Checking out Revision be9310d975035b7d9af75446f83285d33e96428e (detached)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f be9310d975035b7d9af75446f83285d33e96428e # timeout=10
Commit message: "Reduce BatchedGemm test coverage"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 4dbd51ecb1e9fe7301a58a39879218713c45b42f # timeout=10
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_CLANG1001/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_CLANG1001] $ /bin/bash -el /tmp/jenkins3452292888287383892.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_CLANG1001
Running on machine: blake
KokkosKernels Repository Status:  0d067bd33977a6902b144d7812bb07e9740fcbc2 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: clang/10.0.1
Testing compiler clang/10.0.1
Unrecognized compiler clang/10.0.1 when looking for Spack variants
Unrecognized compiler clang/10.0.1 when looking for Spack variants
Unrecognized compiler clang/10.0.1 when looking for Spack variants
Starting job clang-10.0.1-Threads_Serial-release
kokkos devices: Threads,Serial
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED clang-10.0.1-Threads_Serial-release
#######################################################
PASSED TESTS
#######################################################
clang-10.0.1-Threads_Serial-release build_time=608 run_time=198
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_CLANG1001
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110 # 207

 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110

Currently Loaded Modules:

  1) gcc/10.2.0        11) numactl/2.0.12           21) parmetis/4.0.3
  2) autoconf/2.69     12) hwloc/1.11.11            22) metis/5.1.0
  3) automake/1.16.1   13) pmix/2.2.3               23) openblas/0.3.4
  4) libtool/2.4.6     14) ucx/1.7.0                24) superlu/5.2.1
  5) cmake/3.17.0      15) openmpi/4.0.2            25) superlu-dist/5.4.0
  6) git/2.19.2        16) curl/7.71.0              26) boost/1.72.0
  7) zlib/1.2.11       17) netcdf-c/4.6.3           27) fftw/3.3.8
  8) bzip2/1.0.6       18) parallel-netcdf/1.11.1   28) singularity/3.5.3
  9) xz/5.2.4          19) hdf5/1.10.5              29) devpack-gnu10/20201002
 10) yaml-cpp/0.6.2    20) cgns/3.4.0

Currently Loaded Modules:

  1) cmake/3.17.0

salloc: Granted job allocation 3111
Running on machine: inouye
KokkosKernels Repository Status: 481d1172e714456210e6dc3e4d46c8fd3a699cd4 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: armpl/21.1.0
Testing compiler armpl/21.1.0
Unrecognized compiler armpl/21.1.0 when looking for Spack variants
Unrecognized compiler armpl/21.1.0 when looking for Spack variants
Unrecognized compiler armpl/21.1.0 when looking for Spack variants
Starting job armpl-21.1.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: A64FX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED armpl-21.1.0-OpenMP-release
Unrecognized compiler armpl/21.1.0 when looking for Spack variants
Unrecognized compiler armpl/21.1.0 when looking for Spack variants
Unrecognized compiler armpl/21.1.0 when looking for Spack variants
Starting job armpl-21.1.0-Serial-release
kokkos devices: Serial
kokkos arch: A64FX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED armpl-21.1.0-Serial-release
#######################################################
PASSED TESTS
#######################################################
armpl-21.1.0-OpenMP-release build_time=1743 run_time=1042
armpl-21.1.0-Serial-release build_time=1627 run_time=248
salloc: Relinquishing job allocation 3111
/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_Tpls_ARMPL2110
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_A64FX_GCC1020 # 204

Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > git --version # timeout=10
 > git --version # 'git version 2.19.2'
Setting http proxy: proxy.sandia.gov:80
 > git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > git config core.sparsecheckout # timeout=10
 > git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_A64FX_GCC1020] $ /bin/bash -el /tmp/jenkins4228565187572202585.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_GCC1020

Currently Loaded Modules:

  1) gcc/10.2.0        11) numactl/2.0.12           21) parmetis/4.0.3
  2) autoconf/2.69     12) hwloc/1.11.11            22) metis/5.1.0
  3) automake/1.16.1   13) pmix/2.2.3               23) openblas/0.3.4
  4) libtool/2.4.6     14) ucx/1.7.0                24) superlu/5.2.1
  5) cmake/3.17.0      15) openmpi/4.0.2            25) superlu-dist/5.4.0
  6) git/2.19.2        16) curl/7.71.0              26) boost/1.72.0
  7) zlib/1.2.11       17) netcdf-c/4.6.3           27) fftw/3.3.8
  8) bzip2/1.0.6       18) parallel-netcdf/1.11.1   28) singularity/3.5.3
  9) xz/5.2.4          19) hdf5/1.10.5              29) devpack-gnu10/20201002
 10) yaml-cpp/0.6.2    20) cgns/3.4.0

Currently Loaded Modules:

  1) cmake/3.17.0

salloc: Granted job allocation 3112
Running on machine: inouye
KokkosKernels Repository Status: 3b099e03fbe81d10ea413bfb61e56acf580291b6 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: gcc/10.2.0
Testing compiler gcc/10.2.0
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Unrecognized compiler gcc/10.2.0 when looking for Spack variants
Starting job gcc-10.2.0-OpenMP_Serial-release
kokkos devices: OpenMP,Serial
kokkos arch: A64FX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-10.2.0-OpenMP_Serial-release
#######################################################
PASSED TESTS
#######################################################
gcc-10.2.0-OpenMP_Serial-release build_time=2411 run_time=2150
salloc: Relinquishing job allocation 3112
/home/jenkins/inouye/workspace/workspace/KokkosKernels_PullRequest_A64FX_GCC1020
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_VEGA908_ROCM520 # 206

Building remotely on caraway (Testbed) in workspace /home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA908_ROCM520
The recommended git tool is: NONE
No credentials specified
 > git rev-parse --resolve-git-dir /home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA908_ROCM520/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > git config remote.origin.url https://github.com/e10harvey/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/e10harvey/kokkos-kernels
 > git --version # timeout=10
 > git --version # 'git version 2.31.1'
Setting http proxy: proxy.sandia.gov:80
 > git fetch --tags --force --progress -- https://github.com/e10harvey/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > git rev-parse be9310d975035b7d9af75446f83285d33e96428e^{commit} # timeout=10
Checking out Revision be9310d975035b7d9af75446f83285d33e96428e (detached)
 > git config core.sparsecheckout # timeout=10
 > git checkout -f be9310d975035b7d9af75446f83285d33e96428e # timeout=10
Commit message: "Reduce BatchedGemm test coverage"
 > git rev-list --no-walk 4dbd51ecb1e9fe7301a58a39879218713c45b42f # timeout=10
The recommended git tool is: NONE
No credentials specified
 > git rev-parse --resolve-git-dir /home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA908_ROCM520/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > git --version # timeout=10
 > git --version # 'git version 2.31.1'
Setting http proxy: proxy.sandia.gov:80
 > git fetch --tags --force --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 65aa95e8280d1ff845894329a8c3e8b0d138debc (origin/develop)
 > git config core.sparsecheckout # timeout=10
 > git checkout -f 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
Commit message: "Merge pull request #5965 from dalg24/desul_numeric_limits_max"
 > git rev-list --no-walk 65aa95e8280d1ff845894329a8c3e8b0d138debc # timeout=10
[KokkosKernels_PullRequest_VEGA908_ROCM520] $ /bin/bash -el /tmp/jenkins6536766240912870061.sh
Merge made by the 'recursive' strategy.
 perf_test/Benchmark_Context.hpp                    |   3 +
 perf_test/CMakeLists.txt                           |   3 +
 .../KokkosBlas_dot_mv_perf_test_benchmark.cpp      | 142 ++++++++++++++++++++
 .../blas1/KokkosBlas_dot_perf_test_benchmark.cpp   | 140 ++++++++++++++++++++
 .../KokkosBlas_team_dot_perf_test_benchmark.cpp    | 146 +++++++++++++++++++++
 5 files changed, 434 insertions(+)
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_mv_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_dot_perf_test_benchmark.cpp
 create mode 100644 perf_test/blas/blas1/KokkosBlas_team_dot_perf_test_benchmark.cpp
/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA908_ROCM520
Running on machine: caraway
KokkosKernels Repository Status:  c99a7066da9497ac7ff6e74918d8bfe01936e261 Merge remote-tracking branch 'upstream/develop' into HEAD

Kokkos Repository Status: 65aa95e8280d1ff845894329a8c3e8b0d138debc Merge pull request #5965 from dalg24/desul_numeric_limits_max

Going to test compilers: rocm/5.2.0
Testing compiler rocm/5.2.0
Unrecognized compiler rocm/5.2.0 when looking for Spack variants
Unrecognized compiler rocm/5.2.0 when looking for Spack variants
Unrecognized compiler rocm/5.2.0 when looking for Spack variants
Starting job rocm-5.2.0-Hip_Serial-release
Hip IS THE KOKKOS DEVICE
kokkos devices: Hip,Serial
kokkos arch: VEGA908
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED rocm-5.2.0-Hip_Serial-release
#######################################################
PASSED TESTS
#######################################################
rocm-5.2.0-Hip_Serial-release build_time=833 run_time=394
/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA908_ROCM520
Finished: SUCCESS

@e10harvey
Contributor Author

TODO: Re-run CUDA11 build number 391 manually. The console output indicates that CUDA11 timed out waiting for resources.

@e10harvey
Contributor Author

TODO: Re-run CUDA11 build number 391 manually. The console output indicates that CUDA11 timed out waiting for resources.

Started re-run in number 404 via KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight.

@e10harvey
Contributor Author

TODO: Re-run CUDA11 build number 391 manually. The console output indicates that CUDA11 timed out waiting for resources.

Started re-run in number 404 via KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight.

This build passed.

@e10harvey e10harvey merged commit aec946c into kokkos:develop Mar 21, 2023
ndellingwood pushed a commit that referenced this pull request Mar 28, 2023
Reduce BatchedGemm test coverage

(cherry picked from commit aec946c)
@ndellingwood
Contributor

Cherry-picked to release-candidate-4.0.01 SHA c9d087a

ndellingwood pushed a commit that referenced this pull request Apr 20, 2023
Reduce BatchedGemm test coverage

(cherry picked from commit aec946c)
3 participants