Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use float as accumulator for GEMV on half_t (Fix #1081) #1082

Merged
merged 1 commit into from
Aug 18, 2021

Conversation

brian-kelley
Copy link
Contributor

This avoid atomics with half_t, which are currently very slow on CUDA
(use lock tables, not native atomics).
This also improves the numerical accuracy of the output for all modes
(not just TwoLevel/LayoutLeft, which was using the atomics).

On 1M x 10 matrix, V100, LayoutLeft GEMV with half_t: was getting 1.3 mflops (!) but now gets 140-150 gflops which is about what float gets.

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection Is Not Necessary for this Pull Request.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light

  • Build Num: 413
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720

  • Build Num: 406
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light_LayoutRight

  • Build Num: 53
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720

  • Build Num: 397
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10

  • Build Num: 378
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL18

  • Build Num: 383
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight

  • Build Num: 189
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA9

  • Build Num: 373
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720_GCC740

  • Build Num: 371
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (brian-kelley/kokkos-kernels)
  • Branch: Fix1081
  • SHA: 58f6be1
  • Mode: TEST_REPO

Pull Request Author: brian-kelley

This avoid atomics with half_t, which are currently very slow on CUDA
(use lock tables, not native atomics).
This also improves the numerical accuracy of the output for all modes
(not just TwoLevel/LayoutLeft, which was using the atomics).
@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Error: Jenkins Jobs - A user has commited a change to the PR before testing completed. The original testing SHA = 58f6be1 Does not match the current commit SHA = 718fe40. The Jenkins Jobs will be shutdown; Testing of this PR must occur again.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs 30 Mins. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light

  • Build Num: 413
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720

  • Build Num: 406
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light_LayoutRight

  • Build Num: 53
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720

  • Build Num: 397
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10

  • Build Num: 378
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL18

  • Build Num: 383
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight

  • Build Num: 189
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA9

  • Build Num: 373
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720_GCC740

  • Build Num: 371
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 58f6be1
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS
Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC720_Light # 413 (click to expand)

15: Shortcut relative residual for iteration 13 is: 1.49353e-05
15: Shortcut relative residual for iteration 14 is: 7.13036e-06
15: True relative residual for iteration 14 is : 7.13036e-06
15: Shortcut relative residual for iteration 15 is: 3.53706e-06
15: Shortcut relative residual for iteration 16 is: 1.72347e-06
15: Shortcut relative residual for iteration 17 is: 8.50106e-07
15: Shortcut relative residual for iteration 18 is: 4.15334e-07
15: Shortcut relative residual for iteration 19 is: 2.05746e-07
15: Shortcut relative residual for iteration 20 is: 9.82943e-08
15: Shortcut relative residual for iteration 21 is: 4.63135e-08
15: Shortcut relative residual for iteration 22 is: 2.13531e-08
15: Shortcut relative residual for iteration 23 is: 9.69776e-09
15: Shortcut relative residual for iteration 24 is: 4.43873e-09
15: Shortcut relative residual for iteration 25 is: 2.01253e-09
15: Shortcut relative residual for iteration 26 is: 9.29927e-10
15: Shortcut relative residual for iteration 27 is: 4.2636e-10
15: Shortcut relative residual for iteration 28 is: 2.02506e-10
15: Shortcut relative residual for iteration 29 is: 9.64983e-11
15: True relative residual for iteration 29 is : 9.64983e-11
15: Ending relative residual is: 9.64983e-11
15: Solver converged! 
15: The solver completed 29 iterations.
15: =======================================
15: Verify from main: Ending residual is 9.64983e-11
15: Number of iterations is: 29
15: Diff of residual from main - residual from solver: 0
15: Convergence flag is : Converged
15: Test CGS2 Passed!
15: =======================================
15: 
15: 
15: Testing GMRES with MGS ortho:
15: Convergence tolerance is: 1e-10
15: Shortcut relative residual for iteration 0 is: 0.396496
15: Shortcut relative residual for iteration 1 is: 0.168525
15: Shortcut relative residual for iteration 2 is: 0.0731299
15: Shortcut relative residual for iteration 3 is: 0.0325159
15: Shortcut relative residual for iteration 4 is: 0.0143389
15: Shortcut relative residual for iteration 5 is: 0.00648654
15: Shortcut relative residual for iteration 6 is: 0.00295024
15: Shortcut relative residual for iteration 7 is: 0.00136986
15: Shortcut relative residual for iteration 8 is: 0.00064299
15: Shortcut relative residual for iteration 9 is: 0.000303306
15: Shortcut relative residual for iteration 10 is: 0.000144037
15: Shortcut relative residual for iteration 11 is: 6.72722e-05
15: Shortcut relative residual for iteration 12 is: 3.16747e-05
15: Shortcut relative residual for iteration 13 is: 1.49353e-05
15: Shortcut relative residual for iteration 14 is: 7.13036e-06
15: True relative residual for iteration 14 is : 7.13036e-06
15: Shortcut relative residual for iteration 15 is: 3.53706e-06
15: Shortcut relative residual for iteration 16 is: 1.72347e-06
15: Shortcut relative residual for iteration 17 is: 8.50106e-07
15: Shortcut relative residual for iteration 18 is: 4.15334e-07
15: Shortcut relative residual for iteration 19 is: 2.05746e-07
15: Shortcut relative residual for iteration 20 is: 9.82943e-08
15: Shortcut relative residual for iteration 21 is: 4.63135e-08
15: Shortcut relative residual for iteration 22 is: 2.13531e-08
15: Shortcut relative residual for iteration 23 is: 9.69776e-09
15: Shortcut relative residual for iteration 24 is: 4.43873e-09
15: Shortcut relative residual for iteration 25 is: 2.01253e-09
15: Shortcut relative residual for iteration 26 is: 9.29927e-10
15: Shortcut relative residual for iteration 27 is: 4.2636e-10
15: Shortcut relative residual for iteration 28 is: 2.02506e-10
15: Shortcut relative residual for iteration 29 is: 9.64983e-11
15: True relative residual for iteration 29 is : 9.64983e-11
15: Ending relative residual is: 9.64983e-11
15: Solver converged! 
15: The solver completed 29 iterations.
15: =======================================
15: Verify from main: Ending residual is 9.64983e-11
15: Number of iterations is: 29
15: Diff of residual from main - residual from solver: 0
15: Convergence flag is : Converged
15: Test MGS Passed!
15: =======================================
15: 
15: 
15/15 Test #15: gmres_test_real_A ................   Passed    0.03 sec

93% tests passed, 1 tests failed out of 15

Total Test time (real) = 110.68 sec

The following tests FAILED:
1 - blas_serial (Failed)
Errors while running CTest
Starting job gcc-7.2.0-OpenMP_Serial-release
kokkos devices: OpenMP,Serial
kokkos arch: Power8,Pascal60
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC720 # 406 (click to expand)

20: Shortcut relative residual for iteration 14 is: 7.13036e-06
20: True relative residual for iteration 14 is : 7.13036e-06
20: Shortcut relative residual for iteration 15 is: 3.53706e-06
20: Shortcut relative residual for iteration 16 is: 1.72347e-06
20: Shortcut relative residual for iteration 17 is: 8.50106e-07
20: Shortcut relative residual for iteration 18 is: 4.15334e-07
20: Shortcut relative residual for iteration 19 is: 2.05746e-07
20: Shortcut relative residual for iteration 20 is: 9.82943e-08
20: Shortcut relative residual for iteration 21 is: 4.63135e-08
20: Shortcut relative residual for iteration 22 is: 2.13531e-08
20: Shortcut relative residual for iteration 23 is: 9.69776e-09
20: Shortcut relative residual for iteration 24 is: 4.43873e-09
20: Shortcut relative residual for iteration 25 is: 2.01253e-09
20: Shortcut relative residual for iteration 26 is: 9.29927e-10
20: Shortcut relative residual for iteration 27 is: 4.2636e-10
20: Shortcut relative residual for iteration 28 is: 2.02506e-10
20: Shortcut relative residual for iteration 29 is: 9.64983e-11
20: True relative residual for iteration 29 is : 9.64983e-11
20: Ending relative residual is: 9.64983e-11
20: Solver converged! 
20: The solver completed 29 iterations.
20: =======================================
20: Verify from main: Ending residual is 9.64983e-11
20: Number of iterations is: 29
20: Diff of residual from main - residual from solver: 0
20: Convergence flag is : Converged
20: Test CGS2 Passed!
20: =======================================
20: 
20: 
20: Testing GMRES with MGS ortho:
20: Convergence tolerance is: 1e-10
20: Shortcut relative residual for iteration 0 is: 0.396496
20: Shortcut relative residual for iteration 1 is: 0.168525
20: Shortcut relative residual for iteration 2 is: 0.0731299
20: Shortcut relative residual for iteration 3 is: 0.0325159
20: Shortcut relative residual for iteration 4 is: 0.0143389
20: Shortcut relative residual for iteration 5 is: 0.00648654
20: Shortcut relative residual for iteration 6 is: 0.00295024
20: Shortcut relative residual for iteration 7 is: 0.00136986
20: Shortcut relative residual for iteration 8 is: 0.00064299
20: Shortcut relative residual for iteration 9 is: 0.000303306
20: Shortcut relative residual for iteration 10 is: 0.000144037
20: Shortcut relative residual for iteration 11 is: 6.72722e-05
20: Shortcut relative residual for iteration 12 is: 3.16747e-05
20: Shortcut relative residual for iteration 13 is: 1.49353e-05
20: Shortcut relative residual for iteration 14 is: 7.13036e-06
20: True relative residual for iteration 14 is : 7.13036e-06
20: Shortcut relative residual for iteration 15 is: 3.53706e-06
20: Shortcut relative residual for iteration 16 is: 1.72347e-06
20: Shortcut relative residual for iteration 17 is: 8.50106e-07
20: Shortcut relative residual for iteration 18 is: 4.15334e-07
20: Shortcut relative residual for iteration 19 is: 2.05746e-07
20: Shortcut relative residual for iteration 20 is: 9.82943e-08
20: Shortcut relative residual for iteration 21 is: 4.63135e-08
20: Shortcut relative residual for iteration 22 is: 2.13531e-08
20: Shortcut relative residual for iteration 23 is: 9.69776e-09
20: Shortcut relative residual for iteration 24 is: 4.43873e-09
20: Shortcut relative residual for iteration 25 is: 2.01253e-09
20: Shortcut relative residual for iteration 26 is: 9.29927e-10
20: Shortcut relative residual for iteration 27 is: 4.2636e-10
20: Shortcut relative residual for iteration 28 is: 2.02506e-10
20: Shortcut relative residual for iteration 29 is: 9.64983e-11
20: True relative residual for iteration 29 is : 9.64983e-11
20: Ending relative residual is: 9.64983e-11
20: Solver converged! 
20: The solver completed 29 iterations.
20: =======================================
20: Verify from main: Ending residual is 9.64983e-11
20: Number of iterations is: 29
20: Diff of residual from main - residual from solver: 0
20: Convergence flag is : Converged
20: Test MGS Passed!
20: =======================================
20: 
20: 
20/20 Test #20: gmres_test_real_A ................   Passed    0.03 sec

90% tests passed, 2 tests failed out of 20

Total Test time (real) = 158.04 sec

The following tests FAILED:
1 - blas_serial (Failed)
6 - blas_threads (Failed)
Errors while running CTest
Starting job gcc-7.2.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC720_Light_LayoutRight # 53 (click to expand)

20: Shortcut relative residual for iteration 14 is: 7.13036e-06
20: True relative residual for iteration 14 is : 7.13036e-06
20: Shortcut relative residual for iteration 15 is: 3.53706e-06
20: Shortcut relative residual for iteration 16 is: 1.72347e-06
20: Shortcut relative residual for iteration 17 is: 8.50106e-07
20: Shortcut relative residual for iteration 18 is: 4.15334e-07
20: Shortcut relative residual for iteration 19 is: 2.05746e-07
20: Shortcut relative residual for iteration 20 is: 9.82943e-08
20: Shortcut relative residual for iteration 21 is: 4.63135e-08
20: Shortcut relative residual for iteration 22 is: 2.13531e-08
20: Shortcut relative residual for iteration 23 is: 9.69776e-09
20: Shortcut relative residual for iteration 24 is: 4.43873e-09
20: Shortcut relative residual for iteration 25 is: 2.01253e-09
20: Shortcut relative residual for iteration 26 is: 9.29927e-10
20: Shortcut relative residual for iteration 27 is: 4.2636e-10
20: Shortcut relative residual for iteration 28 is: 2.02506e-10
20: Shortcut relative residual for iteration 29 is: 9.64983e-11
20: True relative residual for iteration 29 is : 9.64983e-11
20: Ending relative residual is: 9.64983e-11
20: Solver converged! 
20: The solver completed 29 iterations.
20: =======================================
20: Verify from main: Ending residual is 9.64983e-11
20: Number of iterations is: 29
20: Diff of residual from main - residual from solver: 0
20: Convergence flag is : Converged
20: Test CGS2 Passed!
20: =======================================
20: 
20: 
20: Testing GMRES with MGS ortho:
20: Convergence tolerance is: 1e-10
20: Shortcut relative residual for iteration 0 is: 0.396496
20: Shortcut relative residual for iteration 1 is: 0.168525
20: Shortcut relative residual for iteration 2 is: 0.0731299
20: Shortcut relative residual for iteration 3 is: 0.0325159
20: Shortcut relative residual for iteration 4 is: 0.0143389
20: Shortcut relative residual for iteration 5 is: 0.00648654
20: Shortcut relative residual for iteration 6 is: 0.00295024
20: Shortcut relative residual for iteration 7 is: 0.00136986
20: Shortcut relative residual for iteration 8 is: 0.00064299
20: Shortcut relative residual for iteration 9 is: 0.000303306
20: Shortcut relative residual for iteration 10 is: 0.000144037
20: Shortcut relative residual for iteration 11 is: 6.72722e-05
20: Shortcut relative residual for iteration 12 is: 3.16747e-05
20: Shortcut relative residual for iteration 13 is: 1.49353e-05
20: Shortcut relative residual for iteration 14 is: 7.13036e-06
20: True relative residual for iteration 14 is : 7.13036e-06
20: Shortcut relative residual for iteration 15 is: 3.53706e-06
20: Shortcut relative residual for iteration 16 is: 1.72347e-06
20: Shortcut relative residual for iteration 17 is: 8.50106e-07
20: Shortcut relative residual for iteration 18 is: 4.15334e-07
20: Shortcut relative residual for iteration 19 is: 2.05746e-07
20: Shortcut relative residual for iteration 20 is: 9.82943e-08
20: Shortcut relative residual for iteration 21 is: 4.63135e-08
20: Shortcut relative residual for iteration 22 is: 2.13531e-08
20: Shortcut relative residual for iteration 23 is: 9.69776e-09
20: Shortcut relative residual for iteration 24 is: 4.43873e-09
20: Shortcut relative residual for iteration 25 is: 2.01253e-09
20: Shortcut relative residual for iteration 26 is: 9.29927e-10
20: Shortcut relative residual for iteration 27 is: 4.2636e-10
20: Shortcut relative residual for iteration 28 is: 2.02506e-10
20: Shortcut relative residual for iteration 29 is: 9.64983e-11
20: True relative residual for iteration 29 is : 9.64983e-11
20: Ending relative residual is: 9.64983e-11
20: Solver converged! 
20: The solver completed 29 iterations.
20: =======================================
20: Verify from main: Ending residual is 9.64983e-11
20: Number of iterations is: 29
20: Diff of residual from main - residual from solver: 0
20: Convergence flag is : Converged
20: Test MGS Passed!
20: =======================================
20: 
20: 
20/20 Test #20: gmres_test_real_A ................   Passed    0.03 sec

90% tests passed, 2 tests failed out of 20

Total Test time (real) = 144.74 sec

The following tests FAILED:
1 - blas_serial (Failed)
6 - blas_threads (Failed)
Errors while running CTest
Starting job gcc-7.2.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args: --no-default-eti
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutRight
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_GCC720 # 397 (click to expand)

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on blake (Testbed skylake) in workspace /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/brian-kelley/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/brian-kelley/kokkos-kernels
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/brian-kelley/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse refs/remotes/origin/Fix1081^{commit} # timeout=10
Checking out Revision 58f6be166c4247295d98be98242d1e3c34203b10 (refs/remotes/origin/Fix1081)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 58f6be166c4247295d98be98242d1e3c34203b10 # timeout=10
Commit message: "Use float as accumulator for GEMV on half_t"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 23cfe6f7edcdf66fe67a665eb6238ea23488100a (origin/develop)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 23cfe6f7edcdf66fe67a665eb6238ea23488100a # timeout=10
Commit message: "Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk c93ccb46856676872c7c7a7b9b2a0249ae4fb83a # timeout=10
[KokkosKernels_PullRequest_Tpls_GCC720] $ /bin/bash -el /tmp/jenkins321205496921930328.sh
salloc: Pending job allocation 1013118
salloc: job 1013118 queued and waiting for resources
salloc: job 1013118 has been allocated resources
salloc: Granted job allocation 1013118
salloc: Waiting for resource configuration
salloc: Nodes blake21 are ready for job
Running on machine: blake
KokkosKernels Repository Status:  58f6be166c4247295d98be98242d1e3c34203b10 Use float as accumulator for GEMV on half_t

Kokkos Repository Status: 23cfe6f7edcdf66fe67a665eb6238ea23488100a Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id

Going to test compilers: gcc/7.2.0
Testing compiler gcc/7.2.0
Starting job gcc-7.2.0-OpenMP_Serial-release
kokkos devices: OpenMP,Serial
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-7.2.0-OpenMP_Serial-release
#######################################################
PASSED TESTS
#######################################################
gcc-7.2.0-OpenMP_Serial-release build_time=350 run_time=135
salloc: Relinquishing job allocation 1013118
salloc: Job allocation 1013118 has been revoked.
/home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720
Finished: SUCCESS

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_CUDA10 # 378 (click to expand)

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on White (Testbed Power8) in workspace /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA10
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA10/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/brian-kelley/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/brian-kelley/kokkos-kernels
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/brian-kelley/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse refs/remotes/origin/Fix1081^{commit} # timeout=10
Checking out Revision 58f6be166c4247295d98be98242d1e3c34203b10 (refs/remotes/origin/Fix1081)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 58f6be166c4247295d98be98242d1e3c34203b10 # timeout=10
Commit message: "Use float as accumulator for GEMV on half_t"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA10/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 23cfe6f7edcdf66fe67a665eb6238ea23488100a (origin/develop)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 23cfe6f7edcdf66fe67a665eb6238ea23488100a # timeout=10
Commit message: "Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id"
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-list --no-walk c93ccb46856676872c7c7a7b9b2a0249ae4fb83a # timeout=10
[KokkosKernels_PullRequest_Tpls_CUDA10] $ /bin/bash -el /tmp/jenkins1969159997006673855.sh
***Forced exclusive execution
Job <79807> is submitted to queue .
<>
<>
Running on machine: white
KokkosKernels Repository Status:  58f6be166c4247295d98be98242d1e3c34203b10 Use float as accumulator for GEMV on half_t

Kokkos Repository Status: 23cfe6f7edcdf66fe67a665eb6238ea23488100a Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id

Going to test compilers: cuda/10.1.105
Testing compiler cuda/10.1.105
Unrecognized compiler cuda/10.1.105 when looking for Spack variants
Unrecognized compiler cuda/10.1.105 when looking for Spack variants
Unrecognized compiler cuda/10.1.105 when looking for Spack variants
Starting job cuda-10.1.105-Cuda_Serial-release
kokkos devices: Cuda,Serial
kokkos arch: Power8,Pascal60
kokkos options:
kokkos cuda options: ,enable_lambda
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_INTEL18 # 383 (click to expand)

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on blake (Testbed skylake) in workspace /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_INTEL18
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_INTEL18/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/brian-kelley/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/brian-kelley/kokkos-kernels
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/brian-kelley/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse refs/remotes/origin/Fix1081^{commit} # timeout=10
Checking out Revision 58f6be166c4247295d98be98242d1e3c34203b10 (refs/remotes/origin/Fix1081)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 58f6be166c4247295d98be98242d1e3c34203b10 # timeout=10
Commit message: "Use float as accumulator for GEMV on half_t"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse --resolve-git-dir /home/jenkins/blake-new/workspace/KokkosKernels_PullRequest_Tpls_INTEL18/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/x86-64/git/2.9.4/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/x86-64/git/2.9.4/bin/git --version # timeout=10
 > git --version # 'git version 2.9.4'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/x86-64/git/2.9.4/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git rev-parse 84ff7f1^{commit} # timeout=10
Checking out Revision 84ff7f13ce93b2e5b65fb842beb5b6c33860502c (detached)
 > /home/projects/x86-64/git/2.9.4/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/x86-64/git/2.9.4/bin/git checkout -f 84ff7f13ce93b2e5b65fb842beb5b6c33860502c # timeout=10
Commit message: "Merge pull request #3926 from masterleinad/fix_undefined_reference_testteam_vector"
 > /home/projects/x86-64/git/2.9.4/bin/git rev-list --no-walk 84ff7f13ce93b2e5b65fb842beb5b6c33860502c # timeout=10
[KokkosKernels_PullRequest_Tpls_INTEL18] $ /bin/bash -el /tmp/jenkins2449053629414519219.sh
salloc: Pending job allocation 1013120
salloc: job 1013120 queued and waiting for resources
salloc: job 1013120 has been allocated resources
salloc: Granted job allocation 1013120
Running on machine: blake
KokkosKernels Repository Status:  58f6be166c4247295d98be98242d1e3c34203b10 Use float as accumulator for GEMV on half_t

Kokkos Repository Status: 84ff7f13ce93b2e5b65fb842beb5b6c33860502c Merge pull request #3926 from masterleinad/fix_undefined_reference_testteam_vector

Going to test compilers: intel/18.1.163
Testing compiler intel/18.1.163
Unrecognized compiler intel/18.1.163 when looking for Spack variants
Unrecognized compiler intel/18.1.163 when looking for Spack variants
Unrecognized compiler intel/18.1.163 when looking for Spack variants
Starting job intel-18.1.163-OpenMP-release
kokkos devices: OpenMP
kokkos arch: SKX
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight # 189 (click to expand)

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on White (Testbed Power8) in workspace /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/brian-kelley/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/brian-kelley/kokkos-kernels
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/brian-kelley/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse refs/remotes/origin/Fix1081^{commit} # timeout=10
Checking out Revision 58f6be166c4247295d98be98242d1e3c34203b10 (refs/remotes/origin/Fix1081)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 58f6be166c4247295d98be98242d1e3c34203b10 # timeout=10
Commit message: "Use float as accumulator for GEMV on half_t"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 23cfe6f7edcdf66fe67a665eb6238ea23488100a (origin/develop)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 23cfe6f7edcdf66fe67a665eb6238ea23488100a # timeout=10
Commit message: "Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id"
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-list --no-walk c93ccb46856676872c7c7a7b9b2a0249ae4fb83a # timeout=10
[KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight] $ /bin/bash -el /tmp/jenkins114156087274904549.sh
***Forced exclusive execution
Job <79808> is submitted to queue .
<>
<>
Running on machine: white
KokkosKernels Repository Status:  58f6be166c4247295d98be98242d1e3c34203b10 Use float as accumulator for GEMV on half_t

Kokkos Repository Status: 23cfe6f7edcdf66fe67a665eb6238ea23488100a Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id

Going to test compilers: cuda/10.1.105
Testing compiler cuda/10.1.105
Unrecognized compiler cuda/10.1.105 when looking for Spack variants
Unrecognized compiler cuda/10.1.105 when looking for Spack variants
Unrecognized compiler cuda/10.1.105 when looking for Spack variants
Starting job cuda-10.1.105-Cuda_Serial-release
kokkos devices: Cuda,Serial
kokkos arch: Power8,Pascal60
kokkos options:
kokkos cuda options: ,enable_lambda
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized
extra_args: --no-default-eti
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutRight
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_CUDA9 # 373 (click to expand)

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on White (Testbed Power8) in workspace /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA9
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA9/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/brian-kelley/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/brian-kelley/kokkos-kernels
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/brian-kelley/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse refs/remotes/origin/Fix1081^{commit} # timeout=10
Checking out Revision 58f6be166c4247295d98be98242d1e3c34203b10 (refs/remotes/origin/Fix1081)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 58f6be166c4247295d98be98242d1e3c34203b10 # timeout=10
Commit message: "Use float as accumulator for GEMV on half_t"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_CUDA9/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 23cfe6f7edcdf66fe67a665eb6238ea23488100a (origin/develop)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 23cfe6f7edcdf66fe67a665eb6238ea23488100a # timeout=10
Commit message: "Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id"
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-list --no-walk c93ccb46856676872c7c7a7b9b2a0249ae4fb83a # timeout=10
[KokkosKernels_PullRequest_Tpls_CUDA9] $ /bin/bash -el /tmp/jenkins5389878031957957590.sh
***Forced exclusive execution
Job <79809> is submitted to queue .
<>
<>
Running on machine: white
KokkosKernels Repository Status:  58f6be166c4247295d98be98242d1e3c34203b10 Use float as accumulator for GEMV on half_t

Kokkos Repository Status: 23cfe6f7edcdf66fe67a665eb6238ea23488100a Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id

Going to test compilers: cuda/9.2.88
Testing compiler cuda/9.2.88
Unrecognized compiler cuda/9.2.88 when looking for Spack variants
Unrecognized compiler cuda/9.2.88 when looking for Spack variants
Unrecognized compiler cuda/9.2.88 when looking for Spack variants
Starting job cuda-9.2.88-Cuda_OpenMP-release
kokkos devices: Cuda,OpenMP
kokkos arch: Power8,Pascal60
kokkos options:
kokkos cuda options: force_uvm,enable_lambda
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_GCC720_GCC740 # 371 (click to expand)

Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on White (Testbed Power8) in workspace /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720_GCC740
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720_GCC740/kokkos-kernels/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/brian-kelley/kokkos-kernels # timeout=10
Fetching upstream changes from https://github.com/brian-kelley/kokkos-kernels
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/brian-kelley/kokkos-kernels +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse refs/remotes/origin/Fix1081^{commit} # timeout=10
Checking out Revision 58f6be166c4247295d98be98242d1e3c34203b10 (refs/remotes/origin/Fix1081)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 58f6be166c4247295d98be98242d1e3c34203b10 # timeout=10
Commit message: "Use float as accumulator for GEMV on half_t"
First time build. Skipping changelog.
The recommended git tool is: NONE
No credentials specified
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse --resolve-git-dir /home/jenkins/white-new/workspace/KokkosKernels_PullRequest_Tpls_GCC720_GCC740/kokkos/.git # timeout=10
Fetching changes from the remote Git repository
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config remote.origin.url https://github.com/kokkos/kokkos.git # timeout=10
Fetching upstream changes from https://github.com/kokkos/kokkos.git
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git --version # timeout=10
 > git --version # 'git version 2.10.1'
Setting http proxy: proxy.sandia.gov:80
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git fetch --tags --progress -- https://github.com/kokkos/kokkos.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-parse origin/develop^{commit} # timeout=10
Checking out Revision 23cfe6f7edcdf66fe67a665eb6238ea23488100a (origin/develop)
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git config core.sparsecheckout # timeout=10
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git checkout -f 23cfe6f7edcdf66fe67a665eb6238ea23488100a # timeout=10
Commit message: "Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id"
 > /home/projects/pwr8-rhel73-lsf/git/2.10.1/bin/git rev-list --no-walk c93ccb46856676872c7c7a7b9b2a0249ae4fb83a # timeout=10
[KokkosKernels_PullRequest_Tpls_GCC720_GCC740] $ /bin/bash -el /tmp/jenkins738491029188376355.sh
***Forced exclusive execution
Job <79810> is submitted to queue .
<>
<>
Running on machine: white
KokkosKernels Repository Status:  58f6be166c4247295d98be98242d1e3c34203b10 Use float as accumulator for GEMV on half_t

Kokkos Repository Status: 23cfe6f7edcdf66fe67a665eb6238ea23488100a Merge pull request #4235 from masterleinad/require_kokkos_cxx_compiler_id

Going to test compilers: gcc/7.2.0 gcc/7.4.0
Testing compiler gcc/7.2.0
Starting job gcc-7.2.0-Serial-release
kokkos devices: Serial
kokkos arch: Power8,Pascal60
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
PASSED gcc-7.2.0-Serial-release
Testing compiler gcc/7.4.0
Starting job gcc-7.2.0-OpenMP-release
kokkos devices: OpenMP
kokkos arch: Power8,Pascal60
kokkos options:
kokkos cuda options:
kokkos cxxflags: -O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wignored-qualifiers -Wempty-body -Wclobbered -Wuninitialized
extra_args:
kokkoskernels scalars: 'double,complex_double'
kokkoskernels ordinals: int
kokkoskernels offsets: int,size_t
kokkoskernels layouts: LayoutLeft
Build was aborted
Aborted by Evan Harvey
Finished: ABORTED

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection Is Not Necessary for this Pull Request.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light

  • Build Num: 414
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720

  • Build Num: 407
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light_LayoutRight

  • Build Num: 54
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720

  • Build Num: 398
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10

  • Build Num: 379
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL18

  • Build Num: 384
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight

  • Build Num: 190
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA9

  • Build Num: 374
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720_GCC740

  • Build Num: 372
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (brian-kelley/kokkos-kernels)
  • Branch: Fix1081
  • SHA: 718fe40
  • Mode: TEST_REPO

Pull Request Author: brian-kelley

Copy link
Contributor

@srajama1 srajama1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If there a simple pattern for why we do sometimes float(alpha * something) and some other time alpha * float(something)

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Jenkins Testing: all Jobs PASSED

Pull Request Auto Testing has PASSED (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light

  • Build Num: 414
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720

  • Build Num: 407
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC720_Light_LayoutRight

  • Build Num: 54
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720

  • Build Num: 398
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10

  • Build Num: 379
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL18

  • Build Num: 384
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA10_LayoutRight

  • Build Num: 190
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_CUDA9

  • Build Num: 374
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GCC720_GCC740

  • Build Num: 372
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_BRANCH Fix1081
KOKKOSKERNELS_SOURCE_REPO https://github.com/brian-kelley/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 718fe40
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA c151fbb
PR_LABELS enhancement
PULLREQUESTNUM 1082
TEST_REPO_ALIAS KOKKOSKERNELS

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Merge Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
THE LAST COMMIT TO THIS PULL REQUEST HAS BEEN REVIEWED, BUT NOT ACCEPTED OR REQUIRES CHANGES

@kokkos-devops-admin
Copy link

All Jobs Finished; status = PASSED, However Inspection must be performed before merge can occur...

@brian-kelley
Copy link
Contributor Author

@srajama1 The details are a little different depending on how the result is computed (array reducer, scalar reducer or atomics) but alpha is always multiplied by the high precision result (never half_t). Only after the full result is calculated in float is it converted back to half_t to write back to y.

Copy link
Contributor

@srajama1 srajama1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @brian-kelley ! Approving it.

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Merge Inspection' - SUCCESS: The last commit to this Pull Request has been INSPECTED AND APPROVED by [ srajama1 ]!

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Pull Request MUST BE MERGED MANUALLY BY Project Team - This Repo does not support Automerge

@brian-kelley brian-kelley merged commit 3b7e9eb into kokkos:develop Aug 18, 2021
@brian-kelley brian-kelley deleted the Fix1081 branch August 18, 2021 16:04
brian-kelley added a commit to brian-kelley/kokkos-kernels that referenced this pull request Mar 11, 2022
(same change that was done in kokkos#1082 for scalar = half_t)
This improves answer accuracy and also performance on GPU, since
there isn't an atomic_add for these types but there is for float.
brian-kelley added a commit that referenced this pull request Mar 15, 2022
(same change that was done in #1082 for scalar = half_t)
This improves answer accuracy and also performance on GPU, since
there isn't an atomic_add for these types but there is for float.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants