-
Notifications
You must be signed in to change notification settings - Fork 575
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Address failures from merge of updated TriBITS (#10614, TriBITSPub/TriBITS#299) #10774
Comments
Just to document, I saw issues when building CISM-Albany (a land-ice modeling code) on top of Trilinos and Albany. The nature of the problem was that Trilinos_LIBRARY_DIRS, previously populated with the right path, has become blank. I fixed it by setting this in the CMakeLists.txt file of CISM-Albany (see attachment). I think this fix is OK for our needs, but please let me know if there is a better one. |
@ikalash, what is missing is linking to actual targets. I bet if you use CMake 3.23+ and set:
then the configure will fail and it will show you were it is trying to link against raw string names See: |
There is a bug in the TriBITS-generated NOTE: My local testing of SPARC + Trilinos would not have caught this because the Trilinos build dir does not get removed before configuring SPARC. So this was only going to be caught after the merge to 'develop'. Details for find_package(Trilinos) errors due to pointing into the build dirNote that we are also see SPARC Trilinos integration failures starting SPARC testing day 2022-07-19. These are configure-time errors inside of
That code that is failing in
Hum, that is the wrong directory location:
Oops, that is a bug! How did that slip through TriBITS testing? |
The top builds to prioritize fixing are the ones used in SPARC Trilinos Integration testing listed here and are:
|
FYI: I created the branch 10774-pre-tribits-update-ref off of the commit bdcac66 which corresponds to the version of Trilinos 'develop' for the 'atdm-nighlty' branch for testing day 2022-07-18 just before the merge of PR #10614. Details are below. Creating the branch '10774-pre-tribits-update-ref' for ATDM Trilinos testing day 2022-07-18 details: (click to expand)The version of that ATDM Trilinos builds for the testing day 2022-07-18 just before PR #10614 was synced was f15068c. That comes from:
So the Trilinos version of 'develop' for the ATDM Trilinos testing day So I create the Trilinos branch '10774-pre-tribits-update-ref' off of 'develop' at the commit bdcac66:
|
…all-tree (trilinos#10774) Addresses TriBITSPub/TriBITS#500 which is part of trilinos#10774.
FYI: The fix for the bug with installed |
…nual-updates (#10774) Addresses TriBITSPub/TriBITS#500
I manually merged the PR branch for #10784 into the 'atdm-nightly-manual-updates' branch so this will get run in the ATDM Trilinos builds tomorrow and we should see (some of) the SPARC Trilinos Integration builds clear up the day after that. Update of 'atdm-nightly-manual-updates' branch: (click to expand)
|
Looking over all of the errors in the builds listed above for ATDM Trilinos testing day 2022-07-19 falls into a few different categories: 1) Link errors for libcgns missing hdf5 symbolsThese errors look like:
This includes the builds:
I think these can all be fixed by adding ToDo: Characterize the other failures! |
This should fix the build errors for the SEACAS exectuable SEACASIoss_Utst_structured_decomp.exe on a bunch of ATDM Trilinos builds (see
…rilinos#10774) A defect added as part of work on TriBITSPub/TriBITS#299.
…rilinos/Trilinos#10774) THe logic for constructing the full path for the FindTPL<tplName>Dependencies.cmake file is not correct when <tplName>_FINDMOD is an absolute path instead of a relative path. This causes all of the 'TribitsExampleProject2_find_tpl_parts' tests to fail.
Origin repo remote tracking branch: 'github/master' Origin repo remote repo URL: 'github = git@github.com:TriBITSPub/TriBITS.git' Git describe: Vera4.0-RC1-start-1219-g8b3872ed At commit: commit 4b26997a2b19c29cbc6deaba5ad303b2336b63e6 Author: Roscoe A. Bartlett <rabartl@sandia.gov> Date: Thu Jul 21 10:35:22 2022 -0600 Summary: Add dependency of CGNS on HDF5 (#10774)
Continuing from above, the next failure seen in many different builds is: 2) CUDA build failures due to duplicate CUDA library function definitions when creating libIntercept.so.13.5These errors look like:
We see these in the CUDA builds:
ToDo: Characterize the rest of the failures! |
…stall-tree Fix <tplName>Config.cmake files to not point into build dir (#10774) We are force merging this PR because all of the PR tests passed except the new CUDA 11 build, which had 2 failing tests. There were 2 Tpetra tests that failed that have been randomly failing for other PRs and the failures are not related to these changes.
Continuing from above, the next failure seen in many different builds is: 3) Link errors missing '__dlopen'These errors look like:
These errors start in Sacado for the 'ats1'
and these errors start in SEACAS for the 'ats1'
That is all of the systematic errors that impact all of the builds. |
4) Compile errors in Kokkos for coming from gtest.hThe last new set of build errors are just for the build: and they look like:
and they are all in Kokkos. I am very curious to see what is causing this. But from this query it seems these build errors started on testing day 2022-05-21 which was the first since testing day 2022-05-16 as shown below: So these errors are unrelated to the TriBITS update (which is good because I have no idea how the TriBITS updates could be causing this). |
FYI: The failure to tentatively enable the BinUtils TPL shown above for the build Trilinos-atdm-ats1-knl_intel-19.0.4_mpich-7.7.15_openmp_static_opt was occurring before the merge of PR #10614 on 2022-07-18. If you go back to the build Trilinos-atdm-ats1-knl_intel-19.0.4_mpich-7.7.15_openmp_static_opt on testing day 2022-07-17 you will see:
And looking at one of the recent builds today for Trilinos-atdm-ats1-knl_intel-19.0.4_mpich-7.7.15_openmp_static_opt you can see:
Therefore, the failure to tentatively enable the BinUtils TPL has nothing to do with the TriBITS upgrade merged in PR #10614. |
With the post of PR #10930, all of these issues should be addressed. Putting this issue In Review. |
With the merge of PR #10930, this is (finally) complete! |
…s:develop' (8906842). * trilinos-develop: (128 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate Testing on Geminga: Do not disable Kokkos in Epetra build ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…s:develop' (8906842). * trilinos-develop: (130 commits) Intrepid2: update TensorData.setFirstComponentExtentInDimension0 to modify extents_[0] (trilinos#10929) Tpetra: Adding configure option to disable Kokkos integration test MueLu: Allow to print Kokkos config when default node type is used Automatic snapshot commit from tribits at 142e5362 Disable Pliris tests in ATS2 GenConfig builds (trilinos#10931) Force disable Pliris in ATS2 builds (trilinos#10931) Automatic snapshot commit from tribits at ab419429 Change cmake_minimum_required() from 3.17.1 to 3.0 (TriBITSPub/TriBITS#522) Pliris: Remove local var hiding cache var Pliris_ENABLE_DREAL (trilinos#10774, TriBITSPub/TriBITS#516) Remove printing of vars that are now empty (TriBITSPub/TriBITS#299) Panzer: move periodic helper typedefs into namespace Revert incorrect fix in previous commit Fix typos in some docs fix scratch typos STK: Snapshot 08-22-22 12:44 Phalanx: remove cuda compiler warnings and add test for new use case for vov changed a double to a scalar_type to compile for complex arith MueLu: Fix signed vs unsigned comparison in Aggregates_kokkos.cpp Amesos2 : trying to fix MKL header including issues MueLu: Add Aggregates_kokkos.ComputeNodesInAggregate ...
…os#10774, TriBITSPub/TriBITS#516) A TriBITS update is exporting package cache vars to the <Package>Config.cmake file and you can't have a local var with the same name as a cache var with different values. In this case, it was just lucky that no downstream package was reading this var (through the cache var) because they would have gotten the wrong value. It seems that only code in CMakeLists.txt files under packages/pliris/ were reading this var.
Description
With the merge of the updated TriBITS yesterday with the merge of PR #10614, we seem to be seeing a lot of ATDM Trilinos configurations with new build failures for ATDM Trilinos testing day 2022-07-19 after the merge of PR #10614 compared to the previous ATDM Trilinos testing day 2022-07-18 before the merge of PR #10614.
In particular, we see new build failures for 20 builds:
Trilinos-atdm-ats1-hsw_intel-19.0.4_mpich-7.7.15_openmp_static_dbg
Trilinos-atdm-ats1-hsw_intel-19.0.4_mpich-7.7.15_openmp_static_opt
Trilinos-atdm-ats1-knl_intel-19.0.4_mpich-7.7.15_openmp_static_dbg
Trilinos-atdm-ats1-knl_intel-19.0.4_mpich-7.7.15_openmp_static_opt
Trilinos-atdm-ats2-gnu-7.3.1-spmpi-rolling_serial_static_dbg
Trilinos-atdm-ats2-gnu-7.3.1-spmpi-rolling_serial_static_opt
Trilinos-atdm-cee-rhel7_cuda-10.1.243_gnu-7.2.0_openmpi-4.0.3_shared_dbg
Trilinos-atdm-cee-rhel7_cuda-10.1.243_gnu-7.2.0_openmpi-4.0.3_shared_opt
Trilinos-atdm-cee-rhel7_intel-19.0.3_intelmpi-2018.4_serial_static_opt
Trilinos-atdm-cee-rhel7_intel-19.0.3_mpich2-3.2_openmp_static_opt
Trilinos-atdm-cee-rhel7_mini-no-mpi_intel-19.0.3_static_opt
Trilinos-atdm-cee-rhel7_mini_intel-19.0.3_mpich2-3.2_static_opt
Trilinos-atdm-cts1-intel-19.0.4_openmpi-4.0.3_openmp_static_dbg
Trilinos-atdm-cts1-intel-19.0.4_openmpi-4.0.3_openmp_static_opt
Trilinos-atdm-cts1empire-intel-18.0.2_openmpi-4.0.1_openmp_static_dbg
Trilinos-atdm-cts1empire-intel-18.0.2_openmpi-4.0.1_openmp_static_opt
Trilinos-atdm-sems-rhel7-cuda-10.1-Volta70-complex-shared-release-debug
All of the build errors I have looked at so far in the above list are link errors due to missing symbols. My guess is that these are due to missing TPL dependencies.
I will have to do reference builds for each of these and then builds with updated TriBITS (merged from the branch in PR #10614 carefully) and compare the link lines carefully.
NOTE: The build errors for the builds:
Trilinos-atdm-van1-tx2_arm-20.1_openmpi-4.0.5_openmp_static_dbg
Trilinos-atdm-van1-tx2_arm-20.1_openmpi-4.0.5_openmp_static_opt
were due to a a compiler license check like:
NOTE: the build errors for the builds:
Trilinos-atdm-sems-rhel7-clang-7.0.1-openmp-shared-release-debug
Trilinos-atdm-sems-rhel7-clang-7.0.1-openmp-shared-release
were due to unrelated problems with the Krino package (see #10524).
NOTE: The build errors for the build:
Trilinos-atdm-tlcc2-intel-opt-openmp
are source compile errors that seem to have nothing to do with the TriBITS changes (see below).
Tasks
<tplName>Config.cmake
files pointing into the build tree: TriBITSPub/TriBITS#500 ... Addressed in PR Fix <tplName>Config.cmake files to not point into build dir (#10774) #10784__dlopen
for 'ats1' builds (fixed in Change type to IMPORTED STATIC for *.a file (trilinos/Trilinos#10774) TriBITSPub/TriBITS#504)find_package()
of related TriBITS packages from different subdirs (an example from a SEACAS user, see Fix find_package(<Package>) calls from TriBITS-generated <Package>Config.cmake files from different dirs (#299) TriBITSPub/TriBITS#505) ... Addressed in TriBITS PR Fix find_package() from separate subdirs for <Package>Config.cmake files (#505) TriBITSPub/TriBITS#506 and Trilinos PR Add Trilinos install tests, test demo app, fix cmake --install, fix PR errors (#10774, #10810, #10842) #10813<Package>_ENABLE_<feature>
variables into<Package>Config.cmake
files based ontribits_add_option_and_define()
calls (see Anasazi: code built against Trilinos with rbgen not finding rbgen #10794 (comment), below, and Export informational variables to<Package>Config.cmake
files for downstream CMake projects TriBITSPub/TriBITS#516) ... Addressed in PR Final fixes for TriBITS upgrade to modern CMake targets (#10614, #10774) #10930cmake_minumum_required()
calls to<Package>Config.cmake
files to avoid cmake warning causing errors? (See TRILINOSHD-128)... Addressed in PR Final fixes for TriBITS upgrade to modern CMake targets (#10614, #10774) #10930mpi_usempif08
(see Broken tests on Weaver sandialabs/Albany#822).Trilinos_ENABLE_EXAMPLES=OFF
causesTpetraCore_ENABLE_Tests=OFF
andTpetraTSQR_ENABLE_TESTS=OFF
even thoughTpetra_ENABLE_Tests=ON
( see Tpetra: unexercised test; missing ROCm test; change in sub package test behavior #11002) ... Fixed in Fix package test/example enable logic for global test/example case (#11002) #11099.NOTE: Above, a task is marked as complete if the PR that fixes it is ready to merge and is just being held up by a final approval or PR testing.
Deferred Scope
HDF5_ALLOW_PACKAGE_PREFIND:BOOL=TRUE
with currentFindTPLHDF5.cmake
module (see below) ... Going to wait to upgrade the min version of CMake to 3.22 to use modern targets from find_package(HDF5)The text was updated successfully, but these errors were encountered: