Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update ROCm to version 5.6.1 #9050

Conversation

fwyzard
Copy link
Contributor

@fwyzard fwyzard commented Mar 6, 2024

Update ROCm to version 5.6.1:

Add the ROCm packages to support OpenMP and Thrust:

  • the OpenMP packages are required to compile any source that uses #pragma omp ... with hipcc;
  • the Thrust packages are needed for compatibility with CUDA, and are used by the HIP backend of PyTorch.

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

A new Pull Request was created by @fwyzard for branch IB/CMSSW_14_1_X/master.

@iarspider, @smuzaffar, @aandvalenzuela can you please review it and eventually sign? Thanks.
@rappoccio, @antoniovilela, @sextonkennedy you are the release manager for this.
cms-bot commands are listed here

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

cms-bot internal usage

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37928/summary.html
COMMIT: 64510f3
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37928/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found compilation error when building:

Build successful rocm.
(, , ) done
Trying to install the rpm package external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e just built.
Checking local path dependency for rpm package external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e just build.
RPM installation stderr rocm:
error: Failed dependencies:
	libffi.so.6()(64bit) is needed by external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e-1-1.x86_64
	libpython3.6m.so.1.0()(64bit) is needed by external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e-1-1.x86_64

Failed to install RPM for rocm
Requested to quit.


@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37928/summary.html COMMIT: 64510f3 CMSSW: CMSSW_14_1_X_2024-03-05-2300/el8_amd64_gcc12 User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37928/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found compilation error when building:

Build successful rocm.
(, , ) done
Trying to install the rpm package external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e just built.
Checking local path dependency for rpm package external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e just build.
RPM installation stderr rocm:
error: Failed dependencies:
libffi.so.6()(64bit) is needed by external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e-1-1.x86_64
libpython3.6m.so.1.0()(64bit) is needed by external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e-1-1.x86_64

Failed to install RPM for rocm
Requested to quit.

@iarspider @aandvalenzuela do you have any idea why we see these errors ?

ROCm 5.6.1 should not be significantly different from ROCm 5.6.0, and that one used to build correctly.

@fwyzard fwyzard force-pushed the IB/CMSSW_14_1_X/master_rocm-5.6.1 branch from 64510f3 to 9d9dab6 Compare March 6, 2024 13:12
@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

Pull request #9050 was updated.

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

please test

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

please test for slc7_amd64_gcc12

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

please test for el9_amd64_gcc12

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

please test for el8_aarch64_gcc12

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 6, 2024

please test for el8_ppc64le_gcc12

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37937/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el8_aarch64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37937/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37939/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el8_ppc64le_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37939/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37938/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el9_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37938/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37932/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el8_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37932/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 6, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37936/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/slc7_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37936/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 7, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37938/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el9_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37938/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 7, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37936/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/slc7_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37936/install.sh to create a dev area with all the needed externals and cmssw changes.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 7, 2024

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-c3f070/37938/summary.html
COMMIT: 9d9dab6
CMSSW: CMSSW_14_1_X_2024-03-05-2300/el9_amd64_gcc12
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/9050/37938/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 343 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 53561 differences found in the comparisons
  • DQMHistoTests: Total files compared: 48
  • DQMHistoTests: Total histograms compared: 3293028
  • DQMHistoTests: Total failures: 113485
  • DQMHistoTests: Total nulls: 242
  • DQMHistoTests: Total successes: 3179281
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -9.552000000000001 KiB( 47 files compared)
  • DQMHistoSizes: changed ( 10224.0 ): 0.596 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 11634.0,... ): -0.439 KiB Physics/NanoAODDQM
  • DQMHistoSizes: changed ( 13034.0 ): 0.586 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 250202.181 ): 0.405 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 25202.0 ): 0.244 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 7.3 ): 0.481 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 8.0 ): -4.839 KiB SiStrip/MechanicalView
  • Checked 202 log files, 165 edm output root files, 48 DQM output files

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 7, 2024

type bugfix

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 7, 2024

@iarspider @aandvalenzuela @smuzaffar can any of you approve this for 14.1.x ?

@iarspider
Copy link
Contributor

+externals

@iarspider iarspider merged commit f8fff94 into cms-sw:IB/CMSSW_14_1_X/master Mar 7, 2024
33 checks passed
@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 7, 2024

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_14_1_X/master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @rappoccio, @sextonkennedy, @antoniovilela (and backports should be raised in the release meeting by the corresponding L2)

@smuzaffar
Copy link
Contributor

libffi.so.6()(64bit) is needed by external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e-1-1.x86_64
libpython3.6m.so.1.0()(64bit) is needed by external+rocm+5.6.1-6320ba1f36f64f70428881e859817e3e-1-1.x86_64

ROCm 5.6.1 should not be significantly different from ROCm 5.6.0, and that one used to build correctly.

@fwyzard , this is because the newly added openmp-extras-devel-16.56.0.50601-93.el%{rhel}.%{_arch}.rpm is build against system python 3.6. Following are the two libs which are linked against python 3.6

llvm/lib-debug/ompd/ompdModule.so
llvm/lib-debug/ompd/ompdModule.cpython-36m-x86_64-linux-gnu.so

@fwyzard
Copy link
Contributor Author

fwyzard commented Mar 11, 2024

I see... I guess we should drop those specific files ?
The only alternative is that we start building ROCm from sources, so we can link our version of python.

@fwyzard fwyzard deleted the IB/CMSSW_14_1_X/master_rocm-5.6.1 branch April 7, 2024 10:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants