Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add functionality for rebuilding software: try it on OpenMPI 4.1.x to fix smcuda issue #488

Merged
merged 63 commits into from
Mar 26, 2024

Conversation

bedroge
Copy link
Collaborator

@bedroge bedroge commented Mar 1, 2024

Attempt to implement a solution for #312 with --fakeroot.

Copy link

eessi-bot bot commented Mar 1, 2024

Instance eessi-bot-mc-aws is configured to build:

  • arch x86_64/generic for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/generic for repo eessi-hpc.org-2023.06-software
  • arch x86_64/generic for repo eessi.io-2023.06-compat
  • arch x86_64/generic for repo eessi.io-2023.06-software
  • arch x86_64/intel/haswell for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi-hpc.org-2023.06-software
  • arch x86_64/intel/haswell for repo eessi.io-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi.io-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi.io-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi.io-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi.io-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi.io-2023.06-software
  • arch aarch64/generic for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/generic for repo eessi-hpc.org-2023.06-software
  • arch aarch64/generic for repo eessi.io-2023.06-compat
  • arch aarch64/generic for repo eessi.io-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi-hpc.org-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi.io-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi.io-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi-hpc.org-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi.io-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi.io-2023.06-software

EESSI-install-software.sh Outdated Show resolved Hide resolved
@bedroge
Copy link
Collaborator Author

bedroge commented Mar 1, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

Copy link

eessi-bot bot commented Mar 1, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Mar 1, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/6916

date job status comment
Mar 01 12:51:54 UTC 2024 submitted job id 6916 awaits release by job manager
Mar 01 12:52:27 UTC 2024 released job awaits launch by Slurm scheduler
Mar 01 12:56:55 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-6916.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Mar 01 12:56:55 UTC 2024 test result (no tests yet)

@bedroge
Copy link
Collaborator Author

bedroge commented Mar 1, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

Copy link

eessi-bot bot commented Mar 1, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Mar 1, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/6917

date job status comment
Mar 01 13:00:03 UTC 2024 submitted job id 6917 awaits release by job manager
Mar 01 13:00:08 UTC 2024 released job awaits launch by Slurm scheduler
Mar 01 13:01:19 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-6917.out
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Mar 01 13:01:19 UTC 2024 test result (no tests yet)

EESSI-install-software.sh Outdated Show resolved Hide resolved
@bedroge
Copy link
Collaborator Author

bedroge commented Mar 1, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

Copy link

eessi-bot bot commented Mar 1, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Mar 1, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/6918

date job status comment
Mar 01 13:04:07 UTC 2024 submitted job id 6918 awaits release by job manager
Mar 01 13:04:31 UTC 2024 released job awaits launch by Slurm scheduler
Mar 01 13:05:42 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-6918.out
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1709298309.tar.gzsize: 0 MiB (180914 bytes)
entries: 3
modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/neoverse_v1/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/neoverse_v1
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
Mar 01 13:05:42 UTC 2024 test result (no tests yet)

eessi_container.sh Outdated Show resolved Hide resolved
@bedroge
Copy link
Collaborator Author

bedroge commented Mar 1, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

Copy link

eessi-bot bot commented Mar 1, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Mar 1, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/6919

date job status comment
Mar 01 13:09:15 UTC 2024 submitted job id 6919 awaits release by job manager
Mar 01 13:10:01 UTC 2024 released job awaits launch by Slurm scheduler
Mar 01 13:11:11 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-6919.out
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Mar 01 13:11:11 UTC 2024 test result (no tests yet)

@bedroge
Copy link
Collaborator Author

bedroge commented Mar 1, 2024

Ah, this doesn't work, we're changing back to eessi in the loop... I guess we need to run the loop twice...

@bedroge
Copy link
Collaborator Author

bedroge commented Mar 1, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1

@casparvl
Copy link
Collaborator

bot: build repo:eessi.io-2023.06-software arch:aarch64/generic
bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_n1
bot: build repo:eessi.io-2023.06-software arch:aarch64/neoverse_v1
bot: build repo:eessi.io-2023.06-software arch:x86_64/generic
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3
bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/haswell
bot: build repo:eessi.io-2023.06-software arch:x86_64/intel/skylake_avx512

Copy link

eessi-bot bot commented Mar 26, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8577

date job status comment
Mar 26 13:40:02 UTC 2024 submitted job id 8577 awaits release by job manager
Mar 26 13:40:12 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:44:48 UTC 2024 running job 8577 is running
Mar 26 14:44:35 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8577.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1711464128.tar.gzsize: 13 MiB (14244746 bytes)
entries: 3656
modules under 2023.06/software/linux/aarch64/generic/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/aarch64/generic/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/aarch64/generic
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 14:44:35 UTC 2024 test result (no tests yet)
Mar 26 15:03:54 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-aarch64-generic-1711464128.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_n1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8578

date job status comment
Mar 26 13:40:06 UTC 2024 submitted job id 8578 awaits release by job manager
Mar 26 13:40:15 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:44:51 UTC 2024 running job 8578 is running
Mar 26 14:44:37 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8578.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_n1-1711464083.tar.gzsize: 13 MiB (14318957 bytes)
entries: 3656
modules under 2023.06/software/linux/aarch64/neoverse_n1/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/aarch64/neoverse_n1/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/aarch64/neoverse_n1
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 14:44:37 UTC 2024 test result (no tests yet)
Mar 26 15:04:15 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-aarch64-neoverse_n1-1711464083.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8579

date job status comment
Mar 26 13:40:09 UTC 2024 submitted job id 8579 awaits release by job manager
Mar 26 13:40:18 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:44:54 UTC 2024 running job 8579 is running
Mar 26 14:26:57 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8579.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1711463143.tar.gzsize: 13 MiB (14333465 bytes)
entries: 3656
modules under 2023.06/software/linux/aarch64/neoverse_v1/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/aarch64/neoverse_v1/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/aarch64/neoverse_v1
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 14:26:57 UTC 2024 test result (no tests yet)
Mar 26 15:03:31 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-aarch64-neoverse_v1-1711463143.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8580

date job status comment
Mar 26 13:40:13 UTC 2024 submitted job id 8580 awaits release by job manager
Mar 26 13:41:27 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:42:40 UTC 2024 running job 8580 is running
Mar 26 15:01:16 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8580.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-1711465158.tar.gzsize: 13 MiB (14304700 bytes)
entries: 3671
modules under 2023.06/software/linux/x86_64/generic/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/x86_64/generic/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/x86_64/generic
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 15:01:16 UTC 2024 test result (no tests yet)
Mar 26 15:04:35 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-generic-1711465158.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8581

date job status comment
Mar 26 13:40:17 UTC 2024 submitted job id 8581 awaits release by job manager
Mar 26 13:41:22 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:42:37 UTC 2024 running job 8581 is running
Mar 26 15:02:19 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8581.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1711465249.tar.gzsize: 13 MiB (14364320 bytes)
entries: 3671
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen2/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/x86_64/amd/zen2
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 15:02:19 UTC 2024 test result (no tests yet)
Mar 26 15:04:55 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-1711465249.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8582

date job status comment
Mar 26 13:40:21 UTC 2024 submitted job id 8582 awaits release by job manager
Mar 26 13:41:25 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:46:09 UTC 2024 running job 8582 is running
Mar 26 14:45:51 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8582.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1711464261.tar.gzsize: 13 MiB (14363644 bytes)
entries: 3671
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/x86_64/amd/zen3
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 14:45:51 UTC 2024 test result (no tests yet)
Mar 26 15:05:15 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen3-1711464261.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-haswell for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8583

date job status comment
Mar 26 13:40:24 UTC 2024 submitted job id 8583 awaits release by job manager
Mar 26 13:41:30 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:47:28 UTC 2024 running job 8583 is running
Mar 26 15:02:21 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8583.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-haswell-1711465292.tar.gzsize: 13 MiB (14380504 bytes)
entries: 3671
modules under 2023.06/software/linux/x86_64/intel/haswell/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/x86_64/intel/haswell/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/x86_64/intel/haswell
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 15:02:21 UTC 2024 test result (no tests yet)
Mar 26 15:05:34 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-haswell-1711465292.tar.gz to S3 bucket succeeded

Copy link

eessi-bot bot commented Mar 26, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-intel-skylake_avx512 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.03/pr_488/8584

date job status comment
Mar 26 13:40:28 UTC 2024 submitted job id 8584 awaits release by job manager
Mar 26 13:41:33 UTC 2024 released job awaits launch by Slurm scheduler
Mar 26 13:47:30 UTC 2024 running job 8584 is running
Mar 26 14:44:40 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-8584.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1711464177.tar.gzsize: 13 MiB (14312282 bytes)
entries: 3671
modules under 2023.06/software/linux/x86_64/intel/skylake_avx512/modules/all
OpenMPI/4.1.4-GCC-12.2.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
software under 2023.06/software/linux/x86_64/intel/skylake_avx512/software
OpenMPI/4.1.4-GCC-12.2.0
OpenMPI/4.1.5-GCC-12.3.0
OpenMPI/4.1.6-GCC-13.2.0
other under 2023.06/software/linux/x86_64/intel/skylake_avx512
.lmod/cache/spiderT.lua
.lmod/cache/spiderT.luac_5.1
.lmod/cache/timestamp
.lmod/SitePackage.lua
Mar 26 14:44:40 UTC 2024 test result (no tests yet)
Mar 26 15:05:54 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1711464177.tar.gz to S3 bucket succeeded

@casparvl casparvl added the bot:deploy Ask bot to deploy missing software installations to EESSI label Mar 26, 2024
@bedroge
Copy link
Collaborator Author

bedroge commented Mar 26, 2024

Before closing/merging the staging PRs, I will manually open a transaction on the Stratum and remove the following directories:

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/haswell/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/haswell/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/haswell/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/skylake_avx512/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/skylake_avx512/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/skylake_avx512/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.6-GCC-13.2.0

In the same transaction, I'll then extract the 8 tarballs from this PRs, and update the Lmod caches. Finally, I'll publish the transaction. The staging PRs can then be either closed (in that case they'll be rejected/ignored) or even merged (then the tarballs will just be ingested again, which should be harmless).

@casparvl
Copy link
Collaborator

Just to log the full procedure: we decided to do this manually, so we can first remove the current OpenMPI installations. If we would deploy this PR automatically, the new files would be ingested (and overwrite existing ones), but old files that are not overwritten would remain in those prefixes. Not a huge deal, but this time, we figured to do a clean install manually. The steps we are taking for that are:

  • do bot:deploy
  • wait for the staging PRs. tarballs should then already be available on the Stratum 0
  • open a transaction
  • remove all OpenMPI software dirs
  • extract all 8 tarballs
  • run the lmod cache update script
  • publish the transaction
  • close all staging PRs (could even merge them, then they get ingested again, which should be harmless)

@bedroge
Copy link
Collaborator Author

bedroge commented Mar 26, 2024

This is the manually created script that I used to remove the old versions, extract the tarballs containing the new versions, and update the Lmod caches:

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/haswell/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/haswell/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/haswell/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/skylake_avx512/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/skylake_avx512/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/intel/skylake_avx512/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.6-GCC-13.2.0

rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.4-GCC-12.2.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.5-GCC-12.3.0
rm -rf /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.6-GCC-13.2.0

cd /cvmfs/software.eessi.io/versions

tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-x86_64-intel-skylake_avx512-1711464177.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-x86_64-intel-haswell-1711465292.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-x86_64-generic-1711465158.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-x86_64-amd-zen3-1711464261.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-x86_64-amd-zen2-1711465249.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-aarch64-neoverse_v1-1711463143.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-aarch64-neoverse_n1-1711464083.tar.gz
tar xvzf /srv/tmp/tarballs/eessi-2023.06-software-linux-aarch64-generic-1711464128.tar.gz

/opt/eessi/filesystem-layer/scripts/update_lmod_caches.sh /cvmfs/software.eessi.io/versions/2023.06

@bedroge
Copy link
Collaborator Author

bedroge commented Mar 26, 2024

All tarballs have been published. 🎉

@casparvl
Copy link
Collaborator

Ok, now we only need #513 to fix our CI, and then we can merge this...

bedroge added a commit to Neves-P/software-layer that referenced this pull request Mar 26, 2024
Copy link
Contributor

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2023.06-software.eessi.io 2023.06 version of software.eessi.io bot:deploy Ask bot to deploy missing software installations to EESSI enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Procedure for reinstalling software with the bot (and preventing chown/chmod issues in the container)
4 participants