Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{lib}[GCCcore/13.2.0] CUDA v12.4.0, NCCL v2.20.5, magma v2.7.2, ... w/ CUDA 12.4.0 #20107

Merged

Conversation

Flamefire
Copy link
Contributor

@Flamefire Flamefire commented Mar 13, 2024

(created using eb --new-pr)

This is important in so far that this is the first CUDA working with GCC 13 (foss/2023b)

Requires the include-fixed from GCC 13.2. E.g. rebuilding GCCcore with easybuilders/easybuild-easyblocks#3254

…2.4.0.eb, magma-2.7.2-foss-2023b-CUDA-12.4.0.eb, UCX-CUDA-1.15.0-GCCcore-13.2.0-CUDA-12.4.0.eb
@Flamefire
Copy link
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 4 out of 4 (4 easyconfigs in total)
i8025 - Linux Rocky Linux 8.7 (Green Obsidian), x86_64, AMD EPYC 7352 24-Core Processor (zen2), 8 x NVIDIA NVIDIA A100-SXM4-40GB, 545.23.08, Python 3.8.13
See https://gist.github.com/Flamefire/f09907d94a615014769a1ef422d9ca68 for a full test report.

@boegel boegel added the update label Mar 13, 2024
@boegel boegel added this to the release after 4.9.0 milestone Mar 13, 2024
Copy link
Contributor

@Micket Micket left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@migueldiascosta
Copy link
Member

Can't upload a test report (system behind a proxy) but tested successfully on Arm (Grace-Hopper)

Copy link
Contributor

@Micket Micket left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@Micket
Copy link
Contributor

Micket commented Mar 19, 2024

@boegelbot please test @ jsc-zen3
CORE_CNT=16

@Micket
Copy link
Contributor

Micket commented Mar 19, 2024

Test report by @Micket
SUCCESS
Build succeeded for 4 out of 4 (4 easyconfigs in total)
vera-gpu2 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) Gold 6130 CPU @ 2.10GHz, 1 x NVIDIA Tesla V100-SXM2-32GB, 550.54.14, Python 3.6.8
See https://gist.github.com/Micket/21cf38c5e6ea94d28cf27c4b7b395ab9 for a full test report.

@boegelbot
Copy link
Collaborator

@Micket: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=20107 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_20107 --ntasks="16" ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 3803

Test results coming soon (I hope)...

- notification for comment with ID 2007303081 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 4 out of 4 (4 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.3, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.18
See https://gist.github.com/boegelbot/acca0a92b8df0dea3fbb194720c1b8cc for a full test report.

@Micket
Copy link
Contributor

Micket commented Mar 19, 2024

Going in, thanks @Flamefire!

@Micket Micket merged commit e788852 into easybuilders:develop Mar 19, 2024
9 checks passed
@Flamefire Flamefire deleted the 20240313135157_new_pr_CUDA1240 branch March 19, 2024 15:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants