
{bio}[foss/2023a] AlphaFold v2.3.2, dm-haiku v0.0.12, tensorstore v0.1.65 w/ CUDA v12.1.1 #19942

Merged

Conversation

ThomasHoffmann77
Contributor

@ThomasHoffmann77 ThomasHoffmann77 commented Feb 20, 2024

@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53 w/ CUDA v12.1.1 {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, OpenMM v8.0.0 HH-Suite v3.3.0 w/ CUDA v12.1.1 Feb 20, 2024
@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, OpenMM v8.0.0 HH-Suite v3.3.0 w/ CUDA v12.1.1 {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, OpenMM v8.0.0, dm-tree v0.1.8, HH-Suite v3.3.0 w/ CUDA v12.1.1 Feb 20, 2024
@jfgrimm jfgrimm added this to the 4.x milestone Feb 22, 2024
@easybuilders easybuilders deleted a comment from boegelbot Feb 22, 2024
@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, OpenMM v8.0.0, dm-tree v0.1.8, HH-Suite v3.3.0 w/ CUDA v12.1.1 {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, OpenMM v8.0.0, HH-Suite v3.3.0 w/ CUDA v12.1.1 Feb 28, 2024
@migueldiascosta
Member

fwiw, I'm getting error: HWCAP_NEON was not declared in this scope when building OpenMM from this PR on NVIDIA Grace-Hopper

it looks like openmm-8.0.0/cmake_modules/TargetArch.cmake is detecting Grace-Hopper as arm instead of armv8,

which then leads to openmm-8.0.0/CMakeLists.txt setting -D__ARM__=1 instead of -D__ARM64__=1,

which in turn leads openmm-8.0.0/openmmapi/include/openmm/internal/vectorize_neon.h to use HWCAP_NEON instead of HWCAP_ASIMD

forcing TARGET_ARCH to be armv8 in openmm-8.0.0/CMakeLists.txt fixed the issue for me
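
A minimal, purely illustrative sketch of that workaround (not the fix applied in this PR), assuming the detection call in OpenMM's CMakeLists.txt is target_architecture(TARGET_ARCH):

```python
# Purely illustrative sketch of the workaround above (not the change made
# in this PR): force TARGET_ARCH to "armv8" in the OpenMM 8.0.0 sources
# before running CMake, so that CMakeLists.txt defines -D__ARM64__=1 and
# vectorize_neon.h uses HWCAP_ASIMD instead of HWCAP_NEON.
from pathlib import Path

cmakelists = Path("openmm-8.0.0/CMakeLists.txt")
text = cmakelists.read_text()

# Assumption: the architecture detection call looks like this; adjust if
# the actual CMakeLists.txt differs.
detect = "target_architecture(TARGET_ARCH)"
override = 'set(TARGET_ARCH "armv8")  # Grace-Hopper gets misdetected as "arm"'

if detect in text:
    cmakelists.write_text(text.replace(detect, override, 1))
else:
    print("detection call not found; CMakeLists.txt left untouched")
```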

@ThomasHoffmann77
Contributor Author

> fwiw, I'm getting error: HWCAP_NEON was not declared in this scope when building OpenMM from this PR on NVIDIA Grace-Hopper
>
> it looks like openmm-8.0.0/cmake_modules/TargetArch.cmake is detecting Grace-Hopper as arm instead of armv8,
>
> which then leads to openmm-8.0.0/CMakeLists.txt setting -D__ARM__=1 instead of -D__ARM64__=1,
>
> which in turn leads openmm-8.0.0/openmmapi/include/openmm/internal/vectorize_neon.h to use HWCAP_NEON instead of HWCAP_ASIMD
>
> forcing TARGET_ARCH to be armv8 in openmm-8.0.0/CMakeLists.txt fixed the issue for me

#18911

@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, OpenMM v8.0.0, HH-Suite v3.3.0 w/ CUDA v12.1.1 {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, HH-Suite v3.3.0 w/ CUDA v12.1.1 Mar 4, 2024
@VRehnberg
Contributor

FYI, I've got a draft (#20421) that might become relevant for this one as well. Perhaps you'll have opinions :).

Contributor

@VRehnberg VRehnberg left a comment


Thanks for adding this.

So typical use would be(?):

  1. CPU-only job to get features (GPU not detected)
  2. Job array on GPUs to run predictions (features.pkl found and --only-model-pred="${SLURM_ARRAY_TASK_ID}")
  3. Single job with GPU to run relaxation (possibly in parallel, [How is this launched, or is it not run separately???])

@ThomasHoffmann77
Contributor Author

ThomasHoffmann77 commented May 21, 2024

> Thanks for adding this.
>
> So typical use would be(?):
>
> 1. CPU-only job to get features (GPU not detected)
> 2. Job array on GPUs to run predictions (features.pkl found and --only-model-pred="${SLURM_ARRAY_TASK_ID}")

Yes, for monomer jobs.
For multimer jobs, you need to translate the array ID to a pair X,Y, with X in [1..5] and Y in [0..4] (if you run with --num_multimer_predictions_per_model=5).
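
A minimal sketch of that translation (assuming X is the model index, Y the prediction index, and that predictions vary fastest within a model; the flag syntax the patched run script expects is not shown here):

```python
# Hypothetical helper: map a Slurm array task ID (0..24) onto the pair
# X,Y described above, assuming 5 models x 5 predictions per model
# (--num_multimer_predictions_per_model=5) and prediction-fastest order.
import os

NUM_PREDS_PER_MODEL = 5

task_id = int(os.environ["SLURM_ARRAY_TASK_ID"])    # 0..24
model = task_id // NUM_PREDS_PER_MODEL + 1          # X in [1..5]
prediction = task_id % NUM_PREDS_PER_MODEL          # Y in [0..4]

print(f"array task {task_id} -> model {model}, prediction {prediction}")
```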

> 3. Single job with GPU to run relaxation (possibly in parallel, [How is this launched, or is it not run separately???])

To get the ranking, you can run a quick CPU job after the predictions have finished.
The default for --models_to_relax is changed from best to none, so the pipeline stops after the predictions.
You can resume with the relaxation by restarting with --models_to_relax=all (or best).

@ThomasHoffmann77 ThomasHoffmann77 force-pushed the 20240220124705_new_pr_AlphaFold232 branch from 0ebfd8f to e7367ff July 24, 2024 10:04
@ThomasHoffmann77
Contributor Author

accidentally closed

@ThomasHoffmann77 ThomasHoffmann77 marked this pull request as draft October 11, 2024 06:53
@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.11, tensorstore v0.1.53, HH-Suite v3.3.0 w/ CUDA v12.1.1 {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.12, tensorstore v0.1.65, HH-Suite v3.3.0 w/ CUDA v12.1.1 Oct 11, 2024
@ThomasHoffmann77 ThomasHoffmann77 marked this pull request as ready for review October 11, 2024 14:10
@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, Kalign v3.4.0, dm-haiku v0.0.12, tensorstore v0.1.65, HH-Suite v3.3.0 w/ CUDA v12.1.1 {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, dm-haiku v0.0.12, tensorstore v0.1.65 w/ CUDA v12.1.1 Oct 11, 2024
@ThomasHoffmann77 ThomasHoffmann77 changed the title {bio}[foss/2023a,GCCcore/12.3.0] AlphaFold v2.3.2, dm-haiku v0.0.12, tensorstore v0.1.65 w/ CUDA v12.1.1 {bio}[foss/2023a] AlphaFold v2.3.2, dm-haiku v0.0.12, tensorstore v0.1.65 w/ CUDA v12.1.1 Oct 11, 2024
@boegel boegel dismissed akesandgren’s stale review October 11, 2024 18:02

requested changes done

Member

@boegel boegel left a comment


lgtm

@boegel
Member

boegel commented Oct 11, 2024

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Collaborator

@boegel: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=19942 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_19942 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 5066

Test results coming soon (I hope)...

- notification for comment with ID 2407894839 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegel
Member

boegel commented Oct 11, 2024

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
node3901.accelgor.os - Linux RHEL 8.8, x86_64, AMD EPYC 7413 24-Core Processor, 1 x NVIDIA NVIDIA A100-SXM4-80GB, 545.23.08, Python 3.6.8
See https://gist.github.com/boegel/cb450cdab5ed9c44eb1dd6e80e9541d2 for a full test report.

@boegelbot
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 4 out of 4 (3 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.4, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 555.42.06, Python 3.9.18
See https://gist.github.com/boegelbot/c87f706d4ad650a5227d1ab0da8297d1 for a full test report.

@boegel boegel modified the milestones: 4.x, release after 4.9.4 Oct 11, 2024
@boegel
Member

boegel commented Oct 11, 2024

Going in, thanks @ThomasHoffmann77!

@boegel boegel merged commit 96515d3 into easybuilders:develop Oct 11, 2024
9 checks passed