Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Resolve Whether to Install NVidia GPU operator or Nvidia Device Plugin #88

Closed
aedenj opened this issue Dec 28, 2024 · 1 comment
Closed

Comments

@aedenj
Copy link
Contributor

aedenj commented Dec 28, 2024

This remark would indicate that the gpu operator doesn't need to be installed and only the device plugin, but does that apply to the older AL2_x86_64_GPU AMI. Additionally the AWS Labs repo only installs the device plugin so that's most likely correct.

This announcement would indicate perhaps the operator is the way to go now with the newer AL2023_x86_64_NVIDIA AMI

@aedenj
Copy link
Contributor Author

aedenj commented Dec 28, 2024

This error

      Message:      failed to create containerd task: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: /usr/local/nvidia/toolkit/nvidia-container-cli.real: /lib64/libc.so.6: version `GLIBC_2.27' not found (required by /usr/local/nvidia/toolkit/libnvidia-container.so.1): unknown

is resolved with the use of the gpu-operator by using latest AL2023_x86_64_NVIDIA AMI.

@aedenj aedenj closed this as completed Dec 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant