Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cuDNN version mismatch for TensorFlow 2.5.0 #13796

Open
verdurin opened this issue Aug 23, 2021 · 1 comment
Open

cuDNN version mismatch for TensorFlow 2.5.0 #13796

verdurin opened this issue Aug 23, 2021 · 1 comment

Comments

@verdurin
Copy link
Member

According to https://www.tensorflow.org/install/gpu#software_requirements TF 2.5.0 , cuDNN 8.1 is required.
We have version 8.0.4.30 for CUDA 11.1 in fosscuda/2020b.

We have also seen some problems with our install of TF 2.5.0, which we suspect are related to this version mismatch:

2021-08-17 07:37:18.488694: F ./tensorflow/core/kernels/random_op_gpu.h:244] Non-OK-status: GpuLaunchKernel(FillPhiloxRandomKernelLaunch<Distribution>, num_blocks, block_size, 0, d.stream(), key, counter, gen, data, size, dist) status: Internal: no kernel image is available for execution on the device

cuDNN 8.1.1.33 is available for download, albeit for CUDA 11.2. Technically, it's in the part of the NVIDIA archive labelled 11.0, 11.1, 11.2 but the explicit version in the tarball is 11.2.

In summary, we would like to try TF 2.5.0 with cuDNN 8.1 in fosscuda/2020b, which is based on CUDA11.1, while the cuDNNrelease appears to be intended forCUDA 11.2`.

@Flamefire
Copy link
Contributor

no kernel image is available for execution on the device

This usually means that the GPUs compute capability is not included in the binaries. Make sure that TF is build with the correct cuda-compute-capabilities options for your GPUs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants