Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cmake_windows_gpu job fails #4440

Closed
datumbox opened this issue Sep 17, 2021 · 2 comments · Fixed by #4419
Closed

cmake_windows_gpu job fails #4440

datumbox opened this issue Sep 17, 2021 · 2 comments · Fixed by #4419

Comments

@datumbox
Copy link
Contributor

datumbox commented Sep 17, 2021

🐛 Describe the bug

The cmake_windows_gpu job on latest main started failing with:

-- Found CUDA: C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1 (found version "10.1") 
-- Caffe2: CUDA detected: 10.1
-- Caffe2: CUDA nvcc is: C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1/bin/nvcc.exe
-- Caffe2: CUDA toolkit directory: C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v10.1
CMake Error at C:/tools/miniconda3/Lib/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:42 (message):
  PyTorch requires CUDA 10.2 or above.
Call Stack (most recent call first):
  C:/tools/miniconda3/Lib/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:88 (include)
  C:/tools/miniconda3/Lib/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
  CMakeLists.txt:17 (find_package)

It seems that despite defining CUDA10.2 below, something setups 10.1 on the paths:

vision/.circleci/config.yml

Lines 1704 to 1707 in c1ddb93

- cmake_windows_gpu:
cu_version: cu102
name: cmake_windows_gpu
python_version: '3.8'

This version mismatch was probably always there but the 10.1 version check and enforcement was probably introduced on PyTorch recently because this job passed yesterday.

Versions

latest main a2b4c65

@seemethere
Copy link
Member

You need to upgrade this job to CUDA 10.2 were no longer supporting CUDA 10.1

@seemethere seemethere assigned datumbox and unassigned seemethere Sep 17, 2021
@datumbox
Copy link
Contributor Author

datumbox commented Sep 17, 2021

@seemethere My question was, shouldn't this be using already CUDA 10.2? See below:

vision/.circleci/config.yml

Lines 1704 to 1707 in c1ddb93

- cmake_windows_gpu:
cu_version: cu102
name: cmake_windows_gpu
python_version: '3.8'

I believe there is a bug on the current scripts that you use for installing the libraries. Could you please have a brief look as you are more familiar?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants