-
Notifications
You must be signed in to change notification settings - Fork 522
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
YOLO-NAS-POSE doesn't work with GPU #1886
Comments
I have tried the command you said in this comment and it didn't find that branch
|
Yes, that's because since PR was already merged, it was automatically deleted that branch. You can now install from |
I have done the following steps: pip uninstall super-gradientes Now I have super-gradients==3.6.0+master This is my code:
But I still get the same error:
|
It looks like your
But your |
The command I used to install pytorch was this one: This is the command that is on the pytorch page |
I beleive this command is meant to run on clean environment where you don't have torchvision installed as it most likely will keep existing torchvision.
Compare with what you have reported:
As you can see - a torchvision has no cuda prefix, and since nms operation is indeed implemented as C++ CUDA layer in torchvision this may cause some internal inconsistency at runtime. This is my best guess so far. |
With |
🐛 Describe the bug
I'm trying to do inference of an image with cuda but it doesn't work. With cpu it works fine.
This is my code:
This is the error I get:
I try it on Google Colab and it works fine, but on my computer it doesn't.
Versions
Collecting environment information...
PyTorch version: 2.2.0+cu118
Is debug build: False
CUDA used to build PyTorch: 11.8
ROCM used to build PyTorch: N/A
OS: Microsoft Windows 10 Enterprise LTSC
GCC version: Could not collect
Clang version: Could not collect
CMake version: Could not collect
Libc version: N/A
Python version: 3.8.0 (tags/v3.8.0:fa919fd, Oct 14 2019, 19:37:50) [MSC v.1916 64 bit (AMD64)] (64-bit runtime)
Python platform: Windows-10-10.0.17763-SP0
Is CUDA available: True
CUDA runtime version: 11.8.89
CUDA_MODULE_LOADING set to: LAZY
GPU models and configuration: GPU 0: NVIDIA GeForce GTX 1650
Nvidia driver version: 551.52
cuDNN version: C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.8\bin\cudnn_ops_train64_8.dll
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True
CPU:
Architecture=9
CurrentClockSpeed=1992
DeviceID=CPU0
Family=198
L2CacheSize=2048
L2CacheSpeed=
Manufacturer=GenuineIntel
MaxClockSpeed=1992
Name=Intel(R) Core(TM) i7-10700TE CPU @ 2.00GHz
ProcessorType=3
Revision=
Versions of relevant libraries:
[pip3] mypy-extensions==1.0.0
[pip3] numpy==1.23.0
[pip3] onnx==1.13.0
[pip3] onnx-graphsurgeon==0.3.12
[pip3] onnxruntime==1.13.1
[pip3] onnxsim==0.4.35
[pip3] pytorch-quantization==2.1.2
[pip3] torch==2.2.0+cu118
[pip3] torchaudio==2.2.0+cu118
[pip3] torchmetrics==0.8.0
[pip3] torchvision==0.17.0
[conda] Could not collect
super-gradients==3.6.0
The text was updated successfully, but these errors were encountered: