Incorrect memory usage for nvidia driver higher than R510 #141
Comments
Was it fine before you updated the driver? If so, can you please let me know which driver version you were on? This is probably due to a mismatch between the pynvml and NVIDIA driver versions. We pinned nvidia-ml-py at 11.495.46, which is not compatible with very recent drivers. Please see #107 for details. As a workaround, you can try upgrading nvidia-ml-py to the latest release (pip will complain about the incompatibility with gpustat v1.0.0).
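To make the suspected mismatch concrete, here is a minimal sketch of a compatibility check. It rests on an assumption that is not stated in this thread: that the middle component of an nvidia-ml-py version (495 in 11.495.46) names the driver branch the bindings were generated against. The helper names `component` and `bindings_older_than_driver` are hypothetical and are not part of gpustat or pynvml.

```python
def component(version: str, index: int) -> int:
    """Return one dot-separated component of a version string as an int."""
    return int(version.split(".")[index])


def bindings_older_than_driver(nvidia_ml_py_version: str, driver_version: str) -> bool:
    """Heuristic: True if the bindings target an older branch than the driver.

    Assumption: nvidia-ml-py versions look like <cuda>.<driver branch>.<patch>,
    and driver versions look like <driver branch>.<minor>[.<patch>].
    """
    return component(nvidia_ml_py_version, 1) < component(driver_version, 0)


# The pinned bindings against the driver from this issue.
print(bindings_older_than_driver("11.495.46", "515.76"))
# Bindings and driver from the same branch.
print(bindings_older_than_driver("11.510.69", "510.39.01"))
```

On a real system the two inputs could come from `importlib.metadata.version("nvidia-ml-py")` and `pynvml.nvmlSystemGetDriverVersion()` (called after `pynvml.nvmlInit()`; depending on the bindings version the result may be bytes and need decoding).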
Can you please try the following (pynvml 11.510.69+ required):
import pynvml
pynvml.nvmlInit()
handle = pynvml.nvmlDeviceGetHandleByIndex(0)
# Legacy (v1) query
print(pynvml.nvmlDeviceGetMemoryInfo(handle))
# v2 query: pass the nvmlMemory_v2 struct-version constant, not the integer 2
print(pynvml.nvmlDeviceGetMemoryInfo(handle, version=pynvml.nvmlMemory_v2))
It looks like this is related to XuehaiPan/nvitop#13.
It seems to work. nvitop patched it in PR XuehaiPan/nvitop#30.
>>> print(pynvml.nvmlDeviceGetMemoryInfo(h, version=pynvml.nvmlMemory_v2))
c_nvmlMemory_v2_t(version: 33554472 B, total: 25769803776 B, reserved: 322633728 B, free: 25447038976 B, used: 131072 B)
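The numbers in that output line up with the 308 MB reported in the issue. The short sketch below only does arithmetic on the values printed above. It assumes, based on my reading of the NVML documentation rather than anything stated in this thread, that the legacy (v1) `used` field counts driver-reserved memory as used on newer drivers. The names `to_mib` and `legacy_used` are illustrative.

```python
MIB = 1024 ** 2

# Values copied from the c_nvmlMemory_v2_t output above (all in bytes).
total = 25769803776
reserved = 322633728
free = 25447038976
used = 131072


def to_mib(n_bytes: int) -> int:
    """Convert bytes to MiB, rounded to the nearest integer."""
    return round(n_bytes / MIB)


# In the v2 struct the three parts add up exactly to the total.
assert reserved + free + used == total

# Assumption: the legacy query reports reserved + used as "used".
legacy_used = reserved + used

print(to_mib(reserved))     # 308
print(to_mib(legacy_used))  # 308
print(to_mib(used))         # 0
```

Under that assumption, the 308 MB that gpustat 1.0.0 shows on an idle GPU is the driver's reserved memory, while the v2 `used` value rounds to 0 MiB, in line with nvidia-smi reporting no memory in use.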
I hate NVIDIA breaking existing functions in new versions of drivers. For drivers higher than 510.39.01, perhaps the only way to get the correct memory usage information is to use [...]. A consequence is that a different pynvml version must be used depending on the NVIDIA driver version (this is difficult to resolve at installation and build time), and if incompatible versions are found, a proper warning message should be printed. I'm going to relax the pynvml requirement.
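One way to picture the "use the newer query when available, warn otherwise" idea is the sketch below. It is not the fix that was merged into gpustat; `query_memory` is a hypothetical helper, and the NVML module is passed in as a parameter so the fallback logic can be exercised without a GPU. The only pynvml names it relies on are the ones already shown in this thread: `nvmlDeviceGetMemoryInfo` and `nvmlMemory_v2`.

```python
import warnings


def query_memory(nvml, handle):
    """Return (total, used, free) in bytes, preferring the v2 memory query.

    `nvml` is the imported pynvml module (or any object with the same API).
    Falls back to the legacy query, with a warning, when v2 is unavailable.
    """
    v2_version = getattr(nvml, "nvmlMemory_v2", None)
    if v2_version is None:
        # Older bindings do not define the v2 struct version constant.
        warnings.warn(
            "pynvml has no nvmlMemory_v2; 'used' may include reserved memory "
            "on recent NVIDIA drivers"
        )
    else:
        try:
            mem = nvml.nvmlDeviceGetMemoryInfo(handle, version=v2_version)
            return mem.total, mem.used, mem.free
        except Exception as exc:
            # For example, an older driver that does not export the v2 function.
            warnings.warn(f"v2 memory query failed ({exc!r}); using legacy query")
    mem = nvml.nvmlDeviceGetMemoryInfo(handle)
    return mem.total, mem.used, mem.free
```

With real hardware this would be called as `query_memory(pynvml, handle)` after `pynvml.nvmlInit()` and `pynvml.nvmlDeviceGetHandleByIndex(0)`. Catching a broad `Exception` is a deliberate simplification for the sketch; real code would more likely catch `pynvml.NVMLError`.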
Fixed via #143 (comment). Released in v1.1.
When I update my NVIDIA driver to 515.76, gpustat always shows 308 MB of memory in use even when no process is using the GPUs. But nvidia-smi shows that no memory is being used, so I am very confused. Here is my environment:
ubuntu 18.04
gpustat 1.0.0
nvidia-driver 515.76
kernel 4.15.0-171-generic
And here are the outputs of nvidia-smi and gpustat:
[Screenshot: nvidia-smi output]
[Screenshot: gpustat output]