Windows doesn't support cudaMemPrefetchAsync() #453
The solution linked above works for me: check the device's capabilities for that feature before making the call, as hinted at in https://stackoverflow.com/a/43430831/950131 . The check is fast, so there may be no need to cache the answer.
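The guard from the Stack Overflow answer can be sketched as follows. This is a sketch only (the wrapper name `maybe_prefetch` is hypothetical, not part of bitsandbytes): query `cudaDevAttrConcurrentManagedAccess` and skip the prefetch when the attribute is 0, which is the case on Windows and on pre-Pascal GPUs.

```cuda
#include <cuda_runtime.h>

// Hypothetical wrapper: prefetch managed memory only if the device
// supports concurrent managed access. On Windows (and pre-Pascal GPUs)
// cudaDevAttrConcurrentManagedAccess is 0 and cudaMemPrefetchAsync()
// would fail, so the prefetch is simply skipped there.
void maybe_prefetch(const void* ptr, size_t bytes, int device,
                    cudaStream_t stream) {
    int concurrent = 0;
    cudaDeviceGetAttribute(&concurrent,
                           cudaDevAttrConcurrentManagedAccess, device);
    if (concurrent) {
        cudaMemPrefetchAsync(ptr, bytes, device, stream);
    }
    // else: no prefetch; the Unified Memory driver pages data on demand.
}
```

Skipping the prefetch is purely a performance concession; correctness is unaffected, since managed memory is still migrated on first touch.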
Does this issue only apply to QLoRA training?
Since I am unable to rebuild bitsandbytes because of the Maxwell architecture's incompatibility with the synchronization primitives, I am trying a different solution: trapping the SIGSEGV signal. It has not dumped core yet, but I am not sure what it is doing. Python seems to be running, but I haven't seen any activity on the GPUs for about an hour either. I am running on Ubuntu 20.04, not Windows, so this may be a wider issue than just the OS.
…Fixes artidoro/qlora#73 and bitsandbytes-foundation#453 (cherry picked from commit e02f078)
This is a duplicate of #477; please redirect all discussion there. TL;DR: I need to think about whether I will support Maxwell or not. There might be a workaround for Maxwell support by excluding Paged Optimizers.
This has been fixed and pushed to pip. Memory problems might remain, but these are Windows-specific and there is nothing I can do about that. Thank you for the fix @stoperro, this was an important bugfix.
…d to run on Windows (#13957) [Windows doesn't support cudaMemPrefetchAsync()](bitsandbytes-foundation/bitsandbytes#453) which is used in the call to `prefetch` in the test. [urEnqueueUSMPrefetch](https://github.com/oneapi-src/unified-runtime/blob/c0c607c3a88933b4c5c20a0aca4539781c678411/source/adapters/cuda/enqueue.cpp#L1629) is also commented with a note for not having the support for CUDA on Windows.
Also, memory oversubscription is not supported on Windows (https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#system-requirements), which I presume means the paged optimizer that absorbs memory spikes won't work on Windows.
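The oversubscription limitation can be probed with a small sketch like the one below (an illustration only, not part of bitsandbytes): request more managed memory than the GPU physically has. On Linux with a Pascal-or-newer GPU the Unified Memory driver can page the allocation between host and device, so it is expected to succeed; on Windows, where oversubscription is unsupported, the allocation is expected to fail.

```cuda
#include <cuda_runtime.h>
#include <cstdio>

int main() {
    size_t free_b = 0, total_b = 0;
    cudaMemGetInfo(&free_b, &total_b);

    // Ask for more managed memory than the device physically has
    // (total device memory plus 1 GiB). On Linux the Unified Memory
    // driver can oversubscribe by paging to host RAM; on Windows this
    // is expected to fail, since oversubscription is unsupported there.
    void* p = nullptr;
    cudaError_t err = cudaMallocManaged(&p, total_b + (1ull << 30));
    printf("cudaMallocManaged: %s\n", cudaGetErrorString(err));
    if (err == cudaSuccess) cudaFree(p);
    return 0;
}
```

This is why a paged optimizer, which relies on spilling optimizer state to host memory during spikes, cannot work on Windows even when the rest of the library does.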
This results in the below error in QLoRA training:
(Note: the above was not caused by an old transformers version.)