64-bit indexing Adam #1765

eqy · 2023-12-26T22:31:27Z

I think the tests pass without the changes to multi_tensor_apply.cuh, but leaving it as-is makes me a bit nervous...

TODOs: graph-capturable Adam, and all other optimizers if people really need 64-bit indexing there...

crcrpar · 2023-12-30T17:00:04Z

csrc/multi_tensor_apply.cuh

@@ -85,9 +85,9 @@ void multi_tensor_apply(
      tl.addresses[d][loc_tensor_info] = tensor_lists[d][t].data_ptr();
    loc_tensor_info++;

-    int chunks_this_tensor = (tensor_lists[0][t].numel() + chunk_size - 1)/chunk_size;
+    auto chunks_this_tensor = (tensor_lists[0][t].numel() + chunk_size - 1)/chunk_size;


Would `chunks_this_tensor` be deduced as `int64_t` here, given that `chunk_size` is `int64_t`?
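The deduction asked about above can be checked in isolation. The following is a minimal standalone sketch, not code from the PR: it assumes `numel()` returns `int64_t` (as `at::Tensor::numel()` does) and that `chunk_size` is `int64_t`, as the comment states. The helper names `chunks_for` and `overflows_int32` are hypothetical.

```cpp
#include <cstdint>
#include <limits>
#include <type_traits>

// Ceiling division in the same form as the diff. With both operands
// int64_t, `auto` deduces int64_t for the result.
// (Hypothetical helper; in the PR this is an inline expression.)
inline auto chunks_for(int64_t numel, int64_t chunk_size) {
  return (numel + chunk_size - 1) / chunk_size;
}

// Compile-time check of the deduced type.
static_assert(std::is_same<decltype(chunks_for(0, 1)), int64_t>::value,
              "auto deduces int64_t from int64_t operands");

// True when a chunk count would not fit in a 32-bit `int`, i.e. when the
// old `int chunks_this_tensor = ...` would have narrowed the value.
inline bool overflows_int32(int64_t chunks) {
  return chunks > std::numeric_limits<int32_t>::max();
}
```

Because `numel()` is already 64-bit, the usual arithmetic conversions would give an `int64_t` result even if `chunk_size` were a plain `int`; what the `auto` changes is that the result is no longer narrowed when stored.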

## The Issue

Applying `FusedAdam` on large tensors will cause an error `CUDA error: an illegal memory access was encountered`.

#3429 NVIDIA/apex#1654

## PR Content

Following the solution in the apex repository (NVIDIA/apex#1765), changing indexing type to `int64` if necessary.

---------

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
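One way to read "changing indexing type to `int64` if necessary" is as a host-side check that selects the wider index type only when a tensor is too large for a 32-bit signed index. The sketch below illustrates that idea under stated assumptions: the helper name `needs_64bit_indexing` and the `INT32_MAX` threshold are assumptions for illustration, not taken from the apex or DeepSpeed sources.

```cpp
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical helper: returns true when any tensor has more elements than
// a 32-bit signed index can address. The threshold is an assumption.
inline bool needs_64bit_indexing(const std::vector<int64_t>& numels) {
  for (int64_t n : numels) {
    if (n > std::numeric_limits<int32_t>::max()) {
      return true;
    }
  }
  return false;
}
```

A caller could use the result to choose between a kernel instantiated with a 32-bit index type and one instantiated with a 64-bit index type, so that smaller tensors keep the 32-bit path.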

eqy added 3 commits December 26, 2023 22:29

all i want for christmas is larger binaries and longer compile times

8fa43bb

actually compare

f2bc833

woops

05751cb

crcrpar approved these changes Jan 4, 2024

crcrpar merged commit 87c4deb into NVIDIA:master Jan 5, 2024

garrett4wade mentioned this pull request Feb 24, 2024

64bit indexing fused adam microsoft/DeepSpeed#5187

Merged
