
Apply device context when converting to torch tensors #135

Merged — 1 commit merged into NVIDIA-Merlin:main on Apr 17, 2023

Conversation

@edknv (Contributor) commented Apr 17, 2023

Similar to #132, but this PR applies the torch.cuda.device context only when converting numpy/cupy arrays to torch tensors. Unlike #132, it doesn't use cupy.cuda.Device, which did not work with TensorFlow, as discovered in the Models multi-GPU tests.

Note: In addition to the fix in this PR, users must set cupy.cuda.Device() manually. See here for details. Follow-ups: enable setting the device in Core/Dataset, and add 2-GPU unit tests in the dataloader.
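The technique the PR describes can be sketched as follows. This is a minimal illustration, not the PR's actual code: the helper name `to_torch_tensor` and the `device_id` parameter are hypothetical, and the CPU fallback via `contextlib.nullcontext` is added here so the sketch runs on machines without CUDA.

```python
import contextlib

import numpy as np
import torch


def to_torch_tensor(array, device_id=None):
    """Convert a numpy/cupy array to a torch tensor, entering the CUDA
    device context only for the duration of the conversion.

    Hypothetical helper illustrating the PR's approach: a scoped
    torch.cuda.device context (rather than a global cupy.cuda.Device)
    ensures the resulting tensor lands on the intended GPU.
    """
    if device_id is not None and torch.cuda.is_available():
        # Scoped: the device context applies only inside the `with` block.
        ctx = torch.cuda.device(device_id)
    else:
        # CPU fallback so the sketch also runs without a GPU.
        ctx = contextlib.nullcontext()
    with ctx:
        # torch.as_tensor accepts numpy arrays directly and cupy arrays
        # via the __cuda_array_interface__ protocol.
        return torch.as_tensor(array)


t = to_torch_tensor(np.arange(4, dtype=np.float32))
```

Keeping the device context scoped to the conversion is what distinguishes this fix from #132: nothing outside the `with` block (e.g. TensorFlow's own device management) is affected.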

@edknv edknv force-pushed the multi_gpu_cupy_device branch from 147655a to 6f02abd Compare April 17, 2023 04:09
@edknv edknv self-assigned this Apr 17, 2023
@edknv edknv added the bug Something isn't working label Apr 17, 2023
@edknv edknv added this to the Merlin 23.04 milestone Apr 17, 2023
@edknv edknv marked this pull request as ready for review April 17, 2023 16:12
@edknv edknv merged commit d83285e into NVIDIA-Merlin:main Apr 17, 2023
@edknv edknv deleted the multi_gpu_cupy_device branch April 17, 2023 19:24