
TEST/CUDA: Test CUDA Memcpy with various contexts #10531

Open · wants to merge 8 commits into master from cuda_ctx_memcpy
Conversation

@tvegas1 tvegas1 commented Mar 4, 2025

What?

Exercise CUDA copies with various combinations of parameters (device/context/stream/memcpy flavor).
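The combination matrix exercised by such a test can be pictured as nested loops. This is only a sketch; `num_ctx`, `num_streams`, `run_memcpy_variants`, and the `context`/`stream` arrays are hypothetical names, not the PR's actual code:

```c
/* Sketch: for every destination context, every source context, and
 * every stream, issue each memcpy flavor and verify the result. */
for (j = 0; j < num_ctx; j++) {              /* context owning dst buffer */
    for (k = 0; k < num_ctx; k++) {          /* context owning src buffer */
        for (l = 0; l < num_streams; l++) {  /* stream to enqueue on */
            run_memcpy_variants(context[j].mem, context[k].mem,
                                stream[l]);  /* sync/async/managed copies */
        }
    }
}
```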

@tvegas1 tvegas1 requested review from rakhmets and brminich March 4, 2025 18:41
@tvegas1 tvegas1 force-pushed the cuda_ctx_memcpy branch 2 times, most recently from 837f312 to 700702c Compare March 5, 2025 07:13
printf("Using %d CUDA device(s)\n", device_count);

for (i = 0; i < MAX_THREADS; i++) {
    thread[i].tid = i;
Contributor:

alignment

Contributor Author:

fixed



/* Context and associated resources */
struct context {
Contributor:

maybe use typedef for both structs

Contributor Author:

fixed

    void *mem_managed;
};

struct context context[MAX_CTX * MAX_THREADS * MAX_DEV];
Contributor:

maybe better to have an independent context per thread?

Contributor Author:

they share the content


ptr_a = context[j].mem;
ptr_b = context[k].mem;
ptr_a_managed = context[j].mem_managed;
Contributor:

alignment

Contributor Author:

which one?

#include <unistd.h>


#define MAX_THREADS 2
Contributor:

should we pass these from cmd line?

Contributor Author:

As it is, the scale is already huge, so I will add command-line parameters if we need them?

for (k = 0; k < MAX_CTX * MAX_THREADS * count; k++) {
    /* each stream (every thread every device) */
    for (l = 0; l < MAX_CTX * MAX_THREADS * count; l++) {
        CHECK_D(cuCtxSetCurrent(context[i].ctx));
Contributor:

There is no need to set a CUDA context here, even when using the CUDA Driver API.
A context is only required for stream creation with the Driver API.

Contributor Author:

The intent was to confirm that we do not care which context is current, even if it differs from the stream's context.

Contributor:

The context is required only for stream creation. Memory copies can be executed even without setting any context.

Contributor Author:

removed setting the context
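The property discussed above can be sketched as follows. This is a minimal illustration, not the PR's code: `ctx_a`/`ctx_b` are hypothetical contexts and `CHECK_D` is assumed to be the test's error-checking macro. The stream is created under one context, yet the copy is enqueued while a different context is current; the Driver API resolves the stream's owning context:

```c
/* Stream is created under ctx_a ... */
CUstream stream;
CUdeviceptr src, dst;

CHECK_D(cuCtxSetCurrent(ctx_a));
CHECK_D(cuStreamCreate(&stream, CU_STREAM_NON_BLOCKING));
CHECK_D(cuMemAlloc(&src, size));
CHECK_D(cuMemAlloc(&dst, size));

/* ... but the copy is issued while ctx_b is current: the copy still
 * executes on the stream's context, so no cuCtxSetCurrent is needed
 * before the memcpy itself. */
CHECK_D(cuCtxSetCurrent(ctx_b));
CHECK_D(cuMemcpyAsync(dst, src, size, stream));
CHECK_D(cuStreamSynchronize(stream));
```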

Comment on lines +118 to +121
test_cuda_ctx_memcpy_DEPBASE = $(DEPDIR)/test_cuda_ctx_memcpy
test_cuda_ctx_memcpy_COMPILE = \
        $(NVCC) $(DEFS) $(DEFAULT_INCLUDES) $(INCLUDES) \
        $(test_cuda_ctx_memcpy_CPPFLAGS)
Contributor:

You can make your file a .c file and remove this.

Contributor Author:

fixed


for (j = 1; j < MAX_CTX; j++) {
    CHECK_D(cuCtxCreate(&context[index + j].ctx, 0, i));
    CHECK_D(cuCtxSetCurrent(context[index + j].ctx));
Contributor:

The line can be removed since cuCtxCreate pushes the newly created context.

Contributor Author:

fixed
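The point being fixed can be sketched as follows (a minimal illustration, assuming a `CHECK_D` error-checking macro and an already-obtained `device` handle): `cuCtxCreate` pushes the newly created context onto the calling thread's context stack, so it is already current and the extra `cuCtxSetCurrent` is redundant.

```c
CUcontext ctx;

/* cuCtxCreate both creates the context and pushes it onto the calling
 * thread's context stack, making it current. */
CHECK_D(cuCtxCreate(&ctx, 0, device));

/* Sanity check: the freshly created context is already current. */
CUcontext cur;
CHECK_D(cuCtxGetCurrent(&cur));
assert(cur == ctx);
```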

CUcontext ctx;

for (i = 0; i < count; i++) {
    CHECK(cudaSetDevice(i));
Contributor:

This call retains the device's primary context and sets it as the current context.
Maybe replace it with cuDeviceGet(&device, i), and pass device to cuCtxCreate.

Contributor Author:

fixed
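The suggested replacement can be sketched as follows (a pure Driver API illustration, assuming the test's `CHECK_D` macro and a device ordinal `i`): instead of `cudaSetDevice`, which retains the device's primary context as a side effect, obtain a `CUdevice` handle and create an explicit context on it.

```c
CUdevice  device;
CUcontext ctx;

/* Get a handle for device ordinal i without touching the primary
 * context, then create an explicit context on that device. */
CHECK_D(cuDeviceGet(&device, i));
CHECK_D(cuCtxCreate(&ctx, 0, device));
```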

Labels: none yet · Projects: none yet · 3 participants