Cache CK convolution kernel objects as shared_ptr in Invoker lambda #2336

amberhassaan · 2023-08-24T16:27:28Z

CK has a DeviceOpFactory::GetInstances() API used by MIOpen Solvers that returns a vector<unique_ptr<T>, where T is an abstraction for a kernel. unique_ptr<T> makes it uncacheable inside MIOpen's Invoker lambda thus preventing reuse and forcing multiple calls to GetInstances() especially at the time of invoking a kernel.

I will put up a PR that promotes unique_ptr<T> to shared_ptr<T> thus allowing us to cacheable inside MIOpen's Invoker.

The text was updated successfully, but these errors were encountered:

atamazov · 2023-08-25T16:45:59Z

[Notice] IIUC this ticket is related to all CK solvers where this optimization is possible (see #2305 (comment))

amberhassaan self-assigned this Aug 24, 2023

amberhassaan added the performance label Aug 24, 2023

amberhassaan mentioned this issue Aug 25, 2023

Post-merge fixups: Replace environment variable check with problem config check and reduce lambda capture for Invoker obj #2305

Merged

atamazov mentioned this issue Aug 25, 2023

Post-merge issues of #2125 #2301

Closed

2 tasks

CAHEK7 mentioned this issue Sep 6, 2023

Ck kernels invocation refactoring #2379

Merged

CAHEK7 linked a pull request Sep 8, 2023 that will close this issue

Ck kernels invocation refactoring #2379

Merged

junliume closed this as completed in #2379 Sep 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache CK convolution kernel objects as shared_ptr in Invoker lambda #2336

Cache CK convolution kernel objects as shared_ptr in Invoker lambda #2336

amberhassaan commented Aug 24, 2023

atamazov commented Aug 25, 2023 •

edited

Loading

Cache CK convolution kernel objects as shared_ptr in Invoker lambda #2336

Cache CK convolution kernel objects as shared_ptr in Invoker lambda #2336

Comments

amberhassaan commented Aug 24, 2023

atamazov commented Aug 25, 2023 • edited Loading

atamazov commented Aug 25, 2023 •

edited

Loading