[topi][CuDNN] Removed requirement for GPU from topi conv2d_cudnn.cuda and conv3d_cudnn.cuda #8276

Lunderberg · 2021-06-17T18:09:18Z

Previously, conv2d_cudnn.cuda would use cudnn's benchmarking function to select a forward convolution when cfg.is_fallback, and conv3d_cudnn.cuda would use cudnn's benchmarking at all times. After this commit, both expose the cudnn algorithm choice as an option. If cfg.is_fallback, the local device will be benchmarked if present, otherwise will select a default cudnn implementation.

In the future, to better support RPC use-cases, the fallback config should be based on cudnn-specific parameters saved in the Target object.

Lunderberg · 2021-06-17T18:11:51Z

Related PR #8275 is for the same goal of allowing CuDNN modules to be build on a local non-GPU machine for use on a remote GPU machine. The two implementations are independent, and are separate PRs for reviewing purposes.

Potential reviewer: @mdw-octoml

… and conv3d_cudnn.cuda Previously, `conv2d_cudnn.cuda` would use cudnn's benchmarking function to select a forward convolution when `cfg.is_fallback`, and `conv3d_cudnn.cuda` would use cudnn's benchmarking at all times. After this commit, both expose the cudnn algorithm choice as an option. If `cfg.is_fallback`, the local device will be benchmarked if present, otherwise will select a default cudnn implementation. In the future, to better support RPC use-cases, the fallback config should be based on cudnn-specific parameters saved in the Target object.

jwfromm

Really nice change, thanks Eric!

… and conv3d_cudnn.cuda (apache#8276) Previously, `conv2d_cudnn.cuda` would use cudnn's benchmarking function to select a forward convolution when `cfg.is_fallback`, and `conv3d_cudnn.cuda` would use cudnn's benchmarking at all times. After this commit, both expose the cudnn algorithm choice as an option. If `cfg.is_fallback`, the local device will be benchmarked if present, otherwise will select a default cudnn implementation. In the future, to better support RPC use-cases, the fallback config should be based on cudnn-specific parameters saved in the Target object. Co-authored-by: Eric Lunderberg <elunderberg@octoml.ai>

Lunderberg mentioned this pull request Jun 17, 2021

[CuDNN] Remove GPU dependency from tvm.contrib.cudnn.conv_output_shape #8275

Merged

Lunderberg force-pushed the cudnn_conv_find_algo branch from 0cdeef6 to f7fa507 Compare June 17, 2021 18:13

jwfromm approved these changes Jun 17, 2021

View reviewed changes

masahi merged commit bf3f000 into apache:main Jun 18, 2021

Lunderberg deleted the cudnn_conv_find_algo branch June 18, 2021 12:28

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[topi][CuDNN] Removed requirement for GPU from topi conv2d_cudnn.cuda and conv3d_cudnn.cuda #8276

[topi][CuDNN] Removed requirement for GPU from topi conv2d_cudnn.cuda and conv3d_cudnn.cuda #8276

Lunderberg commented Jun 17, 2021

Lunderberg commented Jun 17, 2021

jwfromm left a comment

[topi][CuDNN] Removed requirement for GPU from topi conv2d_cudnn.cuda and conv3d_cudnn.cuda #8276

[topi][CuDNN] Removed requirement for GPU from topi conv2d_cudnn.cuda and conv3d_cudnn.cuda #8276

Conversation

Lunderberg commented Jun 17, 2021

Lunderberg commented Jun 17, 2021

jwfromm left a comment

Choose a reason for hiding this comment