[fix] change conv default algo and set with corresponding correct index
IctHu committed Oct 30, 2024
1 parent 76ed936 commit e92bca2
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions src/ppl/nn/engines/cuda/optimizer/algos/algo_conv_hmma.cc
@@ -80,8 +80,8 @@ double TuringHMMAImpgemm::ExcuteTimer(const ir::Node* node, OptKernelOptions& op
 return 0.0f;
 } else { // Give the default kernel
 #ifdef PPLNN_CUDA_ENABLE_KERNEL_CUT
-attr_param_.extra_param.algo_info.algo_name = "nvSwzlSm75Fp16Conv_hmma1688_nhwc_fn_b32x256_w32x64_k8_buf2";
-attr_param_.extra_param.algo_info.kid = 685;
+attr_param_.extra_param.algo_info.algo_name = "nv2spkSm75Fp16Conv_hmma1688_nhwc_fn_b64x128_w64x32_k8_s8_buf1";
+attr_param_.extra_param.algo_info.kid = 210;
 #else
 attr_param_.extra_param.algo_info.algo_name = "nvSwzlSm75Fp16Conv_hmma1688_nhwc_fn_b128x64_w64x32_k64_buf2";
 attr_param_.extra_param.algo_info.kid = 5197;