
[NVIDIA] Set proper math type for convolution #702

Merged

Conversation

apavliuk55
Contributor

Details:

  • Sets proper convolution math type according to the math type of the selected algo
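The change described above maps onto cuDNN's v7 algorithm-selection API, where each candidate algorithm reports the math type it was evaluated with. A minimal sketch of propagating that reported math type onto the convolution descriptor (an illustration of the approach, not the PR's actual code; the helper name is hypothetical) could look like:

```cpp
// Hedged sketch: after choosing a forward convolution algorithm via the
// cuDNN v7 heuristics, set the convolution descriptor's math type to the
// math type reported for the selected algorithm, rather than a hardcoded value.
#include <cudnn.h>

void setMathTypeForSelectedAlgo(cudnnHandle_t handle,
                                cudnnTensorDescriptor_t xDesc,
                                cudnnFilterDescriptor_t wDesc,
                                cudnnConvolutionDescriptor_t convDesc,
                                cudnnTensorDescriptor_t yDesc) {
    cudnnConvolutionFwdAlgoPerf_t perf{};
    int returnedAlgoCount = 0;
    // perf.mathType tells us which math mode (e.g. CUDNN_TENSOR_OP_MATH)
    // the heuristic associated with the chosen algorithm.
    cudnnGetConvolutionForwardAlgorithm_v7(handle, xDesc, wDesc, convDesc,
                                           yDesc, /*requestedAlgoCount=*/1,
                                           &returnedAlgoCount, &perf);
    if (returnedAlgoCount > 0) {
        cudnnSetConvolutionMathType(convDesc, perf.mathType);
    }
}
```

This keeps the descriptor consistent with how the algorithm was actually benchmarked by cuDNN.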

@apavliuk55 apavliuk55 requested a review from a team as a code owner August 8, 2023 13:19
@github-actions github-actions bot added the category: NVIDIA plugin OpenVINO NVIDIA plugin label Aug 8, 2023
@apavliuk55 apavliuk55 changed the title [NVIDIA] Set proper math type for convloution [NVIDIA] Set proper math type for convoloution Aug 8, 2023
@apavliuk55 apavliuk55 changed the title [NVIDIA] Set proper math type for convoloution [NVIDIA] Set proper math type for convolution Aug 8, 2023
@nkogteva
Contributor

nkogteva commented Aug 8, 2023

And what about the hardcoded value CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION, which you tried to change previously? Should we avoid conversion if the user specifies inference precision == f32?

@apavliuk55
Contributor Author

> And what about the hardcoded value CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION, which you tried to change previously? Should we avoid conversion if the user specifies inference precision == f32?

We tried the CUDNN_TENSOR_OP_MATH value, but it didn't solve the precision issue for some tests with FP32 precision.
So we kept the previous value; it won't affect FP16 precision.
For FP32 precision, I think we should leave it as is, since there were no FP32 accuracy problems with this setting.

@nkogteva
Contributor

nkogteva commented Aug 9, 2023

> And what about the hardcoded value CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION, which you tried to change previously? Should we avoid conversion if the user specifies inference precision == f32?

> We tried the CUDNN_TENSOR_OP_MATH value, but it didn't solve the precision issue for some tests with FP32 precision. So we kept the previous value; it won't affect FP16 precision. For FP32 precision, I think we should leave it as is, since there were no FP32 accuracy problems with this setting.

It's not just about tests. If the user has explicitly specified f32 inference precision, we should avoid converting to lower precision, shouldn't we?

@apavliuk55 apavliuk55 force-pushed the fix/proper-convolution-math-type branch from 6a5af66 to 8d74365 Compare August 9, 2023 12:57
@apavliuk55
Contributor Author

> And what about the hardcoded value CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION, which you tried to change previously? Should we avoid conversion if the user specifies inference precision == f32?

> We tried the CUDNN_TENSOR_OP_MATH value, but it didn't solve the precision issue for some tests with FP32 precision. So we kept the previous value; it won't affect FP16 precision. For FP32 precision, I think we should leave it as is, since there were no FP32 accuracy problems with this setting.

> It's not just about tests. If the user has explicitly specified f32 inference precision, we should avoid converting to lower precision, shouldn't we?

Changed to CUDNN_TENSOR_OP_MATH
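The precision policy settled on in this thread can be sketched as follows (a hedged illustration of the discussed behavior, not the PR's exact code; the helper name is hypothetical). With CUDNN_TENSOR_OP_MATH, Tensor Cores may still be used, but cuDNN will not silently down-convert FP32 data, which is what CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION permits:

```cpp
// Hedged sketch of the precision policy discussed above: when the user
// requests f32 inference precision, avoid CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION
// so cuDNN cannot implicitly convert f32 tensors to lower precision.
#include <cudnn.h>

cudnnMathType_t mathTypeForPrecision(cudnnDataType_t inferencePrecision) {
    if (inferencePrecision == CUDNN_DATA_FLOAT) {
        // Honor the explicit f32 request: no implicit down-conversion.
        return CUDNN_TENSOR_OP_MATH;
    }
    // For reduced precisions (e.g. FP16), allowing conversion is safe
    // and lets cuDNN pick the fastest Tensor Core path.
    return CUDNN_TENSOR_OP_MATH_ALLOW_CONVERSION;
}
```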

@nkogteva nkogteva merged commit 274c876 into openvinotoolkit:master Aug 10, 2023
6 checks passed