[TENSORRT] Improvements and fixes for TensorRT #11203

mbaret · 2022-05-03T19:05:58Z

A number of small fixes and refactors to improve the robustness of the TensorRT integration.

Co-authored-by: Mark Shields mbs@octoml.ai

A number of small fixes and refactors to improve the robustness of the TensorRT integration. Co-authored-by: Mark Shields <mbs@octoml.ai>

mbaret · 2022-05-04T13:40:28Z

cc @mbs-octoml @mikepapadim

mbs-octoml · 2022-05-04T14:09:29Z

The background on the bits I fiddled with:

Finish the transition from operator predicate based to pattern based for the partition_for_tensorrt function. The current state in main is broken.
Remove choice of operator vs predicate method since they are indistinguishable from the outside.
Though most operators can be translated looking only at the operator call, nn.batch_norm can only be translated in the form of a sub-graph nn.batch_norm(...).0. So switch the translation to follow the standard composite style.

mbs-octoml

LGTM, thank you!

I wonder if we should warn for the conv3d attributes you found trigger numerical inaccuracy? Say in the predicate?

mbaret · 2022-05-04T14:16:55Z

Unfortunately, the attributes aren't particularly well-defined. For example, in that specific test it was failing with channels between 28 and 45. I think so long as TensorRT is selecting strategies with different numerical behaviour based on tuning we won't be able to give a sane warning here.

mbs-octoml · 2022-05-04T18:10:42Z

Ack. We do what we can.

mikepapadim · 2022-05-05T08:51:37Z

LGTM, thanks

jwfromm

Thanks for these improvements @mbaret, @mikepapadim, and @mbs-octoml!

A number of small fixes and refactors to improve the robustness of the TensorRT integration. Co-authored-by: Mark Shields <mbs@octoml.ai> Co-authored-by: Mark Shields <mbs@octoml.ai>

tiandiao123 · 2022-06-09T21:01:24Z

nice job!

[TENSORRT] Improvements and fixes for TensorRT

1c05fc8

A number of small fixes and refactors to improve the robustness of the TensorRT integration. Co-authored-by: Mark Shields <mbs@octoml.ai>

mbs-octoml approved these changes May 4, 2022

View reviewed changes

mikepapadim approved these changes May 5, 2022

View reviewed changes

jwfromm approved these changes May 10, 2022

View reviewed changes

jwfromm merged commit be2ae94 into apache:main May 10, 2022

mbs-octoml mentioned this pull request Jun 27, 2022

[BYOC] InlineCompilerFunctions helper pass #11923

Merged

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

Lunderberg mentioned this pull request Nov 16, 2022

[Tracking][Contrib] Known failing unit tests #8901

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TENSORRT] Improvements and fixes for TensorRT #11203

[TENSORRT] Improvements and fixes for TensorRT #11203

mbaret commented May 3, 2022

mbaret commented May 4, 2022

mbs-octoml commented May 4, 2022

mbs-octoml left a comment

mbaret commented May 4, 2022

mbs-octoml commented May 4, 2022

mikepapadim commented May 5, 2022

jwfromm left a comment

tiandiao123 commented Jun 9, 2022

[TENSORRT] Improvements and fixes for TensorRT #11203

[TENSORRT] Improvements and fixes for TensorRT #11203

Conversation

mbaret commented May 3, 2022

mbaret commented May 4, 2022

mbs-octoml commented May 4, 2022

mbs-octoml left a comment

Choose a reason for hiding this comment

mbaret commented May 4, 2022

mbs-octoml commented May 4, 2022

mikepapadim commented May 5, 2022

jwfromm left a comment

Choose a reason for hiding this comment

tiandiao123 commented Jun 9, 2022