TRTorch v0.3.0 #466
narendasan
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
TRTorch v0.3.0
Support for PyTorch 1.8.x (by default 1.8.1), Introducing Plugin Library, PTQ from Python, Arbitrary TRT engine embedding, Preview Release of Partial Compilation, New Converters, Bug Fixes
This is the third beta release of TRTorch, targeting PyTorch 1.8.x, CUDA 11.1 (on x86_64), TensorRT 7.2, cuDNN 8. TRTorch 0.3.0 binary releases target PyTorch 1.8.1 specifically, these builds are not compatible with 1.8.0, though the source code remains compatible with any PyTorch 1.8.x version. On aarch64 TRTorch targets JetPack 4.5.x. This release introduces
libtrtorch_plugins.so
. This library is a portable distribution of all TensorRT plugins used in TRTorch. The intended usecase is to support TRTorch programs that utilize TensorRT plugins deployed on systems with only the runtime library available or in the case that TRTorch was used to create a TensorRT engine to be run outside the TRTorch runtime, which makes uses of TRTorch plugins. An example on how to use this library can be found here: https://github.com/NVIDIA/TRTorch/tree/v0.3.0/notebooks/examples/sample_rt_app. TRTorch 0.3.0 also now allows users to repurpose PyTorch Dataloaders to do post training quantization in Python similar to the workflow supported in C++ currently. It also introduces a new API to wrap arbitrary TensorRT engines in a PyTorch Module wrapper, making the serializable bytorch.jit.save
and completely compatible with other PyTorch modules. Finally, TRTorch 0.3.0 also includes a preview of the new partial compilation capability of the TRTorch compiler. With this feature, users can now instruct TRTorch to keep operations that are not supported but TRTorch/TensorRT in PyTorch. Partial compilation should be considered alpha stability and we are seeking feedback on bugs, pain points and feature requests surrounding using this feature.Dependencies:
0.3.0 (2021-05-13)
Bug Fixes
//plugins: Readding cuBLAS BUILD to allow linking of libnvinfer_plugin on Jetson (a8008f4)
//tests/../concat: Concat test fix (2432fb8)
//tests/core/partitioning: Fixing some issues with the partition (ff89059)
erase the repetitive nodes in dependency analysis (80b1038)
fix a typo for debug (c823ebd)
fix typo bug (e491bb5)
aten::linear: Fixes new issues in 1.8 that cause script based (c5057f8)
register the torch_fallback attribute in Python API (8b7919f)
support expand/repeat with IValue type input (a4882c6)
support shape inference for add_, support non-tensor arguments for segmented graphs (46950bb)
feat!: Updating versions of CUDA, cuDNN, TensorRT and PyTorch (71c4dcb)
feat(WORKSPACE)!: Updating PyTorch version to 1.8.1 (c9aa99a)
Features
BREAKING CHANGES
Default CUDA version is CUDA 11.1
TensorRT version is TensorRT 7.2.3.4
cuDNN version is now cuDNN 8.1
Signed-off-by: Naren Dasan naren@narendasan.com
Signed-off-by: Naren Dasan narens@nvidia.com
and 1.8.1 in the Torch Python API, TRTorch 0.3.0 compiled for 1.8.0 does not
work with PyTorch 1.8.1 and will show an error about use_input_stats.
If you see this error make sure the version of libtorch you are
compiling with is PyTorch 1.8.1
TRTorch 0.3.0 will target PyTorch 1.8.1. There is no backwards
compatability with 1.8.0. If you need this specific version compile from
source with the dependencies in WORKSPACE changed
Signed-off-by: Naren Dasan naren@narendasan.com
Signed-off-by: Naren Dasan narens@nvidia.com
Supported Operators in TRTorch v0.3.0
Operators Currently Supported Through Converters
Operators Currently Supported Through Evaluators
This discussion was created from the release TRTorch v0.3.0.
Beta Was this translation helpful? Give feedback.
All reactions