🐛 [Bug] Error while loading Torch-TensorRT model (torch.jit.load) #973

Closed
pauline6 opened this issue Apr 12, 2022 · 9 comments · Fixed by #1148
Labels: bug (Something isn't working) · component: core (Issues re: The core compiler)

Comments

Bug Description

The model below is converted to a Torch-TensorRT module, with the sub_function module excluded from the conversion. When the saved module is loaded back with torch.jit.load, the following error is raised.

Traceback (most recent call last):
    model = torch.jit.load('model_trt.ts')
  File "/usr/local/lib/python3.8/dist-packages/torch/jit/_serialization.py", line 161, in load
    cpp_module = torch._C.import_ir_module(cu, str(f), map_location, _extra_files)
RuntimeError: expected ) but found 'number' here:
Serialized   File "code/__torch__.py", line 6
  __torch___function_trt_engine_0x77ea7da0 : __torch__.torch.classes.tensorrt.Engine
  def forward(self_1: __torch__.function_trt,
    x.1: Tensor,
     ~~ <--- HERE
    kernel.1: Tensor) -> Tensor:
    __torch___function_trt_engine_0x77ea7da0 = self_1.__torch___function_trt_engine_0x77ea7da0
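Note that the failing line in the serialized source declares a forward argument named x.1, which is not a valid Python identifier, so the parser stops at the digit. For anyone debugging a similar failure: a .ts file saved by torch.jit.save is an ordinary zip archive, so the rejected source (code/__torch__.py in the traceback) can be inspected directly. A minimal sketch:

import zipfile

# A .ts file produced by torch.jit.save is a zip archive; the serialized
# source that the parser rejects lives under its code/ directory.
with zipfile.ZipFile('model_trt.ts') as archive:
    for name in archive.namelist():
        if name.endswith('__torch__.py'):
            print(archive.read(name).decode())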

To Reproduce

import torch
import torch_tensorrt
import torch.nn as nn
import torch.nn.functional as F

class function(nn.Module):
    def __init__(self):
        super(function, self).__init__()
        self.conv_kernel = nn.Sequential(
            nn.Conv2d(256, 256, 3, bias=False),
            nn.BatchNorm2d(256),
        )
        self.sub_function = sub_function()

    def forward(self, x, kernel):
        # type: (Tensor, Tensor) -> Tensor
        kernel = self.conv_kernel(kernel)
        x = x.view(1, 256, x.size(2), x.size(3))
        kernel = kernel.view(256, 1, kernel.size(2), kernel.size(3))
        out = self.sub_function(x, kernel)
        return out

class sub_function(nn.Module):
    def __init__(self):
        super(sub_function, self).__init__()

    def forward(self, x, kernel):
        # type: (Tensor, Tensor) -> Tensor
        out = F.conv2d(x, kernel, groups=256)
        return out


model = function()
model_script = torch.jit.script(model)
model_script.cuda().eval()

compile_settings = {
            "inputs": [
                torch_tensorrt.Input([1, 256, 29, 29], dtype=torch.float32),
                torch_tensorrt.Input([1, 256, 7, 7], dtype=torch.float32),
            ],
            "enabled_precisions": {torch.float32}
        }

model_trt = torch_tensorrt.ts.compile(
    model_script,
    **compile_settings,
    require_full_compilation=False,
    torch_executed_modules=['sub_function'],
)

torch.jit.save(model_trt, 'model_trt.ts')

model = torch.jit.load('model_trt.ts')

Expected behavior

The saved Torch-TensorRT module should load back into model so it can be used for inference.
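For reference, a minimal sketch of the intended round trip, with dummy inputs matching the shapes in compile_settings above (the random tensors are illustrative only):

import torch

# Load the saved Torch-TensorRT module and run inference.
model = torch.jit.load('model_trt.ts')

x = torch.randn(1, 256, 29, 29, device='cuda')
kernel = torch.randn(1, 256, 7, 7, device='cuda')

out = model(x, kernel)
print(out.shape)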

Environment

Build information about Torch-TensorRT can be found by turning on debug messages

  • Torch-TensorRT Version (e.g. 1.0.0): 1.0.0
  • PyTorch Version (e.g. 1.10.0+cu113): 1.10.0+cu113
  • CPU Architecture:
  • OS (e.g., Linux):
  • How you installed PyTorch (conda, pip, libtorch, source): pip
  • Build command you used (if compiling from source):
  • Are you using local sources or building from archives:
  • Python version: python 3.8.10
  • CUDA version: 11.6
  • GPU models and configuration:
  • Any other relevant information:
pauline6 added the bug label on Apr 12, 2022

handoku commented Apr 18, 2022

Any progress here? Same problem.


mjack3 commented May 10, 2022

Same problem here. Does anyone have any news?

@narendasan (Collaborator)

@bowang007 can you take a look?

narendasan added the component: core label on May 18, 2022

mjack3 commented May 19, 2022

@handoku can you try this in your work environment?

import tensorrt
tensorrt.__version__
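Since compatibility between the packages matters here, a quick sketch that prints all three versions together (assuming the standard __version__ attributes):

import torch
import torch_tensorrt
import tensorrt

print('torch:', torch.__version__)
print('torch_tensorrt:', torch_tensorrt.__version__)
print('tensorrt:', tensorrt.__version__)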


handoku commented May 19, 2022

@mjack3 I was using the NGC PyTorch 22.02 Docker image; the TensorRT version should be 8.2.3.0 according to the release notes.

@edric1261234

Same problem

@bowang007 (Collaborator)

I took a look, and I suspect the issue comes from module fallback. Are you using module fallback in your models? @edric1261234, @handoku, @mjack3
I'm not very familiar with module fallback, so I'll also need some time to look into it.


handoku commented Jun 4, 2022

> I took a look, and I suspect the issue comes from module fallback. Are you using module fallback in your models? @edric1261234, @handoku, @mjack3 I'm not very familiar with module fallback, so I'll also need some time to look into it.

Yes, a fully compiled TRT engine is fine; the error only happens when some modules fall back to Torch.
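To isolate the fallback path, a control experiment is to compile the same scripted model without excluding sub_function; on affected builds the fully compiled engine saves and reloads cleanly (a sketch reusing model_script and compile_settings from the reproduction above):

# Control: same model and settings, but fully compiled (no module fallback).
model_trt_full = torch_tensorrt.ts.compile(
    model_script,
    **compile_settings,
    require_full_compilation=True,
)

torch.jit.save(model_trt_full, 'model_trt_full.ts')
torch.jit.load('model_trt_full.ts')  # loads cleanly when nothing falls back to Torch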

@bowang007 (Collaborator)

Looks like @Njuapp hit a similar issue in #1112. I tried his fix, #1109, and with it the model provided here is supported.
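For anyone checking whether their installed build includes that fix, a minimal sketch is to rerun the reproduction and attempt the reload:

import torch

# On a build with the fix, the reload succeeds; on affected builds it
# raises the RuntimeError from the original report.
try:
    torch.jit.load('model_trt.ts')
    print('reload OK')
except RuntimeError as err:
    print('still affected:', err)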
