Cannot find op_type: "LayerNormalization" when converting the ONNX model, using TensorRT 8.6 #2875
Comments
The model downloaded from the above link is controlnet_opt.onnx. Would you please double-check?
Yes, this is the right model; I renamed it to unet.opt.onnx.
And I just noticed that if I do not use a dynamic image shape, the conversion succeeds.
I feel like your dynamic shape is invalid.
And I didn't see the parsing error.
https://drive.google.com/file/d/1I_l0eOIf_Y4aItCWeJDUpOOqJeKf8zez/view?usp=share_link
[04/15/2023-06:06:42] [E] [TRT] ModelImporter.cpp:726: While parsing node number 293 [LayerNormalization -> "/down_blocks.0/attentions.0/transformer_blocks.0/norm1/LayerNormalization_output_0"]:
[04/15/2023-06:06:42] [E] [TRT] ModelImporter.cpp:729: --- End node ---
You said the version is 8.6, but the log above shows [TensorRT v8503].
Hi, this is the whole log from TensorRT 8.6; I use TensorRT 8.6:
&&&& RUNNING TensorRT.trtexec [TensorRT v8600] # /workspace/out/trtexec --onnx=unet.opt.onnx --saveEngine=unet.opt.plan --minShapes=sample:2x4x32x32,encoder_hidden_states:2x77x768,controlnet_cond:2x3x256x256 --optShapes=sample:4x4x64x64,encoder_hidden_states:4x77x768,controlnet_cond:4x3x512x512 --maxShapes=sample:8x4x128x128,encoder_hidden_states:8*77x768,controlnet_cond:4x3x1024x1024
[04/18/2023-02:10:30] [E] [TRT] ModelImporter.cpp:729: --- End node ---
Hello, can you tell me how you converted ControlNet to ONNX alone?
Yes, I changed the code of demo/Diffusion: I added the model to model.py and exported the model's ONNX (overriding its get_model and get_input_names methods); an export sketch is shown below. Without a dynamic shape it works correctly, but with a dynamic shape it gives me an error about the LayerNorm plugin. I also found a much easier way to reproduce this issue: just add the --build-dynamic-shape flag.
[I] Configuring with profiles: [Profile().add('sample', min=(2, 4, 32, 32), opt=(6, 4, 64, 64), max=(8, 4, 128, 128)).add('encoder_hidden_states', min=(2, 77, 768), opt=(6, 77, 768), max=(8, 77, 768)).add('timestep', min=[1], opt=[1], max=[1])]
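For anyone asking how the export was done: below is a minimal sketch of exporting a diffusers ControlNet to ONNX with dynamic shapes. The checkpoint name and the Wrapper class are hypothetical, not from the original code; the input names and opset 17 match the trtexec command and log in this thread (opset 17 is what emits the LayerNormalization op).

```python
# Minimal sketch: export a diffusers ControlNet to ONNX with dynamic shapes.
# The checkpoint name and Wrapper class are hypothetical; input names and
# opset 17 match the log in this thread (opset 17 emits LayerNormalization).
import torch
from diffusers import ControlNetModel


class Wrapper(torch.nn.Module):
    """Unpacks the diffusers output so ONNX export sees plain tensors."""

    def __init__(self, net):
        super().__init__()
        self.net = net

    def forward(self, sample, timestep, encoder_hidden_states, controlnet_cond):
        return self.net(sample, timestep, encoder_hidden_states,
                        controlnet_cond, return_dict=False)


controlnet = Wrapper(
    ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-canny")  # hypothetical checkpoint
).eval()

# Dummy inputs at the "opt" profile shapes from the trtexec command.
sample = torch.randn(2, 4, 64, 64)
timestep = torch.tensor([1.0])
encoder_hidden_states = torch.randn(2, 77, 768)
controlnet_cond = torch.randn(2, 3, 512, 512)

torch.onnx.export(
    controlnet,
    (sample, timestep, encoder_hidden_states, controlnet_cond),
    "unet.opt.onnx",
    opset_version=17,
    input_names=["sample", "timestep", "encoder_hidden_states", "controlnet_cond"],
    dynamic_axes={
        "sample": {0: "batch", 2: "height", 3: "width"},
        "encoder_hidden_states": {0: "batch"},
        "controlnet_cond": {0: "batch", 2: "cond_height", 3: "cond_width"},
    },
)
```

With opset_version below 17, torch.onnx instead decomposes LayerNorm into primitive ops, which is why the plugin lookup only fails for opset-17 exports.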
You can reproduce this issue with the official demo code.
Are you able to reproduce this issue? Do you need any other information?
Let me answer this issue by myself: in this case, you still need to recompile TensorRT 8.6 and set LD_LIBRARY_PATH to the TensorRT .so files.
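If you go the rebuilt-plugin route, a quick way to confirm that the rebuilt libnvinfer_plugin.so is actually the one being picked up is to load it explicitly and list the registered plugin creators. A minimal sketch, assuming the /workspace/out path used in this thread:

```python
# Sketch: confirm the rebuilt plugin library loads and see which plugin
# creators are registered. The .so path is the one used in this thread.
import ctypes
import tensorrt as trt

ctypes.CDLL("/workspace/out/libnvinfer_plugin.so")  # force-load the rebuilt library

logger = trt.Logger(trt.Logger.WARNING)
trt.init_libnvinfer_plugins(logger, "")             # register all plugin creators

registry = trt.get_plugin_registry()
print([c.name for c in registry.plugin_creator_list])
```

If no LayerNormalization-style creator appears in that list, trtexec will fail with the same "Plugin not found" assertion shown in the log above.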
Same problem in TensorRT 8.6. Tried to use onnx-simplifier, but it didn't work.
Make sure the TensorRT version is higher than 8.6 and the model is exported with torch.onnx.export(..., opset_version >= 17); then this issue can be resolved.
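As a quick sanity check that your installed TensorRT build parses the LayerNormalization node natively (no plugin needed), you can run the ONNX parser from Python. A minimal sketch, assuming the model file name from this thread:

```python
# Sketch: check whether the installed TensorRT parses LayerNormalization
# natively. Model file name taken from this thread.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
print("TensorRT version:", trt.__version__)  # native LayerNorm needs >= 8.6

builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("unet.opt.onnx", "rb") as f:
    ok = parser.parse(f.read())

if ok:
    print("Parsed OK: LayerNormalization is supported natively.")
else:
    for i in range(parser.num_errors):
        print(parser.get_error(i))
```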
"tensorrt version is higher than 8.6" ,u mean the tensorrt 10 or just 8.6.1. I met the same problem when convert sam2_model.onnx to tensorrt engine file using tensorRT version 8.6.1 |
Hi @zerollzeng @brainzha @dongjinxin123, I'm having the same issue. I'm using TensorRT version 8.4.1 (I know this is an old one, but all our other models depend on it, so I can't afford to change it). How do I make it work in 8.4.1? Do I need to write a custom plugin? Please help me with this, thanks!
[09/18/2024-09:34:49] [E] [TRT] parsers/onnx/ModelImporter.cpp:776: --- End node ---
Okay, so I finally got my matching versions:
Hi guys, |
Description
root@50203672e3df:/workspace/onnx# LD_PRELOAD="/workspace/out/libnvinfer_plugin.so" /usr/src/tensorrt/bin/trtexec --onnx=unet.opt.onnx --saveEngine=unet.opt.plan --minShapes=sample:2x4x32x32,encoder_hidden_states:2x77x768,controlnet_cond:2x3x256x256 --optShapes=sample:4x4x64x64,encoder_hidden_states:4x77x768,controlnet_cond:4x3x512x512 --maxShapes=sample:8x4x128x128,encoder_hidden_states:8*77x768,controlnet_cond:4x3x1024x1024
&&&& RUNNING TensorRT.trtexec [TensorRT v8503] # /usr/src/tensorrt/bin/trtexec --onnx=unet.opt.onnx --saveEngine=unet.opt.plan --minShapes=sample:2x4x32x32,encoder_hidden_states:2x77x768,controlnet_cond:2x3x256x256 --optShapes=sample:4x4x64x64,encoder_hidden_states:4x77x768,controlnet_cond:4x3x512x512 --maxShapes=sample:8x4x128x128,encoder_hidden_states:8*77x768,controlnet_cond:4x3x1024x1024
[04/14/2023-08:56:14] [I] === Model Options ===
[04/14/2023-08:56:14] [I] Format: ONNX
[04/14/2023-08:56:14] [I] Model: unet.opt.onnx
[04/14/2023-08:56:14] [I] Output:
[04/14/2023-08:56:14] [I] === Build Options ===
[04/14/2023-08:56:14] [I] Max batch: explicit batch
[04/14/2023-08:56:14] [I] Memory Pools: workspace: default, dlaSRAM: default, dlaLocalDRAM: default, dlaGlobalDRAM: default
[04/14/2023-08:56:14] [I] minTiming: 1
[04/14/2023-08:56:14] [I] avgTiming: 8
[04/14/2023-08:56:14] [I] Precision: FP32
[04/14/2023-08:56:14] [I] LayerPrecisions:
[04/14/2023-08:56:14] [I] Calibration:
[04/14/2023-08:56:14] [I] Refit: Disabled
[04/14/2023-08:56:14] [I] Sparsity: Disabled
[04/14/2023-08:56:14] [I] Safe mode: Disabled
[04/14/2023-08:56:14] [I] DirectIO mode: Disabled
[04/14/2023-08:56:14] [I] Restricted mode: Disabled
[04/14/2023-08:56:14] [I] Build only: Disabled
[04/14/2023-08:56:14] [I] Save engine: unet.opt.plan
[04/14/2023-08:56:14] [I] Load engine:
[04/14/2023-08:56:14] [I] Profiling verbosity: 0
[04/14/2023-08:56:14] [I] Tactic sources: Using default tactic sources
[04/14/2023-08:56:14] [I] timingCacheMode: local
[04/14/2023-08:56:14] [I] timingCacheFile:
[04/14/2023-08:56:14] [I] Heuristic: Disabled
[04/14/2023-08:56:14] [I] Preview Features: Use default preview flags.
[04/14/2023-08:56:14] [I] Input(s)s format: fp32:CHW
[04/14/2023-08:56:14] [I] Output(s)s format: fp32:CHW
[04/14/2023-08:56:14] [I] Input build shape: sample=2x4x32x32+4x4x64x64+8x4x128x128
[04/14/2023-08:56:14] [I] Input build shape: encoder_hidden_states=2x77x768+4x77x768+8x768
[04/14/2023-08:56:14] [I] Input build shape: controlnet_cond=2x3x256x256+4x3x512x512+4x3x1024x1024
[04/14/2023-08:56:14] [I] Input calibration shapes: model
[04/14/2023-08:56:14] [I] === System Options ===
[04/14/2023-08:56:14] [I] Device: 0
[04/14/2023-08:56:14] [I] DLACore:
[04/14/2023-08:56:14] [I] Plugins:
[04/14/2023-08:56:14] [I] === Inference Options ===
[04/14/2023-08:56:14] [I] Batch: Explicit
[04/14/2023-08:56:14] [I] Input inference shape: controlnet_cond=4x3x512x512
[04/14/2023-08:56:14] [I] Input inference shape: encoder_hidden_states=4x77x768
[04/14/2023-08:56:14] [I] Input inference shape: sample=4x4x64x64
[04/14/2023-08:56:14] [I] Iterations: 10
[04/14/2023-08:56:14] [I] Duration: 3s (+ 200ms warm up)
[04/14/2023-08:56:14] [I] Sleep time: 0ms
[04/14/2023-08:56:14] [I] Idle time: 0ms
[04/14/2023-08:56:14] [I] Streams: 1
[04/14/2023-08:56:14] [I] ExposeDMA: Disabled
[04/14/2023-08:56:14] [I] Data transfers: Enabled
[04/14/2023-08:56:14] [I] Spin-wait: Disabled
[04/14/2023-08:56:14] [I] Multithreading: Disabled
[04/14/2023-08:56:14] [I] CUDA Graph: Disabled
[04/14/2023-08:56:14] [I] Separate profiling: Disabled
[04/14/2023-08:56:14] [I] Time Deserialize: Disabled
[04/14/2023-08:56:14] [I] Time Refit: Disabled
[04/14/2023-08:56:14] [I] NVTX verbosity: 0
[04/14/2023-08:56:14] [I] Persistent Cache Ratio: 0
[04/14/2023-08:56:14] [I] Inputs:
[04/14/2023-08:56:14] [I] === Reporting Options ===
[04/14/2023-08:56:14] [I] Verbose: Disabled
[04/14/2023-08:56:14] [I] Averages: 10 inferences
[04/14/2023-08:56:14] [I] Percentiles: 90,95,99
[04/14/2023-08:56:14] [I] Dump refittable layers:Disabled
[04/14/2023-08:56:14] [I] Dump output: Disabled
[04/14/2023-08:56:14] [I] Profile: Disabled
[04/14/2023-08:56:14] [I] Export timing to JSON file:
[04/14/2023-08:56:14] [I] Export output to JSON file:
[04/14/2023-08:56:14] [I] Export profile to JSON file:
[04/14/2023-08:56:14] [I]
[04/14/2023-08:56:14] [I] === Device Information ===
[04/14/2023-08:56:14] [I] Selected Device: Tesla T4
[04/14/2023-08:56:14] [I] Compute Capability: 7.5
[04/14/2023-08:56:14] [I] SMs: 40
[04/14/2023-08:56:14] [I] Compute Clock Rate: 1.59 GHz
[04/14/2023-08:56:14] [I] Device Global Memory: 15109 MiB
[04/14/2023-08:56:14] [I] Shared Memory per SM: 64 KiB
[04/14/2023-08:56:14] [I] Memory Bus Width: 256 bits (ECC enabled)
[04/14/2023-08:56:14] [I] Memory Clock Rate: 5.001 GHz
[04/14/2023-08:56:14] [I]
[04/14/2023-08:56:14] [I] TensorRT version: 8.5.3
[04/14/2023-08:56:14] [I] [TRT] [MemUsageChange] Init CUDA: CPU +12, GPU +0, now: CPU 28, GPU 103 (MiB)
[04/14/2023-08:56:16] [I] [TRT] [MemUsageChange] Init builder kernel library: CPU +265, GPU +76, now: CPU 347, GPU 179 (MiB)
[04/14/2023-08:56:16] [I] Start parsing network model
[libprotobuf WARNING google/protobuf/io/coded_stream.cc:604] Reading dangerously large protocol message. If the message turns out to be larger than 2147483647 bytes, parsing will be halted for security reasons. To increase the limit (or to disable these warnings), see CodedInputStream::SetTotalBytesLimit() in google/protobuf/io/coded_stream.h.
[libprotobuf WARNING google/protobuf/io/coded_stream.cc:81] The total number of bytes read was 723472052
[04/14/2023-08:56:16] [I] [TRT] ----------------------------------------------------------------
[04/14/2023-08:56:16] [I] [TRT] Input filename: unet.opt.onnx
[04/14/2023-08:56:16] [I] [TRT] ONNX IR version: 0.0.8
[04/14/2023-08:56:16] [I] [TRT] Opset version: 17
[04/14/2023-08:56:16] [I] [TRT] Producer name: pytorch
[04/14/2023-08:56:16] [I] [TRT] Producer version: 1.14.0
[04/14/2023-08:56:16] [I] [TRT] Domain:
[04/14/2023-08:56:16] [I] [TRT] Model version: 0
[04/14/2023-08:56:16] [I] [TRT] Doc string:
[04/14/2023-08:56:16] [I] [TRT] ----------------------------------------------------------------
[libprotobuf WARNING google/protobuf/io/coded_stream.cc:604] Reading dangerously large protocol message. If the message turns out to be larger than 2147483647 bytes, parsing will be halted for security reasons. To increase the limit (or to disable these warnings), see CodedInputStream::SetTotalBytesLimit() in google/protobuf/io/coded_stream.h.
[libprotobuf WARNING google/protobuf/io/coded_stream.cc:81] The total number of bytes read was 723472052
[04/14/2023-08:56:17] [W] [TRT] onnx2trt_utils.cpp:377: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[04/14/2023-08:56:17] [W] [TRT] onnx2trt_utils.cpp:403: One or more weights outside the range of INT32 was clamped
[04/14/2023-08:56:17] [I] [TRT] No importer registered for op: LayerNormalization. Attempting to import as plugin.
[04/14/2023-08:56:17] [I] [TRT] Searching for plugin: LayerNormalization, plugin_version: 1, plugin_namespace:
[04/14/2023-08:56:17] [E] [TRT] ModelImporter.cpp:726: While parsing node number 293 [LayerNormalization -> "/down_blocks.0/attentions.0/transformer_blocks.0/norm1/LayerNormalization_output_0"]:
[04/14/2023-08:56:17] [E] [TRT] ModelImporter.cpp:727: --- Begin node ---
[04/14/2023-08:56:17] [E] [TRT] ModelImporter.cpp:728: input: "/down_blocks.0/attentions.0/transformer_blocks.0/norm1/Cast_output_0"
input: "onnx::LayerNormalization_4060"
input: "onnx::LayerNormalization_4059"
output: "/down_blocks.0/attentions.0/transformer_blocks.0/norm1/LayerNormalization_output_0"
name: "/down_blocks.0/attentions.0/transformer_blocks.0/norm1/LayerNormalization"
op_type: "LayerNormalization"
attribute {
name: "axis"
i: -1
type: INT
}
attribute {
name: "epsilon"
f: 1e-05
type: FLOAT
}
[04/14/2023-08:56:17] [E] [TRT] ModelImporter.cpp:729: --- End node ---
[04/14/2023-08:56:17] [E] [TRT] ModelImporter.cpp:732: ERROR: builtin_op_importers.cpp:5428 In function importFallbackPluginImporter:
[8] Assertion failed: creator && "Plugin not found, are the plugin name, version, and namespace correct?"
[04/14/2023-08:56:17] [E] Failed to parse onnx file
[04/14/2023-08:56:17] [I] Finish parsing network model
[04/14/2023-08:56:17] [E] Parsing model failed
[04/14/2023-08:56:17] [E] Failed to create engine from model or file.
[04/14/2023-08:56:17] [E] Engine set up failed
&&&& FAILED TensorRT.trtexec [TensorRT v8503] # /usr/src/tensorrt/bin/trtexec --onnx=unet.opt.onnx --saveEngine=unet.opt.plan --minShapes=sample:2x4x32x32,encoder_hidden_states:2x77x768,controlnet_cond:2x3x256x256 --optShapes=sample:4x4x64x64,encoder_hidden_states:4x77x768,controlnet_cond:4x3x512x512 --maxShapes=sample:8x4x128x128,encoder_hidden_states:8*77x768,controlnet_cond:4x3x1024x1024
Environment
TensorRT Version: 8.6
NVIDIA GPU: T4
NVIDIA Driver Version: 450.
CUDA Version: cuda-12.0
CUDNN Version:
Operating System:
Python Version (if applicable):
Tensorflow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if so, version): nvcr.io/nvidia/pytorch:23.02-py3
Relevant Files
This is the model path:
https://drive.google.com/file/d/1b-7wg4IkErgQg8AAtRPgJjtMpqNjoWJY/view?usp=share_link
Steps To Reproduce
Run in the Docker image nvcr.io/nvidia/pytorch:23.02-py3.
Compile the TensorRT plugin and put it at /workspace/out/libnvinfer_plugin.so.
LD_PRELOAD="/workspace/out/libnvinfer_plugin.so" /usr/src/tensorrt/bin/trtexec --onnx=unet.opt.onnx --saveEngine=unet.opt.plan --minShapes=sample:2x4x32x32,encoder_hidden_states:2x77x768,controlnet_cond:2x3x256x256 --optShapes=sample:4x4x64x64,encoder_hidden_states:4x77x768,controlnet_cond:4x3x512x512 --maxShapes=sample:8x4x128x128,encoder_hidden_states:8*77x768,controlnet_cond:4x3x1024x1024