This project provides code for running BiRefNet inference with TensorRT. The aim is to accelerate inference by leveraging TensorRT's high-performance runtime.
| Method | PyTorch | ONNX | TensorRT |
|---|---|---|---|
| Inference time | 0.71s | 5.32s | 0.17s |

| Method | PyTorch | ONNX | TensorRT |
|---|---|---|---|
| Inference time | 0.15s | 4.43s | 0.11s |
Note:
- Both the PyTorch and ONNX models are from the official BiRefNet GitHub repository.
- The TensorRT model was converted using Convert-ONNX-Model-to-TensorRT-Engine.
- All tests were conducted on Windows 10 with an NVIDIA RTX 4080 Super.
- Refer to model_compare.py for the conversion code; an illustrative timing sketch is shown below.
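For reference, a per-backend latency comparison like the one above can be measured with a simple warm-up-plus-timing loop. The sketch below is illustrative only and is not the repo's model_compare.py; `infer_fn` stands for any zero-argument callable that runs one forward pass.

```python
# Illustrative latency measurement; not the repo's model_compare.py.
# `infer_fn` is any zero-argument callable that runs one forward pass
# (PyTorch, ONNX Runtime, or the TensorRT engine).
import time

def measure_latency(infer_fn, warmup=5, runs=20):
    """Return the mean per-call latency of infer_fn in seconds."""
    for _ in range(warmup):              # exclude one-time setup / lazy initialization
        infer_fn()
    start = time.perf_counter()
    for _ in range(runs):
        infer_fn()
    return (time.perf_counter() - start) / runs

# Hypothetical usage:
# print("TensorRT:", measure_latency(lambda: trt_runner(batch)))
# print("PyTorch:", measure_latency(lambda: model(batch)))
```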
- Efficient inference with BiRefNet using TensorRT
- Foreground estimation
- Colab example
- Performance comparison between PyTorch, ONNX, and TensorRT inference
- Inference using Docker for an isolated and reproducible environment
- NVIDIA GPU with CUDA (>= 11.x) and cuDNN (>= 8.x)
- Python 3.9
```bash
pip install -r requirements.txt
```
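Before building an engine, it can help to confirm that the GPU stack is visible from Python. A quick check, assuming `tensorrt` and `torch` were installed by the requirements file:

```python
# Quick sanity check of the GPU stack; assumes tensorrt and torch are installed.
import tensorrt as trt
import torch

print("TensorRT version:", trt.__version__)
print("CUDA available:", torch.cuda.is_available())
print("cuDNN version:", torch.backends.cudnn.version())
```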
First, download the ONNX model from Google Drive.
Second, convert the ONNX model to a TensorRT engine using the provided conversion script:
```python
from utils import convert_onnx_to_engine

onnx_file_path = "birefnet.onnx"
engine_file_path = "engine.trt"

convert_onnx_to_engine(onnx_file_path, engine_file_path)
```
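For reference, the conversion that `convert_onnx_to_engine` performs can also be done directly with the TensorRT Python API. The sketch below assumes TensorRT 8.x; the FP16 flag and 1 GiB workspace limit are illustrative choices, not necessarily the settings used by the helper in utils.

```python
# Sketch of ONNX -> TensorRT conversion with the TensorRT 8.x Python API.
# The FP16 flag and workspace size are illustrative, not the exact settings
# used by utils.convert_onnx_to_engine.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

with open("birefnet.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("Failed to parse the ONNX model")

config = builder.create_builder_config()
config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, 1 << 30)  # 1 GiB scratch space
if builder.platform_has_fast_fp16:
    config.set_flag(trt.BuilderFlag.FP16)  # FP16 is usually faster on RTX GPUs

serialized_engine = builder.build_serialized_network(network, config)
with open("engine.trt", "wb") as f:
    f.write(serialized_engine)
```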
Now you can run inference with the TensorRT engine. For a single image:

```bash
python .\infer.py --image-path image_path --output-path result.png --output-alpha-path result_alpha.png --engine-path .\engine.trt
```

To process a directory of images, pass directories instead and add `--mode m`:

```bash
python .\infer.py --image-path image_dir --output-path output_dir --output-alpha-path alpha_dir --engine-path .\engine.trt --mode m
```
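The saved alpha matte can also be reused outside of infer.py, for example to composite the extracted subject onto a new background. A minimal Pillow sketch (file names are placeholders, and this is not part of the repo's scripts):

```python
# Composite the segmented subject onto a plain background using the
# predicted alpha matte. File names are placeholders.
from PIL import Image

image = Image.open("input.jpg").convert("RGB")              # original input image
alpha = Image.open("result_alpha.png").convert("L")         # matte written by infer.py
alpha = alpha.resize(image.size)                            # ensure matching sizes

background = Image.new("RGB", image.size, (255, 255, 255))  # plain white backdrop
composite = Image.composite(image, background, alpha)       # alpha-weighted blend
composite.save("composited.png")
```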
Contributions are welcome! Please feel free to submit a Pull Request or open an Issue if you have any suggestions or find bugs.