This repository provides an end-to-end implementation of YOLOv8 for segmentation. Unlike most implementations available online, this version incorporates all post-processing directly inside the ONNX model, from Non-Maximum Suppression (NMS) to mask calculations, making it a true one-stop solution for real-time object detection and segmentation using ONNX.
- End-to-End ONNX Model: The ONNX model handles all post-processing, including NMS and mask calculation, within the model.
- Multiple Outputs:
- Bounding Boxes: Coordinates of the detected objects.
- Final Mask: A single combined mask that covers every object detected in the image.
- Individual Masks: Separate binary masks for each object detected, which can be applied to the image for visualization.
There are no complete end-to-end implementations of YOLOv8 segmentation models available online. Most solutions require external post-processing code for tasks such as Non-Maximum Suppression (NMS) and mask generation. This repository addresses that gap by embedding all post-processing steps into the ONNX model, making it easier to integrate the model into any pipeline.
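For reference, here is a minimal sketch of how an end-to-end model like this can be loaded and queried with ONNX Runtime once the custom post-processing ops from onnxruntime-extensions are registered. The model filename, the NCHW float input layout, and the 640x640 size are assumptions rather than guarantees about this repository's export; check `sess.get_inputs()` and `sess.get_outputs()` against your own model.

```python
# Minimal sketch (not the repository's run scripts): load the end-to-end model
# with the onnxruntime-extensions custom-op library registered, inspect its
# interface, and run it once on dummy data.
import numpy as np
import onnxruntime as ort
from onnxruntime_extensions import get_library_path

so = ort.SessionOptions()
so.register_custom_ops_library(get_library_path())  # makes the stitched post-processing ops resolvable
sess = ort.InferenceSession("yolov8n-seg-final.onnx", so, providers=["CPUExecutionProvider"])

for i in sess.get_inputs():
    print("input :", i.name, i.shape, i.type)
for o in sess.get_outputs():
    print("output:", o.name, o.shape, o.type)

# Assumption: the model takes a preprocessed NCHW float tensor at its export size.
dummy = np.random.rand(1, 3, 640, 640).astype(np.float32)
outputs = sess.run(None, {sess.get_inputs()[0].name: dummy})  # expected: boxes, final mask, individual masks (order assumed)
```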
git clone https://github.com/namas191297/yolov8-segmentation-end2end-onnxruntime.git
cd yolov8-segmentation-end2end-onnxruntime
It is recommended to create a virtual environment to manage dependencies.
# Using conda (preferred)
conda create --name yolov8-segmentation python=3.9
conda activate yolov8-segmentation
pip install -r requirements.txt
You can run the following script to download the required YOLOv8 model, convert it to ONNX format, and stitch the post-processing onto the model via ONNX Runtime extensions.
python add_postprocessing_yolov8.py \
--yolo-version 8 \
--model-size 512 \
--model yolov8n-seg.onnx \
--final-model yolov8n-seg-final.onnx \
--download-model \
--run-inference \
--input-width 512 \
--input-height 512
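To confirm that the post-processing was actually stitched in, you can optionally inspect the exported graph with the onnx package. The filename below assumes the --final-model value used in the command above.

```python
# Optional sanity check: list the graph-level inputs and outputs of the
# stitched model. After post-processing is added, the outputs should be the
# decoded detections/masks rather than the raw YOLOv8 head tensors.
import onnx

model = onnx.load("yolov8n-seg-final.onnx")  # path assumed from --final-model above
print("graph inputs :", [(i.name, [d.dim_value or d.dim_param for d in i.type.tensor_type.shape.dim]) for i in model.graph.input])
print("graph outputs:", [o.name for o in model.graph.output])
```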
You can run the model using your webcam feed for real-time segmentation.
python run_webcam.py --model models/yolov8n-640x640-end2end.onnx --input-width 640 --input-height 640
You can also run it on a single image:
python run_image.py --model models/yolov8n-640x640-end2end.onnx --image assets/pexels-car.jpg --input-width 640 --input-height 640
- --input-width: The input width for the model (default is 640).
- --input-height: The input height for the model (default is 640).
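For context, this is roughly how a frame would be resized to match --input-width and --input-height before inference. The run scripts already handle preprocessing internally; the normalized NCHW float layout shown here is an assumption about the exported model's input, not a contract.

```python
# Illustrative preprocessing only; run_image.py / run_webcam.py do their own.
import cv2
import numpy as np

def preprocess(image_bgr, input_width=640, input_height=640):
    resized = cv2.resize(image_bgr, (input_width, input_height))  # match --input-width/--input-height
    rgb = cv2.cvtColor(resized, cv2.COLOR_BGR2RGB)                # OpenCV loads BGR; model assumed to expect RGB
    tensor = rgb.astype(np.float32) / 255.0                       # scale pixels to [0, 1] (assumption)
    return np.transpose(tensor, (2, 0, 1))[None]                  # HWC -> NCHW with a batch dimension
```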
The model provides the following outputs:
- Bounding Boxes: The coordinates (xmin, ymin, xmax, ymax) enclosing each detected object, along with its class label and confidence score.
- Final Mask: A single combined mask representing all detected objects in the image.
- Individual Masks: Separate masks for each object, making it easy to apply and visualize each object's segmentation.
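As an illustration, the individual masks can be blended onto the original image like this. The (N, H, W) binary-mask layout and the variable names are assumptions about the model's per-object output; adapt them to whatever your export reports.

```python
# Hedged visualization sketch: color each per-object mask and blend it over the frame.
import cv2
import numpy as np

def overlay_masks(image_bgr, instance_masks, alpha=0.5, seed=0):
    """instance_masks: array of shape (N, H, W), one binary mask per detected object (assumed layout)."""
    rng = np.random.default_rng(seed)
    overlay = image_bgr.copy()
    for mask in instance_masks:
        color = rng.integers(0, 255, size=3, dtype=np.uint8)            # random color per object
        resized = cv2.resize(mask.astype(np.uint8),
                             (image_bgr.shape[1], image_bgr.shape[0]))  # match the original frame size
        overlay[resized > 0] = color
    return cv2.addWeighted(overlay, alpha, image_bgr, 1 - alpha, 0)     # alpha-blend masks over the image
```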
This repository uses the YOLOv8 model provided by Ultralytics. Please note that YOLOv8 is licensed under the AGPL-3.0 License. If you distribute or modify this repository, ensure compliance with the terms of the AGPL-3.0 license.
For more details, you can review the license here: Ultralytics YOLOv8 License.
This repository is a modified version of Ultralytics YOLOv8 licensed under the AGPL-3.0 License. Modifications include:
- Integrated all post-processing directly into the ONNX model.
- Enhanced real-time object detection and segmentation capabilities.
- Optimized model performance for specific use cases.
Date of Modifications: October 2024
Contributions are welcome! Feel free to submit issues, fork the repository, and open pull requests.