Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
-
Updated
Apr 25, 2025 - Python
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/implementations - Vision Transformer (ViT), ResNetV2, EfficientNetV2, NeRF, SegFormer, MixTransformer, (planned...) DeepLabV3+, ConvNeXtV2, YOLO, etc.
This repository provides an overview of Segformer, architecture encoder in particular. Some details of Segformer can be misleaded, thus makes a short description here to help understand the model. Meanwhile, the code (Keras/TensorFlow) is also provided for supporting.
This is a warehouse for SegFormer-pytorch-model, can be used to train your image datasets for segmentation tasks.
PyTorch Implementation of SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Real time Semantic Segmentation on Cityscapes dataset.
Tackling the challenges of off-road environment navigation through Attention Guided Off-Road Semantic Segmentation.
Semantic Segmentation of Drone Images With Hugging Face and MLflow
An application for automatic road damage assessment using semantic segmentation on high-resolution images. The project helps municipal authorities and maintenance teams detect and prioritize road repairs, improving safety and reducing costs.
Repository of the master's thesis focused on the Automatic Detection of Objects in the Track
A machine learning project designed to detect road damage from images, leveraging deep learning and computer vision techniques for efficient and accurate damage detection.
A repository of SegFormer model test
AI Pipeline that is truly free, and doesnt require and internet connection to use. Built and Optimized for use with Acer Predator Helios Neo 16 w/RTX4060
This repository implements and compares deep learning models for semantic segmentation using the Oxford-IIIT Pet and Cityscapes datasets.
Multiclass semantic segmentation problem using U-Net and SegFormer, trained on a multi-GPU.
This repo showcase the ENPM673: Perception for Autonomous robots final project. A vision transformer (ViT) architecture SegFormer, has been replicated for implementing semantic segmentation. Furthermore, it was deployed on raspberry pi with pi cam setup for validating the real-time performance.
Add a description, image, and links to the segformer topic page so that developers can more easily learn about it.
To associate your repository with the segformer topic, visit your repo's landing page and select "manage topics."