GitHub - shubhampundhir/BusPanel-Detection-YOLOV8: Detection(text extraction) of bus number from bus panel.Open-CV and Optical Character Recognition (OCR) for extracting the text on the bus.

Project: Detection of Bus Number in Bus Panel

Details

Author Name: Shubham Pundhir

Problem Statement

Mobility and orientation are the biggest challenges for people with visual impairments. This includes walking, taking public transportation, or even getting a ride on buses. Developing assistive technologies to help them navigate outdoor environments could potentially improve their quality of life. Since public transportation, such as buses, is a main tool for people with visual impairments to navigate outdoor, this project aims to create a proof of concept (POC) that can help people with visual impairment to determine if it’s the right bus they need through Optical Character Recognition (OCR) for the text on the bus.

Executive Summary

To assist the visually impaired passengers to travel more independently, a POC has been created to detect and recognise bus number of public buses arriving at a bus stop in Singapore. The design made use of a combination of Object Detection (using YOLOv5) and Optical Character Recognition (OCR), to extract bus numbers from the bus panel and convert the extracted text for audio notification.

Starting with data acquisition, we downloaded YouTube videos of public buses arriving at a bus stop in Singapore. Individual frames were extracted from the video every 1 second and saved as images. The collection of images were labelled using CVAT. Bounding boxes were specified around each bus panel only from the perspective of buses arriving at the bus stop. A manual review of the annotations was done to ensure that the bus panels in the images were properly labelled. The saved annotations were exported to YOLO format. The custom dataset was then formated and split into train, validation, and test set.

Following the data preparation process, we trained YOLOv5 on our custom object (bus panel) using the model’s pre-trained weights for transfer learning. The model's performance was evaluated using the mean Average Precision (mAP) metric. Our model performed pretty well as the mAP score was 89.2%. The closer our mAP score is to 100%, the better. Using this model we were able to detect and localise the bounding box coordinates of the bus panel contained in an image. We consider the model to be performing relatively well given that it is successful in generating predictions on new images. Finally, we incorporated the custom object detection model into the OCR pipeline to transform the bus number into raw text data with Tesseract OCR. The project deployment can be found on Streamlit.

Additional Information

There are a total of three notebooks in this project, namely:

Note: The custom YOLOv5 training is done entirely on Google Colab. Google Colab is a free cloud service that supports free GPU

The description and walkthrough for the entire project is available in the notebooks. The datasets for this project are accessible on Google Drive.

Conclusion

With the problem statement in mind to help people with visual impairment to determine if it’s the right bus they need, we have successfully created a POC for deployment and tested that it is indeed tangible to make use of computer vision-based system to solve our problem. We were able to train YOLOv5 on our custom dataset and found that YOLOv5 trains quickly, inferences quickly, and performs really well in detecting the bus panel contained in an image. But, OCR is not without its challenges. There were some limitations found when applied to recognize the bus number from the bus panel. Eg. Not being able read the bus number within the region of interest (ROI).

Future Exploration

Try out different OCR software
Better ways of preprocessing the images before feeding into the OCR software
Train the object detection model on images of buses at night since the current model is largely trained on images of buses in the day
Test the OCR pipeline on a video or on a live camera =======

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
codes		codes
data		data
.gitignore		.gitignore
README.md		README.md
README.txt		README.txt
bus_number.mp4		bus_number.mp4
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project: Detection of Bus Number in Bus Panel

Details

Problem Statement

Executive Summary

Additional Information

Conclusion

Future Exploration

About

Releases

Packages

Languages

shubhampundhir/BusPanel-Detection-YOLOV8

Folders and files

Latest commit

History

Repository files navigation

Project: Detection of Bus Number in Bus Panel

Details

Problem Statement

Executive Summary

Additional Information

Conclusion

Future Exploration

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages