Receipt Parsing

This project demonstrates how to use the Donut model with CORD dataset to perform receipt parsing. Receipt parsing involves extracting structured information from photographed receipts, such as itemized lists and totals.

Overview

The CORD dataset is a collection of receipts and invoices with ground truth annotations.
We leverage the power of Hugging Face Transformers to fine-tune a pre-trained Donut model for receipt parsing.
The model used in this project is fahmiaziz/finetune-donut-cord-v2.5, which is adapted to the CORD-V2 dataset you can see it here.

Requirements

To run this project, you need:

Python 3.7+
PyTorch
PyTorch-Lightning
Hugging Face Transformers
Pillow
flask

Model Training and Evaluation

We trained and evaluated our receipt parsing model using the Donut model with CORD-V2 dataset. The goal was to achieve a high accuracy of 90% or above. You can access the detailed training and evaluation results on Weights & Biases (WandB):

Model Training and Evaluation Dashboard

Here are some highlights of the training and evaluation process:

Dataset: We used the CORD dataset, which includes a diverse collection of receipts and invoices.
Model: Our model is based on the fahmiaziz/finetune-donut-cord-v2.5 architecture that has been fine-tuned from donut-base and customized specifically for receipt parsing.
Training Metrics: During training, we monitor various metrics, including accuracy, Tree Edit Distance (Tree ED) to ensure model performance.
Evaluation: Our model achieved over 90% accuracy on the test dataset, demonstrating its effectiveness in parsing receipts.
Visualization: The training and evaluation process can be visualized through the WandB dashboard linked above.

Feel free to explore the details of the training and evaluation results in WandB to gain more insight into our model's performance.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
gemini-vision		gemini-vision
img_test		img_test
notebook		notebook
templates		templates
.gitignore		.gitignore
README.md		README.md
evaluation.png		evaluation.png
predictions.png		predictions.png
requirements.txt		requirements.txt
server.py		server.py
vision.py		vision.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Receipt Parsing

Overview

Requirements

Model Training and Evaluation

Demonstration

About

Releases

Packages

Languages

fahmiaziz98/receipt_parsing

Folders and files

Latest commit

History

Repository files navigation

Receipt Parsing

Overview

Requirements

Model Training and Evaluation

Demonstration

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages