Upcoming support for new model architectures #1007

frgfm · 2022-08-01T15:12:15Z

As discussed in several GH issues, docTR could very well welcome new architectures for OCR 👍
Let's use this issue to track this for the next release!

A few things to consider:

docTR is not meant to make all architectures available. Let's focus on architectures that are reasonably sized and SOTA performances (or considered as a performance milestone for a given task).
it is acceptable to start with the implementation with only 1 DL backend. Although, gradually, within the next releases, full support needs to be added.
for faster iterations, training should be performed on synthetic data when available (perf will be pushed on private datasets later once the potential of the architecture is validated). A PR to add implementation for a given architecture should come with the exact args used in training to reproduce the training and the corresponding performances
we always have to credit the rightful contributors: papers are always cited in docTR, and providing an implementation is meant not to be a copy paste of another implementation. However is part of the code of someone else is used, that author should be credited ("borrowed from", "inspired by", etc.)

Here is the list of envisioned models:

Text detection

Text recognition

SkaarFacee · 2023-11-01T08:10:02Z

I was hoping to be able to help with implementing the PAN model

felixdittrich92 · 2023-11-02T06:40:53Z

If you want you can also work on this but as mentioned we would need to keep it as draft until the next release is done :)

felixdittrich92 · 2024-01-15T15:30:08Z

Related: #1425 (TextNet backbone TF / PT) / #1443 FAST

frgfm added this to the 0.6.0 milestone Aug 1, 2022

frgfm mentioned this issue Aug 1, 2022

Release tracker - v0.6.0 #791

Closed

85 tasks

frgfm mentioned this issue Sep 16, 2022

Support for Handwritten text #1049

Open

felixdittrich92 mentioned this issue Sep 26, 2022

Release tracker - v0.9.0 #1074

Closed

6 tasks

felixdittrich92 modified the milestones: 0.6.0, 0.7.0 Sep 26, 2022

felixdittrich92 linked a pull request Feb 2, 2024 that will close this issue

[TF/PT] Add FAST detection model #1443

Merged

3 tasks

felixdittrich92 self-assigned this Feb 9, 2024

felixdittrich92 closed this as completed in #1443 Feb 28, 2024

felixdittrich92 mentioned this issue Jun 6, 2024

Release tracker - v0.10.0 #1634

Closed

Provide feedback