Adding ViTSTR #513

felixdittrich92 · 2021-09-29T21:02:47Z

Adding Vision Transformer for scene text recognition i work currently on this (with huggingface ViT backbone) if i done and have solid results it would be a charme for me to add this model if you interested !? :)
Same for the new unilm/TrOCR model

charlesmindee · 2021-09-30T14:26:56Z

Hi @felixdittrich92,

Thanks for your message, it would be a pleasure having you contributing to the lib!

We already have a recognition model including a transformer decoder (MASTER), but we do not have yet full transformer architectures such as ViT or TrOCR. It is on the mid-term road map, and if you would like to propose your implementation you are more than welcome to open a PR! 🙏

Please read the CONTRIBUTING section and feel free to look at the models already implemented in doctr 😃

Thank you and have a nice day 👍

felixdittrich92 · 2021-09-30T15:21:37Z

i will do thanks :) 👍

charlesmindee · 2022-04-28T08:57:27Z

Hi @felixdittrich92, do you still plan to implement this ? If not, we may close this issue to avoid a huge stack of unaddressed ones!

felixdittrich92 · 2022-04-28T11:18:21Z

Huhu @charlesmindee 👋 ,
yes of course (maybe a bit lighter version with mobilevit) but i think ftm there are other thinks like a fix for master and sar are more important so i would say lets hold this on 1.0.0 wdyt ? 👍

charlesmindee · 2022-04-29T09:46:35Z

ok

chpatrick · 2023-01-30T22:57:41Z

@felixdittrich92 Hi, are there any model weights available for ViTSTR that are compatible with doctr? :)

I saw these ones but they seem to be named differently I suppose: https://github.com/roatienza/deep-text-recognition-benchmark/releases

charlesmindee added type: enhancement Improvement module: models Related to doctr.models topic: text recognition Related to the task of text recognition labels Sep 30, 2021

charlesmindee self-assigned this Sep 30, 2021

felixdittrich92 mentioned this issue Oct 30, 2021

Adding more languages for recognition models by default #563

Closed

fg-mindee added this to the 1.0.0 milestone Dec 10, 2021

felixdittrich92 mentioned this issue Sep 6, 2022

[DRAFT] [models] add ViTSTR in TF and PT #1048

Closed

2 tasks

felixdittrich92 linked a pull request Sep 19, 2022 that will close this issue

[models] add ViTSTR TF and PT and update ViT to work as backbone #1055

Merged

felixdittrich92 mentioned this issue Sep 21, 2022

Upcoming support for new model architectures #1007

Closed

3 tasks

felixdittrich92 closed this as completed in #1055 Sep 21, 2022

felixdittrich92 removed this from the 1.0.0 milestone Oct 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding ViTSTR #513

Adding ViTSTR #513

felixdittrich92 commented Sep 29, 2021

charlesmindee commented Sep 30, 2021 •

edited

Loading

felixdittrich92 commented Sep 30, 2021

charlesmindee commented Apr 28, 2022

felixdittrich92 commented Apr 28, 2022

charlesmindee commented Apr 29, 2022

chpatrick commented Jan 30, 2023

Adding ViTSTR #513

Adding ViTSTR #513

Comments

felixdittrich92 commented Sep 29, 2021

charlesmindee commented Sep 30, 2021 • edited Loading

felixdittrich92 commented Sep 30, 2021

charlesmindee commented Apr 28, 2022

felixdittrich92 commented Apr 28, 2022

charlesmindee commented Apr 29, 2022

chpatrick commented Jan 30, 2023

charlesmindee commented Sep 30, 2021 •

edited

Loading