Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About: Bleed-Through Removal from Degraded Documents #14

Open
kerberosargos opened this issue Aug 23, 2024 · 3 comments
Open

About: Bleed-Through Removal from Degraded Documents #14

kerberosargos opened this issue Aug 23, 2024 · 3 comments

Comments

@kerberosargos
Copy link

Firstly thank you for your great work. I am newbie about AI. I need your help.

My question is about "Bleed-Through Removal from Degraded Documents" I have some images for testing. Your Docres "appearance" task almost works perfectly. I would like to clean images than try to detect text with "Craft" model. But some transparent object in image, causes problems. Some samples are as following. Can you give a advice for removing transparent objects for good text detection?

Thank you in advance for your effort.

Orginal DocRes Craft
Orginal DocRes Craft
Orginal DocRes Craft
@ZZZHANG-jx
Copy link
Owner

Firstly thank you for your great work. I am newbie about AI. I need your help.

My question is about "Bleed-Through Removal from Degraded Documents" I have some images for testing. Your Docres "appearance" task almost works perfectly. I would like to clean images than try to detect text with "Craft" model. But some transparent object in image, causes problems. Some samples are as following. Can you give a advice for removing transparent objects for good text detection?

Thank you in advance for your effort.

Thank you for your interest in our work. For the bleed-through degradation, we primarily simulated it in our training data using the bleed_through function. In the images you provided, the bleed-through is quite severe. I believe the simulation in the training data needs to be more intense, such as reducing the value in this line from 0.75 to a smaller number, and then fine-tuning the model. Additionally, you might want to try our other work, GCDRNet, which is specifically designed for appearance enhancement and might yield better results.

@kerberosargos
Copy link
Author

Firstly thank you very much for your quick answer. Acctually I have tried GCDRNet before DocRes with same test images. But DocRes result is better than GCRNet for me.

As I understand from your answer, I should train again your used dataset by changing 0.75 value. am I right?

@ZZZHANG-jx
Copy link
Owner

ZZZHANG-jx commented Aug 26, 2024

Firstly thank you very much for your quick answer. Acctually I have tried GCDRNet before DocRes with same test images. But DocRes result is better than GCRNet for me.

As I understand from your answer, I should train again your used dataset by changing 0.75 value. am I right?

Yes, that's correct. If you're not concerned with other tasks (such as binarization or dewarping), you can train only the appearance task, which will simplify things.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants