Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[models] TF & PT: Retrain all detection models with random rotation augmentation #1364

Closed
11 tasks done
Tracked by #1304
felixdittrich92 opened this issue Oct 30, 2023 · 0 comments
Closed
11 tasks done
Tracked by #1304
Assignees
Labels

Comments

@felixdittrich92
Copy link
Contributor

felixdittrich92 commented Oct 30, 2023

🚀 The feature

All detection models should be trained with rotation augmentation

PyTorch:

  • db_resnet50
  • db_resnet34
  • db_mobilenet_v3_large
  • linknet_resnet18
  • linknet_resnet34
  • linknet_resnet50

TensorFlow:
Note: (#1348 should be merged before we start with TF)

  • db_resnet50
  • db_mobilenet_v3_large (running)
  • linknet_resnet18
  • linknet_resnet34
  • linknet_resnet50

Motivation, pitch

  • After this retrain every detection model can be used also for rotated pages / text detection
  • We can drop linknet_resnet18_rotation (TF) and db_resnet50_rotation (PT)
  • We can remove the limitation ->
    if arch not in ROT_ARCHS and not assume_straight_pages:
  • with the new trained models and [prototype] compute orientation on segmentation map #1336 the page orientation detection will be much more robust (range -90 - 90 degrees) / upside down still open
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants