Rotate page #488

Rob192 · 2021-09-22T12:06:31Z

This is part of the solutions discussed in #225 and addresses the point "Integrate the feature in the predictor while being disabled by default" of the "Page-level orientation" part.

It implements the function rotate_page and tracks the 'page_angles'.

# Conflicts: # doctr/models/core.py

# Conflicts: # doctr/models/_utils.py # doctr/models/core.py

codecov · 2021-09-22T12:15:04Z

Codecov Report

Merging #488 (66fad67) into main (e14645a) will increase coverage by 0.17%.
The diff coverage is 96.29%.

@@            Coverage Diff             @@
##             main     #488      +/-   ##
==========================================
+ Coverage   96.15%   96.32%   +0.17%     
==========================================
  Files         114      117       +3     
  Lines        4447     4523      +76     
==========================================
+ Hits         4276     4357      +81     
+ Misses        171      166       -5

Flag	Coverage Δ
unittests	`96.32% <96.29%> (+0.17%)`	⬆️

Impacted Files	Coverage Δ
doctr/utils/visualization.py	`92.03% <0.00%> (ø)`
doctr/utils/geometry.py	`98.00% <96.66%> (-0.79%)`	⬇️
doctr/models/_utils.py	`95.34% <100.00%> (+1.30%)`	⬆️
doctr/models/predictor/pytorch.py	`100.00% <100.00%> (ø)`
doctr/models/predictor/tensorflow.py	`100.00% <100.00%> (ø)`
doctr/utils/common_types.py	`100.00% <100.00%> (ø)`
...dels/detection/differentiable_binarization/base.py	`89.44% <0.00%> (-2.49%)`	⬇️
doctr/datasets/cord.py	`97.43% <0.00%> (ø)`
doctr/datasets/funsd.py	`96.87% <0.00%> (ø)`
doctr/datasets/sroie.py	`94.87% <0.00%> (ø)`
... and 17 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e14645a...66fad67.

fg-mindee

Thanks a lot for jumping back into this 🙏

I added a few comments but for me, there is only one major part missing:

With this PR, we rotate the page first, before any DL model forward pass.
The process is almost identical, but the word localizations have to be remapped onto the original pages: if we detect that the page has a 30° orientation, we forward its straightened version, and if we detect a straight bounding box in it, we have to reproject that box correctly so that the localization refers to the original page.
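
To make the reprojection concrete, here is a minimal sketch in plain NumPy. It is not the code from this PR: the helper names rotate_points and box_corners are hypothetical, and it assumes the page is rotated about its center on a canvas of unchanged size (no expand). A straight box found on the straightened page is mapped back by applying the opposite rotation to its four corners.

```python
import numpy as np


def rotate_points(points: np.ndarray, angle_deg: float, center: tuple) -> np.ndarray:
    """Rotate (N, 2) absolute (x, y) points by angle_deg around center.

    Hypothetical helper: it uses the standard 2D rotation matrix, so the
    visual direction depends on whether the y axis points up or down.
    """
    theta = np.deg2rad(angle_deg)
    rot = np.array([[np.cos(theta), -np.sin(theta)],
                    [np.sin(theta), np.cos(theta)]])
    center_arr = np.asarray(center, dtype=float)
    return (points - center_arr) @ rot.T + center_arr


def box_corners(xmin: float, ymin: float, xmax: float, ymax: float) -> np.ndarray:
    """Return the four corners of a straight box as a (4, 2) array."""
    return np.array([[xmin, ymin], [xmax, ymin], [xmax, ymax], [xmin, ymax]], dtype=float)


height, width = 32, 64
center = (width / 2, height / 2)
page_angle = 30.0  # assumed detected page orientation, in degrees

# A straight box detected on the straightened page
straight_box = box_corners(15, 10, 35, 20)
# Assumption: the page was straightened by rotating it by -page_angle,
# so rotating the box by +page_angle places it back on the original page
remapped = rotate_points(straight_box, page_angle, center)
# Straightening the remapped polygon again recovers the straight box
roundtrip = rotate_points(remapped, -page_angle, center)
```

The remapped box is no longer axis-aligned, which is why it is kept as four corner points rather than as (xmin, ymin, xmax, ymax).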

Let me know what you think :)

Dockerfile

doctr/models/_utils.py

doctr/models/predictor/tensorflow.py

fg-mindee · 2021-09-29T15:42:55Z

@Rob192 also, if you don't mind merging the main branch, there seem to be some conflicts 😅

Rob192 · 2021-09-30T09:06:50Z

> @Rob192 also, if you don't mind merging the main branch, there seem to be some conflicts 😅

Yes! You guys are moving too fast for me 😮

Rob192 · 2021-09-30T09:55:18Z

Hello,

Thanks for the review! My objective is now to use geometry.rotate_image to rotate the documents before they go into the predictor, and then use the same function to rotate every box around the document center used previously.

However, I am a bit confused about the expand functionality you added to several functions in Doctr, and I hoped you could help me clarify it. I think your goal was for every function to behave the same way, i.e. given an image of shape (10, 20), a rotation by an angle of 90° with expand=True would give a result image of shape (20, 10). This is not the case for all the functions in utils.geometry:

import numpy as np
import matplotlib.pyplot as plt

import torch
import tensorflow as tf

from doctr.utils import geometry
from doctr.transforms.functional.tensorflow import rotate as rotate_tf
from doctr.transforms.functional.pytorch import rotate as rotate_to

# One subplot per rotation function, to compare the outputs side by side
_, axes = plt.subplots(3, 1)

# 1. NumPy implementation: doctr.utils.geometry.rotate_image
img = np.ones((32, 64, 3), dtype=np.float32)
rotated = geometry.rotate_image(img, 45., expand=True)
print(rotated.shape)
axes[0].imshow(rotated)

# 2. PyTorch implementation (channels-first tensor)
input_t = torch.ones((3, 32, 64), dtype=torch.float32)
boxes = np.array([
    [15, 20, 35, 30]
])
r_img, r_boxes = rotate_to(input_t, boxes, angle=45., expand=True)
print(r_img.shape)
# imshow expects channels-last data, hence the permute
axes[1].imshow(r_img.permute((1, 2, 0)))

# 3. TensorFlow implementation (channels-last tensor)
input_t = tf.ones((32, 64, 3), dtype=tf.float32)
boxes = np.array([
    [15, 20, 35, 30]
])
r_img, r_boxes = rotate_tf(input_t, boxes, angle=45., expand=True)
print(r_img.shape)
axes[2].imshow(r_img)

plt.show()

Do you think I could modify geometry.rotate_image so that expand=True has the same behaviour as the two other functions in doctr.transforms.functional? More precisely, I would add an expand_type parameter that could take values in ['crop', 'keep_original_shape', 'keep_destination_shape'].
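
For reference, the shape that a fully expanded rotation needs can be worked out from the angle alone. The sketch below is only the geometric bounding-canvas formula, with a hypothetical helper name expanded_shape; it does not claim to reproduce what any of the three functions above actually return.

```python
import math


def expanded_shape(height: int, width: int, angle_deg: float) -> tuple:
    """Smallest (height, width) canvas that contains the whole rotated image.

    Hypothetical helper illustrating the geometry only.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = abs(math.cos(theta)), abs(math.sin(theta))
    # round() rather than ceil(): cos(90°) is ~6e-17 in floating point,
    # and ceil() would turn 10.000000000000002 into 11
    new_height = int(round(height * cos_t + width * sin_t))
    new_width = int(round(height * sin_t + width * cos_t))
    return new_height, new_width


print(expanded_shape(10, 20, 90.0))  # (20, 10)
print(expanded_shape(32, 64, 45.0))  # (68, 68)
```

With this formula, the (10, 20) image from the example above becomes (20, 10) at 90°, and the (32, 64) test image needs a 68 x 68 canvas at 45°.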

fg-mindee

Thanks for the edits! Regarding your question about rotate_image, it's still an ongoing implementation so we can add optional features in it. But to keep the PR short, perhaps this should be implemented in the predictor directly? (we can refactor in another PR once we get this working)

Dockerfile

doctr/models/predictor/tensorflow.py

test/common/test_models.py

Rob192 · 2021-10-04T15:39:55Z

Hello @fg-mindee !
Here are my modifications to remap the localizations of the words back onto the original pages. Here are some details about the choices I made:

I created a new rotate_image function inside doctr/models/_utils.py per your recommendation. I removed the last part of the function, which kept the size of the original image. I introduced a mask functionality to crop the padding left after the + then - rotation.
I had to modify rotate_boxes to follow the expand that is done in rotate_image. The calculation for the rotation of the boxes was not right and I fixed that. There is also a mask functionality in rotate_boxes.
I hope this makes sense to you guys.
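
As an illustration of the mask idea, here is a minimal NumPy sketch. It is an assumption about how such a crop can work, not the implementation in this PR, and crop_to_mask is a hypothetical name: a boolean mask marks the pixels that belong to the page, and the padded canvas is cropped to the bounding rectangle of that mask.

```python
import numpy as np


def crop_to_mask(img: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Crop img to the smallest axis-aligned rectangle holding every True pixel of mask."""
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    return img[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]


# Simulate a 32x64 page sitting in the middle of a padded 68x68 canvas,
# as could be left over after an expanded rotation and its inverse
page = np.ones((32, 64, 3), dtype=np.float32)
canvas = np.zeros((68, 68, 3), dtype=np.float32)
mask = np.zeros((68, 68), dtype=bool)
top, left = (68 - 32) // 2, (68 - 64) // 2
canvas[top:top + 32, left:left + 64] = page
mask[top:top + 32, left:left + 64] = True

cropped = crop_to_mask(canvas, mask)
print(cropped.shape)  # (32, 64, 3)
```

In practice the mask would go through the same rotations as the image, so that it marks the page pixels whatever the angle.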

fg-mindee

Thanks again Rob!
I think there are still a few edge cases we are not covering, but this is taking a very solid shape!

doctr/models/_utils.py

doctr/utils/visualization.py

doctr/models/predictor/tensorflow.py

doctr/utils/geometry.py

fg-mindee

I still have a few comments, the major one being about the reprojection of rotated bounding boxes.

doctr/utils/geometry.py

doctr/models/zoo.py

fg-mindee

There is one merge conflict to resolve, minor tweaks in the unittest and we can merge!
Sorry about this 🙃

tests/pytorch/test_models_detection_pt.py

tests/pytorch/test_models_recognition_pt.py

tests/pytorch/test_models_zoo_pt.py

Rob192 · 2021-12-02T12:43:06Z

OK @fg-mindee ! Now I got it for the testing !

fg-mindee

Looks good to me, thanks a lot @Rob192 ! @charlesmindee would you mind taking a look at it as well before we merge? 🙏

charlesmindee

Thanks for the PR !

fg-mindee

Sorry Robin, a minor thing I missed in my last review to adjust 😅
That shouldn't change much, but especially on this feature it's important to have a good understanding of each argument's influence on the final output!

doctr/utils/geometry.py

fg-mindee

There seems to be a failing test: you might have to pass preserve_origin_shape=True where I added the comment 👍

doctr/utils/geometry.py

doctr/models/predictor/pytorch.py

fg-mindee

Make that last argument non-optional and we're good!

doctr/utils/geometry.py

fg-mindee

All good 🙌 Thanks a lot!

Rob192 · 2021-12-06T08:23:10Z

Yeah ! Thanks a lot for your support in reviewing this !

Rob192 added 5 commits July 19, 2021 15:38

feat: integrate image rotation before using predictor

1af0fc1

Merge branch 'main' into rotate_page

811fd6a

# Conflicts: # doctr/models/core.py

Merge branch 'main' into rotate_page

6f5c183

# Conflicts: # doctr/models/_utils.py # doctr/models/core.py

feat: add rotate_document functionality

c2b18d7

fix: remove min_angle from rotate_page

312179f

fg-mindee suggested changes Sep 24, 2021

Dockerfile (outdated)

doctr/models/_utils.py (outdated)

doctr/models/predictor/tensorflow.py (outdated)

doctr/models/predictor/tensorflow.py (outdated)

fg-mindee self-assigned this Sep 24, 2021

fg-mindee added the module: models label Sep 24, 2021

fg-mindee added this to the 0.5.0 milestone Sep 24, 2021

fg-mindee added the type: enhancement label Sep 24, 2021

fg-mindee mentioned this pull request Sep 24, 2021

[models] detect page orientation #225

Closed

Rob192 added 2 commits September 30, 2021 10:51

merge

b6bc74f

fix: correct models.predictor.tensorflow

6fe1bc6

fg-mindee suggested changes Sep 30, 2021

Dockerfile (outdated)

doctr/models/predictor/tensorflow.py (outdated)

doctr/models/predictor/tensorflow.py

test/common/test_models.py (outdated)

Rob192 added 3 commits October 1, 2021 10:14

fix: minor corrections

a6f2ff1

feat: Rotate back images and boxes after straightening

ccda4d0

fix: correct typo

7d4ed75

fg-mindee reviewed Oct 5, 2021

doctr/models/_utils.py (outdated)

doctr/utils/visualization.py

doctr/models/predictor/tensorflow.py (outdated)

doctr/utils/geometry.py (outdated)

Rob192 added 2 commits October 5, 2021 13:52

fix: merge two functions rotate_image

a303b23

fix: do not rotate back pages but only boxes

7a78263

fg-mindee reviewed Oct 6, 2021

doctr/utils/geometry.py (outdated)

doctr/utils/geometry.py (outdated)

doctr/utils/geometry.py (outdated)

fg-mindee reviewed Oct 6, 2021

doctr/models/zoo.py (outdated)

Rob192 added 3 commits October 6, 2021 12:33

fix: typos

eb341ac

fix: add more testing for remap_boxes in cases of boxes with an angle…

eeff2d6

… of 45°

fix: remove the cropping after rotation of the image

16f3489

Rob192 added 2 commits December 2, 2021 10:08

fix: add testing for pytorch predictor

faac0bd

fix: styling

938c9f2

fg-mindee suggested changes Dec 2, 2021

fix: correct testing for ocrpredictor with pytorch

30a70f2

Rob192 added 2 commits December 2, 2021 13:44

fix: correct imports for testing

a7c0d55

fix: isort

ce23100

fg-mindee approved these changes Dec 2, 2021

charlesmindee previously approved these changes Dec 3, 2021

fg-mindee suggested changes Dec 3, 2021

doctr/utils/geometry.py (outdated)

doctr/utils/geometry.py

fix: make sure that expand in rotate_image is keeping the same image …

8525b14

…ratio

Rob192 dismissed charlesmindee’s stale review via 8525b14 December 3, 2021 14:25

fg-mindee reviewed Dec 3, 2021

doctr/utils/geometry.py (outdated)

doctr/models/predictor/pytorch.py

Rob192 added 3 commits December 3, 2021 17:14

fix: styling

7bcf639

fix: use absolute centers for rotate_boxes

2205737

fix: calculation of image_center and documentation

484451b

fg-mindee suggested changes Dec 5, 2021

doctr/utils/geometry.py (outdated)

doctr/utils/geometry.py (outdated)

fix: remove default value for orig_shape in rotate_boxes

66fad67

fg-mindee approved these changes Dec 5, 2021

fg-mindee merged commit 8382896 into mindee:main Dec 6, 2021

This was referenced Dec 7, 2021

feat: add line resolution for rotated boxes #677

Merged

[transforms] Add geometric transformations (train detection models with rotated samples) #352

Closed

fg-mindee added the ext: tests, module: utils and type: new feature labels and removed the type: enhancement label Dec 31, 2021

Rob192 deleted the rotate_page branch January 11, 2022 13:11

Rob192 mentioned this pull request Jan 11, 2022

Restore remap_boxes function #800

Closed

Rob192 mentioned this pull request Jan 25, 2022

Restore remap boxes #812

Merged
