YOLO #7496
Conversation
cc @NicolasHug
Seems like I was able to fix most of the unit tests. The biggest job was getting the TorchScript compilation to work. To fix it, I had to make the code a bit uglier in some places. For example, the target matching classes cannot be subclasses, because JIT compilation can't handle subclassing. Also, the IoU function and the cross-entropy function cannot be passed as function objects, which makes the loss computation a bit awkward. I don't know if there would be some way to make function objects work.
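As a rough illustration of the kind of restriction this refers to (a minimal, hypothetical sketch, not the actual code in this PR): a plain Python callable such as an IoU function can't simply be stored on a module and used under torch.jit.script, so the choice ends up being dispatched explicitly.

```python
import torch
import torchvision
from torch import Tensor, nn


class BoxMatcher(nn.Module):
    # Hypothetical example: instead of storing an arbitrary IoU callable as a
    # module attribute (which TorchScript scripting does not accept), the
    # choice is encoded as a string and dispatched explicitly in forward().
    def __init__(self, iou_name: str = "iou") -> None:
        super().__init__()
        self.iou_name = iou_name

    def forward(self, boxes1: Tensor, boxes2: Tensor) -> Tensor:
        if self.iou_name == "giou":
            return torchvision.ops.generalized_box_iou(boxes1, boxes2)
        return torchvision.ops.box_iou(boxes1, boxes2)


scripted = torch.jit.script(BoxMatcher("giou"))
print(scripted(torch.tensor([[0.0, 0.0, 2.0, 2.0]]), torch.tensor([[1.0, 1.0, 3.0, 3.0]])))
```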
def compute_mean_std(tensor):
    # can't compute mean of integral tensor
    tensor = tensor.to(torch.double)
Is there a reason for using torch.double instead of torch.float32? Also, just in case: there exists a torch.std_mean function.
Thanks for reviewing, @vadimkantorov. This function used to be an inner function of test_detection_model() and I just copied it outside the function so that other functions can call it. So I don't know if there was a reason to cast to double instead of float32, but based on the above comment it seems like float32 would be fine. Should I change it to float32 and also switch to std_mean()?
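For reference, a sketch of what that change might look like (just an illustration of the suggestion, not code that's in the PR):

```python
import torch


def compute_mean_std(tensor):
    # Mean/std are not defined for integral tensors, so cast first;
    # torch.std_mean computes both in a single pass and returns (std, mean).
    tensor = tensor.to(torch.float32)
    std, mean = torch.std_mean(tensor)
    return mean, std
```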
I'm not one of the maintainers, so let's wait for a more official review :) This is also part of the test code, so it's less important...
@vadimkantorov all right, thanks for pointing that out though.
I'm attempting to test these models by training them on the COCO dataset, but I am unable to get close to the results reported in the original papers, for example YOLOv7 here: https://github.com/WongKinYiu/yolov7?tab=readme-ov-file#performance. I'm wondering if I am doing something wrong in training these, or if you can provide any guidance on what you are doing to train them and the results you are seeing?
Hi @patches11. I haven't trained YOLO models recently, and I don't have access to a compute cluster for training these models anymore. I did check earlier that I can use YOLOv4 weights, so the forward pass should be correct. But there are lots of details involved in model training, like mosaic and copy-paste augmentation, and I feel like not all of the details are mentioned in the papers. I'm not even sure how exactly the models were tested (what data and resolution were used). I also found gradient clipping to be important, even though it's not used in the papers. Maybe @FateScript can comment on what we're still missing?
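Regarding gradient clipping, here's a minimal, self-contained illustration of clipping the global gradient norm before the optimizer step (the model, data and max_norm value are arbitrary placeholders, not what was used for YOLO training):

```python
import torch
from torch import nn

# Toy model and data, just to make the snippet runnable.
model = nn.Linear(4, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
inputs, targets = torch.randn(8, 4), torch.randn(8, 2)

loss = nn.functional.mse_loss(model(inputs), targets)
loss.backward()
# Rescale gradients so that their global L2 norm does not exceed max_norm.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=10.0)
optimizer.step()
optimizer.zero_grad()
```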
@senarvi thanks for the details, I will take a look at implementing those augmentations. It definitely seems like we don't get all the details in the papers.
Yeah, it will be interesting to see if we can get all the augmentations and training tricks exactly as in the papers. That way we could get a fair comparison between YOLO and other architectures. I think YOLOv7 uses a 1280x1280 input size, which consists of four 640x640 tiles. So for each network input you sample four images, which makes things a bit more complicated. For the copy-paste augmentation we need segmentation masks in addition to the bounding boxes. I think during testing they sort the images so that the images in each batch are as similar in size as possible, so that they don't have to be resized as much. Still, if I'm right, the test results vary a little bit depending on the batch size.
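For concreteness, a rough sketch of how the four-tile mosaic could be assembled (a hypothetical helper; it assumes the four images are already resized to the tile size and the boxes are float tensors in (x1, y1, x2, y2) pixel coordinates):

```python
from typing import List, Tuple

import torch
from torch import Tensor


def mosaic(images: List[Tensor], boxes: List[Tensor], tile: int = 640) -> Tuple[Tensor, Tensor]:
    # Stitch four tile x tile images into one 2*tile x 2*tile canvas and shift
    # each tile's box coordinates by the offset of its top-left corner.
    canvas = torch.zeros(3, 2 * tile, 2 * tile)
    offsets = [(0, 0), (0, tile), (tile, 0), (tile, tile)]  # (dy, dx) per tile
    shifted_boxes = []
    for image, image_boxes, (dy, dx) in zip(images, boxes, offsets):
        canvas[:, dy : dy + tile, dx : dx + tile] = image
        shifted = image_boxes.clone()
        shifted[:, 0::2] += dx  # x1 and x2
        shifted[:, 1::2] += dy  # y1 and y2
        shifted_boxes.append(shifted)
    return canvas, torch.cat(shifted_boxes)
```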
@senarvi
@discort I was left waiting for someone to review this PR. I'm not sure if there's interest in merging this into torchvision... there hasn't been much activity in 1.5 years. More or less the same implementation was merged into lightning-bolts, though: https://github.com/Lightning-Universe/lightning-bolts/tree/master/src/pl_bolts/models/detection/yolo

I'm still happy to help with the code if there's interest, but I don't work for Groke Technologies anymore and I don't have the computational resources, so I cannot really continue developing the features that we're still missing to achieve better performance on the COCO dataset.

I did go through other YOLO implementations, however, and noticed one detail that has been added at some point and that I have missed: Task Alignment Learning (TAL). Assigning ground-truth boxes to anchors is based on a metric that combines both the score given to the correct class and the IoU. In case someone has the resources to continue the development, that's one thing to look at.
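To make that concrete, the TAL alignment metric from the TOOD paper combines the classification score of the target class with the IoU roughly like this (a sketch; the alpha and beta defaults follow the paper and are not something tuned for this PR):

```python
import torch
from torch import Tensor


def tal_alignment_metric(cls_scores: Tensor, ious: Tensor, alpha: float = 1.0, beta: float = 6.0) -> Tensor:
    # Alignment metric t = s**alpha * u**beta, where s is the predicted score
    # of the ground-truth class and u is the IoU with the ground-truth box.
    # Anchors with the highest t get assigned to that ground-truth box.
    return cls_scores.pow(alpha) * ious.pow(beta)
```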
A generic YOLO implementation that supports the most important features of YOLOv3, YOLOv4, YOLOv5, YOLOv7, Scaled-YOLOv4, and YOLOX. It includes networks written in PyTorch, but the user can also load a network from a Darknet configuration file. Features such as matching predictions to targets have been implemented in a modular way, so that they can easily be replaced or reused in different models. Target class labels may be specified as a matrix of class probabilities, allowing multi-label classification. Includes unit tests and complete type hints for static type checking.
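As an illustration of the probability-matrix targets (the key names follow the usual torchvision detection convention and are assumptions here, not a specification of this implementation's exact interface):

```python
import torch

# One training image with two boxes and three classes. Instead of integer
# class indices, "labels" is a (num_boxes, num_classes) probability matrix,
# so the second box can belong to two classes at once.
target = {
    "boxes": torch.tensor([[10.0, 20.0, 110.0, 220.0],
                           [50.0, 60.0, 150.0, 260.0]]),  # (x1, y1, x2, y2)
    "labels": torch.tensor([[1.0, 0.0, 0.0],
                            [0.0, 1.0, 1.0]]),
}
```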
This code is contributed with the permission of my employer, Groke Technologies.
Fixes #6341