Should we control the pwh range for bbox regression from (0, 4) to (1/4, 4) ? by daikankan · Pull Request #8265 · ultralytics/yolov5

daikankan · 2022-06-20T07:29:01Z

where 2.7724 = 2 * 1.3862 = 2 * np.log(4), why not make the pwh in (1/4, 4) (instead of (0, 4)) for anchor_t=4?
After which the bbox regression should be more stable, below is my personal projects for comparison(not the official yolov5):

🛠️ PR Summary

_{Made with ❤️ by Ultralytics Actions}

🌟 Summary

Improved bounding box size prediction in YOLOv5 object detection models.

📊 Key Changes

Altered the computation of the width and height for bounding box predictions from using a squared scale factor to using an exponential function.
Updated the forward method in yolo.py and the loss calculation in utils/loss.py.

🎯 Purpose & Impact

Purpose: To enhance the accuracy of bounding box predictions by refining the way width and height are calculated.
Impact: Users can expect more accurate object detection, particularly in the size and scale of detected objects within an image. This may result in better performance for tasks where exact object dimensions are crucial. 🎯📈

glenn-jocher · 2022-06-21T13:35:33Z

@daikankan very interesting, thanks for sharing your results!

Something doesn't seem right about your equation though, it should be definable without an exp() function, especially since exp and sigmoid cancel each other out I believe.

Aside from that it is true that narrowing the range would provide less room for error especially in the early stages of training and possibly in later training also near the lower boundary.

glenn-jocher · 2022-06-21T13:37:11Z

@daikankan can you see if you can simplify this equation?

BTW we can not merge this PR currently as it would break all existing models but it might be suitable to merge at a new release that arrives with new models.

daikankan · 2022-06-22T04:01:27Z

@glenn-jocher thanks for your suggestion, but I think there is no way to simplify this equation, which is equivalent to pwh = torch.exp(torch.tanh(0.5 * ps[:, 2:4]) * 1.3862) * anchors[i].

LUO77123 · 2022-07-07T06:53:34Z

where 2.7724 = 2 * 1.3862 = 2 * np.log(4), why not make the pwh in (1/4, 4) (instead of (0, 4)) for anchor_t=4? After which the bbox regression should be more stable, below is my personal projects for comparison(not the official yolov5):

Hello, I want to try your scheme. I found that there are two places that need to be modified in yolo.py (y[..., 2:4] = (y[..., 2:4] * 2) ** 2 * self.anchor_grid[i]) and loss.py (pwh = (pwh.sigmoid() * 2) ** 2 * anchors[i]). Is that so, or are there other modifications I didn't find?

glenn-jocher · 2022-07-07T19:31:19Z

@daikankan we've implemented deterministic training now in PR #8213, so every training will be identical unless the seed is changed.

I've updated your branch with the latest changes from master including the #8213. Can you re-run your experiment and replot your results? This time the only changes will be due to the box regression change you made, there will be no more random differences between the two results.

daikankan · 2022-07-08T07:33:49Z

@LUO77123 yes you're right, maybe you can help to check the PR with experiments if you are interested, cause I have no extra GPU resource recently @glenn-jocher .

glenn-jocher · 2022-07-08T12:54:49Z

@daikankan got it! Yes I'll leave this open and run experiments when our GPU resources free up. I think this is a good change but I believe the equation must be able to be simplified. I haven't sat down to try to simplify it myself yet though, been super busy with repo maintenance.

LUO77123 · 2022-07-10T06:11:49Z

Can we try this? Change （y[..., 2:4] = (y[..., 2:4] * 2) ** 2 * self.anchor_grid[i]） into （y[..., 2:4] = (y[..., 2:4] * 3.75 + 0.25) * self.anchor_grid[i]）, which is linear growth rather than exponential growth. The formula is simplified.

glenn-jocher · 2022-07-10T21:38:02Z

@LUO77123 no. Nominal input should produce nominal output: 1.0 = fcn(0.5)

Your equation does not respect this constraint.

LUO77123 · 2022-07-11T02:10:42Z

@LUO77123 no. Nominal input should produce nominal output: 1.0 = fcn(0.5)

Your equation does not respect this constraint.

I get the function of square and cube by fitting the curve with Matlab
square:
(-0.17662+2.121x)^2+0.218806

cube:
(0.277629+1.27898x)^3+0.228605

glenn-jocher · 2022-07-11T13:27:50Z

@LUO77123 thanks! Can you plot these from x=0 to x=4 against the current equation?

LUO77123 · 2022-07-12T01:59:31Z

(-0.17662+2.121_x)^2+0.218806_

I drew four kinds of diagrams through Matlab, the original 0-4, and the remaining three 0.25-4, which are marked. Do you think it's ok?

daikankan · 2022-07-12T07:34:05Z

@glenn-jocher I think the most important principle is the constraint mu(torch.log(pwh)) = 0, for pwh in range (1/4, 4).
For instance, suppose x is in standard normal distribution, then we get the comparision:

glenn-jocher · 2022-07-12T15:05:25Z

@daikankan really interesting point, the orange data is much better distributed...

github-actions · 2023-03-22T00:21:03Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions YOLOv5 🚀 and Vision AI ⭐.

github-actions · 2023-10-03T00:21:19Z

👋 Hello there! We wanted to let you know that we've decided to close this pull request due to inactivity. We appreciate the effort you put into contributing to our project, but unfortunately, not all contributions are suitable or aligned with our product roadmap.

We hope you understand our decision, and please don't let it discourage you from contributing to open source projects in the future. We value all of our community members and their contributions, and we encourage you to keep exploring new projects and ways to get involved.

For additional resources and information, please see the links below:

Docs: https://docs.ultralytics.com
HUB: https://hub.ultralytics.com
Community: https://community.ultralytics.com

Thank you for your contributions to YOLO 🚀 and Vision AI ⭐

change pwh distribution.

6050fd4

Merge branch 'master' into master

ef703b9

Merge branch 'master' into master

4666708

github-actions bot added the Stale Stale and schedule for closing soon label Mar 22, 2023

github-actions bot removed the Stale Stale and schedule for closing soon label Apr 10, 2023

github-actions bot added the Stale Stale and schedule for closing soon label Oct 3, 2023

github-actions bot closed this Nov 3, 2023

Uh oh!

Conversation

daikankan commented Jun 20, 2022 • edited by UltralyticsAssistant Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🛠️ PR Summary

🌟 Summary

📊 Key Changes

🎯 Purpose & Impact

Uh oh!

glenn-jocher commented Jun 21, 2022

Uh oh!

glenn-jocher commented Jun 21, 2022

Uh oh!

daikankan commented Jun 22, 2022

Uh oh!

LUO77123 commented Jul 7, 2022

Uh oh!

glenn-jocher commented Jul 7, 2022

Uh oh!

daikankan commented Jul 8, 2022

Uh oh!

glenn-jocher commented Jul 8, 2022

Uh oh!

LUO77123 commented Jul 10, 2022

Uh oh!

glenn-jocher commented Jul 10, 2022

Uh oh!

LUO77123 commented Jul 11, 2022

Uh oh!

glenn-jocher commented Jul 11, 2022

Uh oh!

LUO77123 commented Jul 12, 2022

Uh oh!

daikankan commented Jul 12, 2022

Uh oh!

glenn-jocher commented Jul 12, 2022

Uh oh!

github-actions bot commented Mar 22, 2023

Uh oh!

github-actions bot commented Oct 3, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

daikankan commented Jun 20, 2022 •

edited by UltralyticsAssistant

Loading