-
Notifications
You must be signed in to change notification settings - Fork 584
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] Dense Mask IoU #5283
base: develop
Are you sure you want to change the base?
[WIP] Dense Mask IoU #5283
Conversation
Important Review skippedIgnore keyword(s) in the title. ⛔ Ignored keywords (3)
Please check the settings in the CodeRabbit UI or the You can disable this status message by setting the Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
Documentation and Community
|
@ehofesmann I know you mentioned there would be some edge cases regarding different pred and ground truth resolutions. Do you think my implementation covers this sufficiently? If not can you provide a small test case I can use? |
Very nice work! I haven't run this myself, but I do believe that this todo would need to be addressed in order to support preds and gts with different resolutions. You could try to test it with something like: gt_mask = np.zeros((90,90))
gt_mask[:60,:] = 1
pred_mask = np.zeros((50,50))
pred_mask[25:,:] = 1
gt = fo.Detection(bounding_box=[0.1,0.1,0.1,0.1], label="test", mask=gt_mask)
pred = fo.Detection(bounding_box=[0.1,0.1,0.1,0.1], label="test", mask=pred_mask)
fo.utils.iou._dense_iou(pred, gt) |
I'm not sure I understand the use case here. When do we have predictions and ground truths of different resolutions? And for the example you shared what is the expected behaviour |
I could definitely imagine a case where the ground truth labels were annotated on higher res images than what a segmentation model results. And also that 2 segmentation models might have resulting masks of different resolutions and you want to compare one with the other. For the example I would expect there to be an overlap of 1/6 (one mask covers the bottom 2/3 of a square image and the other mask the top 1/2) |
Got it, this should be handled correctly now. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code LGTM from a static review standpoint.
Note: I haven't run this myself; relying on you for implementation correctness.
Just to double confirm: as @ehofesmann pointed out, FO does not enforce nor assume that the masks for each object have the same resolution, so the IoU implementations must gracefully handle two Detection
instances whose masks are different sizes.
I added some documentation for this change here: 79080ff
@manushreegangwar @jacobsela could one of you validate the correctness of my implementation? Happy to provide guidance on how to test offline. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Haven't had a chance to verify by running code yet. Generally looks fine.
gt_size = gt_img_w * gt_img_h | ||
pred_size = pred_img_w * pred_img_h | ||
if gt_size > pred_size: | ||
pred_mask_full = etai.resize( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This resolves to cv2.resize
which by default uses linear interpolation. I'm pretty sure values in (0, 1) are treated as boolean True in np.logical_and
which may cause some strange edge case bugs in thin objects for example. I can check this more thoroughly later. Besides that looks good to me but I haven't run it yet.
What changes are proposed in this pull request?
Now when calling
fo.utils.iou.compute_ious(preds, gts, tolerance=None)
with pixel masks, the IoU will be computed using pixels instead of converting to contours. This is especially needed when trying to evaluate segmentation models which typically use dense IoU as a metric instead of one computed on contours.How is this patch tested? If it is not, please explain why.
(Details)
Release Notes
Is this a user-facing change that should be mentioned in the release notes?
notes for FiftyOne users.
(Details in 1-2 sentences. You can just refer to another PR with a description
if this PR is part of a larger change.)
What areas of FiftyOne does this PR affect?
fiftyone
Python library changes