BinaryAccuracy() sometimes gives incorrect answers due to non-deterministic sigmoiding #1604

Open
idc9 opened this issue Mar 9, 2023 · 2 comments · May be fixed by #1676
Labels: bug / fix (Something isn't working), help wanted (Extra attention is needed), v1.1.x
Milestone: v1.3.x


idc9 commented Mar 9, 2023

🐛 Bug

torchmetrics.classification.BinaryAccuracy will apply a sigmoid to some inputs but not others, leading to incorrect behavior.

Details

The current behavior of BinaryAccuracy() is to apply a sigmoid transformation before binarizing, but only if the inputs fall outside [0, 1]. From the docs:

If preds is a floating point tensor with values outside [0,1] range we consider the input to be logits and will auto apply sigmoid per element.

i.e.

y_hat = 1(sigmoid(z) >= threshold)   if z is outside [0, 1]
y_hat = 1(z >= threshold)            if z is inside [0, 1]

I assume the "z inside [0, 1]" check is applied to the entire batch (i.e. if one element of the batch falls outside [0, 1], the sigmoid is applied to every element), as sketched below.
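A minimal sketch of that heuristic, assuming the batch-level check described above (the helper name _maybe_sigmoid is illustrative, not torchmetrics' actual internals):

import torch

def _maybe_sigmoid(preds: torch.Tensor) -> torch.Tensor:
    # Hypothetical sketch: if ANY element lies outside [0, 1], the whole
    # batch is treated as logits and sigmoided; otherwise the values are
    # passed through unchanged and later thresholded as probabilities.
    if preds.is_floating_point() and ((preds < 0) | (preds > 1)).any():
        return preds.sigmoid()
    return preds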

This will cause silent errors. In particular, a user who passes logits expects them to always be sigmoided. However, it is entirely possible for every logit in some batches to lie in [0, 1], in which case the input is not sigmoided and the thresholding is incorrect.

To Reproduce

Here is a simple example. Suppose our network outputs logits.

from torchmetrics.classification import BinaryAccuracy
from scipy.special import expit # expit = sigmoid
import numpy as np
import torch

This example should lead to a correct prediction:

probability_thresh = 0.5
logits = np.array([0.49])  # network output
target = np.array([1])

# logits of 0.49 give a probability of 0.62, indicating class 1, the correct prediction
expit(logits)                                       # array([0.62010643])
int(expit(logits) >= probability_thresh) == target  # True

BinaryAccuracy(), however, thinks it's an incorrect prediction:

# torchmetrics, however, thinks we have the incorrect prediction because it does NOT sigmoid the logits
ba = BinaryAccuracy(threshold=probability_thresh)
ba.forward(preds=torch.tensor(logits), target=torch.tensor(target))  # tensor(0.)
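To make the non-determinism concrete (a follow-up sketch, not part of the original report): if the batch-level heuristic works as described, adding a single out-of-range logit to the same batch flips the result.

# One out-of-range logit (3.0) makes the whole batch be treated as
# logits, so 0.49 is now sigmoided to 0.62 and scored as correct.
logits2 = np.array([0.49, 3.0])
target2 = np.array([1, 1])
ba2 = BinaryAccuracy(threshold=probability_thresh)
ba2.forward(preds=torch.tensor(logits2), target=torch.tensor(target2))  # tensor(1.)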

Suggested Fix

I suggest adding an argument indicating whether or not the input predictions are already sigmoided, so the inputs are either always sigmoided or never sigmoided.
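One hypothetical shape for such an argument (the preds_are_logits name and the subclass are purely illustrative, not an existing torchmetrics API):

import torch
from torchmetrics.classification import BinaryAccuracy

class ExplicitBinaryAccuracy(BinaryAccuracy):
    # Hypothetical wrapper: the caller declares the input format once, so
    # the sigmoid is either always applied or never applied, independent of
    # the values that happen to land in a given batch.
    def __init__(self, preds_are_logits: bool, **kwargs):
        super().__init__(**kwargs)
        self.preds_are_logits = preds_are_logits

    def update(self, preds: torch.Tensor, target: torch.Tensor) -> None:
        if self.preds_are_logits:
            # Sigmoided values land in [0, 1], so the base class will
            # threshold them directly and never re-sigmoid.
            preds = preds.sigmoid()
        super().update(preds, target)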


github-actions bot commented Mar 9, 2023

Hi! Thanks for your contribution, great first issue!


idc9 commented Mar 9, 2023

Looks like a similar thing happens in MultilabelAccuracy (https://torchmetrics.readthedocs.io/en/stable/classification/accuracy.html).
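If the same heuristic applies there, a minimal sketch mirroring the reproduction above (my numbers, not from the original comment):

from torchmetrics.classification import MultilabelAccuracy
import torch

# Both logits lie in [0, 1], so they are thresholded as probabilities:
# [0.49, 0.49] >= 0.5 -> [0, 0], giving accuracy 0 against targets [1, 1].
# Had they been treated as logits, both would sigmoid to 0.62 -> [1, 1].
mla = MultilabelAccuracy(num_labels=2, threshold=0.5)
mla(torch.tensor([[0.49, 0.49]]), torch.tensor([[1, 1]]))  # tensor(0.)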

@SkafteNicki SkafteNicki linked a pull request Mar 31, 2023 that will close this issue