Division-by-zero problem in low precision when using uqi, sdi, ssim and ms-ssim #2281

Closed
michael080808 opened this issue Dec 23, 2023 · 1 comment · Fixed by #2378
Labels
bug / fix Something isn't working help wanted Extra attention is needed v1.2.x

Comments


michael080808 commented Dec 23, 2023

🐛 Bug

Hi, guys! Thanks for your fast response and fix! I really appreciate that.
When I tried to compute the uqi function, I found that there are still cases where the denominators of the quotients become zero.
According to https://en.wikipedia.org/wiki/Variance, the variance can be written as

Var(X) = E[X^2] - (E[X])^2

I debugged the code to understand why I got NaN values and noticed that

sigma_pred_sq = output_list[2] - mu_pred_sq      # E[x^2] - (E[x])^2, i.e. Var(x)
sigma_target_sq = output_list[3] - mu_target_sq  # E[y^2] - (E[y])^2, i.e. Var(y)

produces very small negative values, around -5e-7. The computation yields a "negative variance" due to floating-point cancellation, which is mathematically impossible. Although there is an eps to avoid division by zero, the magnitude of the "negative variance" is much larger than eps, so it still leads to the NaN / division-by-zero problem. sdi uses the uqi results, so it produces the same NaN. ssim and ms-ssim use C1 and C2 to guard against this, but in practice they suffer from the same situation.
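To illustrate the cancellation, here is a minimal standalone sketch (not the torchmetrics code itself): for a nearly constant signal, the E[x^2] - (E[x])^2 shortcut can land at or below zero in float32 even though the true variance is positive.

import torch

# Nearly constant signal: the true variance is ~1e-6, far below the float32
# spacing around 1e4, so the subtraction suffers catastrophic cancellation.
x = torch.full((10_000,), 100.0, dtype=torch.float32)
x += torch.randn_like(x) * 1e-3

stable = ((x - x.mean()) ** 2).mean()      # two-pass formula, stays >= 0
shortcut = (x * x).mean() - x.mean() ** 2  # E[x^2] - (E[x])^2

print(stable.item())    # small positive value
print(shortcut.item())  # often exactly 0.0 or slightly negative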

To Reproduce

I'm sorry that I can't share the original data due to upload limitations: each file is 1.15 GB, which makes the problem very tricky to reproduce. If there is some other way to upload the data, I'd be glad to provide it. 😊

Expected behavior

I'm not sure how to solve the problem. torch.Tensor.unfold could compute the variance directly from its definition, but that would cost a huge amount of memory, which is not acceptable. I think it's better to add a check on the variance to prevent negative values from entering the quotients (see the sketch below). However, I don't know which is better: taking the absolute values of the negative results, or just setting them to zero.
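As one possible guard (an assumption on my side, not necessarily what the eventual fix in #2378 does), the variance terms could be clamped at zero before they reach the denominator:

import torch

def clamped_variance(e_x2: torch.Tensor, mu: torch.Tensor) -> torch.Tensor:
    # E[x^2] - (E[x])^2 can dip slightly below zero from cancellation;
    # clamping at zero keeps it from poisoning the downstream quotient.
    return torch.clamp(e_x2 - mu * mu, min=0.0)

One argument for clamping rather than abs(): the true variance in these cases is essentially zero, while the absolute value would feed the full magnitude of the rounding error back into the quotient.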

Environment

Name Version Build Channel
_anaconda_depends 2023.07 py311_0 https://repo.anaconda.com/pkgs/main
conda 23.9.0 py311haa95532_0 https://repo.anaconda.com/pkgs/main
python 3.10.13 he1021f5_0 defaults
pytorch 2.1.0 py3.10_cuda12.1_cudnn8_0 pytorch
pytorch-cuda 12.1 hde6ce7c_5 pytorch
pytorch-mutex 1.0 cuda pytorch
torchaudio 2.1.0 pypi_0 pypi
torchmetrics 1.2.1 pyhd8ed1ab_0 conda-forge
torchvision 0.16.0 pypi_0 pypi

eminbdr commented Dec 25, 2023

To solve this issue, one approach could be to use Fractions instead of floats to represent numbers with large denominators. Since tensors only work with floats and integers, you could create two separate tensors representing the numerators and the denominators, as sketched below.
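A rough sketch of that idea (hypothetical helper names, not an existing torchmetrics API): intermediate results are kept as (numerator, denominator) tensor pairs, and the single division happens only at the end, in float64.

import torch

# Hypothetical representation: a value is a (num, den) pair with value == num / den.
def mul_ratio(a, b):
    # Multiplying two ratios never performs a division.
    return a[0] * b[0], a[1] * b[1]

def resolve(num: torch.Tensor, den: torch.Tensor, eps: float = 1e-12) -> torch.Tensor:
    # One division at the very end, in float64, with an eps floor on the
    # (non-negative) denominator to avoid dividing by zero.
    return (num.double() / den.double().clamp(min=eps)).to(num.dtype)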
