
fix nDCG can not be called with negative relevance targets #378

Merged
merged 26 commits into from
Jul 28, 2021

Conversation

paul-grundmann
Contributor

@paul-grundmann paul-grundmann commented Jul 16, 2021

Before submitting

  • Was this discussed/approved via a Github issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure to update the docs? (No update needed)
  • Did you write any new necessary tests?

What does this PR do?

Fixes #377

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Of course I had fun fixing this issue :)

@Borda Borda changed the title Should fix issue #377 - fix nDCG can not be called with negative relevance targets Jul 16, 2021
Member

@Borda Borda left a comment


Can we also add a test to cover this behaviour, as nothing was failing until now?

@Borda Borda added the bug / fix Something isn't working label Jul 16, 2021
@codecov

codecov bot commented Jul 16, 2021

Codecov Report

Merging #378 (84d7be6) into master (e00e3ab) will increase coverage by 0.00%.
The diff coverage is 100.00%.


@@           Coverage Diff           @@
##           master     #378   +/-   ##
=======================================
  Coverage   96.44%   96.45%           
=======================================
  Files         120      120           
  Lines        3801     3804    +3     
=======================================
+ Hits         3666     3669    +3     
  Misses        135      135           
Flag Coverage Δ
Linux 76.49% <100.00%> (+0.04%) ⬆️
Windows 76.49% <100.00%> (+0.04%) ⬆️
cpu 95.71% <100.00%> (-0.69%) ⬇️
gpu 96.37% <100.00%> (+<0.01%) ⬆️
macOS 95.71% <100.00%> (-0.69%) ⬇️
pytest 96.45% <100.00%> (+<0.01%) ⬆️
python3.6 95.58% <100.00%> (+<0.01%) ⬆️
python3.8 95.66% <100.00%> (-0.74%) ⬇️
python3.9 ?
torch1.3.1 95.58% <100.00%> (+<0.01%) ⬆️
torch1.4.0 95.66% <100.00%> (+<0.01%) ⬆️
torch1.9.0 ?

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
torchmetrics/functional/retrieval/ndcg.py 100.00% <100.00%> (ø)
torchmetrics/utilities/checks.py 92.00% <100.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update e00e3ab...84d7be6.

@SkafteNicki
Member

@paul-grundmann could you try changing the target line to `target=torch.randint(low=-2, high=4...` to also check for negative values?
https://github.com/PyTorchLightning/metrics/blob/a4d2a7a81ba482f28edd30bd76abc3cffbaca548/tests/retrieval/inputs.py#L35-L39
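Such a change to the input fixture might look like this (a sketch; the actual constants in `tests/retrieval/inputs.py` may differ, and the shape names here are assumptions):

```python
import torch

# Assumed shapes; the real fixture uses constants defined by the test suite
NUM_BATCHES, BATCH_SIZE = 4, 32

# Drawing from [-2, 4) exercises negative relevance targets as well
target = torch.randint(low=-2, high=4, size=(NUM_BATCHES, BATCH_SIZE))
```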

mergify bot and others added 3 commits July 16, 2021 21:43
@Borda
Member

Borda commented Jul 20, 2021

@paul-grundmann mind checking the failing tests and #378 (comment)?

@paul-grundmann
Contributor Author

It seems that the random generation of inputs can lead to invalid inputs for the calculation of the DCG (e.g. a single target with 0 relevance). This leads to a division by zero:

return _dcg(sorted_target) / _dcg(ideal_target) # _dcg(ideal_target) == 0

Should I add a test in the nDCG calculation if the ideal DCG is zero and then return 0.0 as a result?
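Such a guard might look like this (a minimal sketch; `_dcg` here is a simplified stand-in for the real helper, using a plain linear-gain DCG, and `retrieval_normalized_dcg` is written without the input validation the library performs):

```python
import torch

def _dcg(target: torch.Tensor) -> torch.Tensor:
    # DCG of an already-sorted relevance vector: sum(rel_i / log2(i + 2))
    denom = torch.log2(torch.arange(target.shape[-1], dtype=torch.float) + 2.0)
    return (target.to(torch.float) / denom).sum(dim=-1)

def retrieval_normalized_dcg(preds: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    sorted_target = target[torch.argsort(preds, descending=True)]
    ideal_target = torch.sort(target, descending=True).values
    ideal_dcg = _dcg(ideal_target)
    if ideal_dcg == 0:
        # No relevant documents at all: define nDCG as 0 instead of dividing by zero
        return torch.tensor(0.0)
    return _dcg(sorted_target) / ideal_dcg
```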

@SkafteNicki
Member

It seems to me that maybe the nDCG should be changed to account for division by zero. Looking at sklearn's implementation:
https://github.com/scikit-learn/scikit-learn/blob/2beed55847ee70d363bdbfe14ee4401438fba057/sklearn/metrics/_ranking.py#L1458-L1466
they set it to 0 when the denominator is 0. @lucadiliello any opinion here?

@lucadiliello
Contributor

> It seems that the random generation of inputs can lead to invalid inputs for the calculation of the DCG (e.g. a single target with 0 relevance). This leads to a division by zero:
>
> return _dcg(sorted_target) / _dcg(ideal_target) # _dcg(ideal_target) == 0
>
> Should I add a test in the nDCG calculation if the ideal DCG is zero and then return 0.0 as a result?

Yes, I think the functional implementation should behave like the sklearn version, so it should return 0. However, now that you are also allowing negative targets, I think this line should be changed.

@mergify mergify bot added ready and removed ready labels Jul 26, 2021
paul-grundmann and others added 3 commits July 26, 2021 19:27
- Used the scikit-learn implementation of nDCG
- Removed the test for non-binary targets in test_ndcg.py and replaced the default parameters in the error test with a custom one that does not check for binary targets
- Set the low value of _input_retrieval_scores_non_binary_target to -1 to reduce the test failure rate
@paul-grundmann
Contributor Author

Ok, I had some time to play around, since the recent code change introduced a lot more failing tests.
On the one hand, it seems that scikit-learn also does not handle the edge case of the DCG being zero. Typically this happens for k=1 with a relevance target of 0 at index 0, which of course produces failing tests.

On the other hand, there is the current architecture of the tests: in the helpers.py script there are tests for the errors we test against. In the case of nDCG, _errors_test_functional_metric_parameters_default contains a check for binary values, so I added a new parameter definition which is mostly identical to the default parameters but without the check for binary targets.
I am not so fluent with pytest, so I don't know whether there is a more elegant solution for that.

I think the first problem is the more serious one, because the tests can actually fail randomly. With relevance targets of only -1 instead of -2 I had a lot of successful test runs, but still one or two failing at some point. Maybe the inputs need to be generated differently, or at least with a check that the targets for k=1 contain at least one relevant sample.
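One way to avoid those random failures would be to resample degenerate targets, so every generated batch contains at least one positive relevance (a sketch; the helper name is an assumption, not part of the test suite):

```python
import torch

def sample_targets_with_relevant_doc(size, low=-1, high=4):
    # Hypothetical helper: redraw until at least one target is positive,
    # so the ideal DCG cannot be zero even for k=1
    target = torch.randint(low=low, high=high, size=size)
    while not (target > 0).any():
        target = torch.randint(low=low, high=high, size=size)
    return target
```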

@mergify mergify bot added the ready label Jul 26, 2021
- removed unused imports in ndcg.py
Member

@Borda Borda left a comment


LGTM 🐰

@mergify mergify bot added ready and removed ready labels Jul 28, 2021
@pep8speaks

pep8speaks commented Jul 28, 2021

Hello @paul-grundmann! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-07-28 16:07:39 UTC

@SkafteNicki SkafteNicki enabled auto-merge (squash) July 28, 2021 12:59
@Borda Borda disabled auto-merge July 28, 2021 13:35
@Borda Borda enabled auto-merge (squash) July 28, 2021 13:35
@mergify mergify bot removed the ready label Jul 28, 2021
@Borda Borda merged commit b1062c9 into Lightning-AI:master Jul 28, 2021
@mergify mergify bot added the ready label Jul 28, 2021
@paul-grundmann paul-grundmann deleted the patch-1 branch July 29, 2021 07:16
@Borda Borda added this to the v0.5 milestone Aug 18, 2021
Successfully merging this pull request may close these issues: Functional nDCG can not be called with negative relevance targets (#377).