## 🐛 Bug

I ran out of memory (GPU) when computing the perplexity metric and would like to propose a small optimization to decrease its memory utilization.

## To Reproduce

When running the following code, PyTorch tries to allocate 1024 GB of GPU memory on my system:

```python
import torch
from torchmetrics.text import Perplexity

gen = torch.manual_seed(42)
preds = torch.rand(512, 1024, 12, generator=gen).cuda()
target = torch.randint(12, (512, 1024), generator=gen).cuda()
perp = Perplexity().cuda()
print(perp(preds, target))
```

## Memory Inefficiency

I think the inefficiency is in this line:

torchmetrics/src/torchmetrics/functional/text/perplexity.py, line 94 in a68455a

`probs[:, target]` results in a large temporary tensor with `(512*1024)^2` elements, of which only the diagonal values are used afterwards. At 4 bytes per float32 element, that temporary is 2^40 bytes, which is exactly the 1024 GB allocation above.

## Potential Solution

In contrast,

```python
probs = probs[torch.arange(target.numel()), target][mask]
```

would only require memory of the size of `target`.

Would you consider accepting a pull request with this optimization? Or was the previous implementation chosen for another reason?
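The equivalence of the two indexing forms can be checked on CPU with small stand-in shapes (the 4×8 batch/sequence and vocabulary of 12 below are illustrative assumptions, not the issue's original sizes; the `[mask]` filtering for ignored tokens is omitted here):

```python
import torch

# Small stand-ins for the issue's shapes so everything fits in memory.
gen = torch.manual_seed(42)
probs = torch.rand(4 * 8, 12, generator=gen)          # flattened (batch*seq, vocab)
target = torch.randint(12, (4 * 8,), generator=gen)   # flattened target indices

# Current approach: advanced indexing builds an (N, N) temporary,
# of which only the diagonal (probs[i, target[i]]) is kept.
old = probs[:, target].diagonal()

# Proposed approach: pick row i's entry at column target[i] directly,
# producing an (N,)-sized result with no (N, N) temporary.
new = probs[torch.arange(target.numel()), target]

assert torch.equal(old, new)
```

With the issue's real sizes, N = 512·1024, so the `(N, N)` temporary is what triggers the terabyte-scale allocation, while the proposed form allocates only `N` elements.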
Hi! Thanks for your contribution, great first issue!
Commit a45a3b0 ("multi-gpu perplexity metric") required an optimization in torchmetrics; see Lightning-AI/torchmetrics#2337.
Just created PR #2346 with the (small) change. Feel free to merge it if you like.