Describe the bug
After inference has been working cleanly and without problems for a while, this suddenly appears:
[...] generator.iterate()
^^^^^^^^^^^^^^^^^^^
File "exllamav2/exl2_env/lib/python3.11/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "exllamav2/generator/dynamic.py", line 973, in iterate
self.iterate_gen(results)
File "exllamav2/generator/dynamic.py", line 1213, in iterate_gen
job.receive_logits(job_logits)
File "exllamav2/generator/dynamic.py", line 1817, in receive_logits
ExLlamaV2Sampler.sample(
File "exllamav2/generator/sampler.py", line 434, in sample
ExLlamaV2Sampler.apply_dry(settings, tokenizer, sequence_ids, logits)
File "exllamav2/generator/sampler.py", line 272, in apply_dry
logits.scatter_add_(-1, indices, penalties)
RuntimeError: index 1000000000 is out of bounds for dimension 2 with size 131072
131072 is my total context/cache size. What is going wrong here? Thanks a lot!
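The error itself is easy to reproduce in isolation: scatter_add_ bounds-checks every index against the size of the target dimension, so any token id at or above that size raises exactly this RuntimeError. A minimal toy sketch (tiny vocab of 8 instead of 131072, standing in for an out-of-vocab token id):

```python
import torch

# Toy reproduction: vocab of size 8, but one index far outside it,
# mirroring how an out-of-vocab token id breaks scatter_add_.
logits = torch.zeros(1, 1, 8)                    # (batch, seq, vocab)
indices = torch.tensor([[[3, 1_000_000_000]]])   # second id has no logit slot
penalties = torch.ones(1, 1, 2)

try:
    logits.scatter_add_(-1, indices, penalties)
except RuntimeError as e:
    print(e)  # index 1000000000 is out of bounds for dimension 2 with size 8
```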
Reproduction steps
Not easy to reproduce; it happens after some (heavy) usage.
Expected behavior
generator.iterate() should operate without error
Logs
No response
Additional context
No response
Acknowledgements
I have looked for similar issues before submitting this one.
I understand that the developers have lives and my issue will be answered when possible.
I understand the developers of this program are human, and I will ask my questions politely.
I'm not able to reproduce this, but I have an idea as to why it happens. The index of 1000000000 suggests it's trying to penalize an image token. Is this happening with a vision model?
Turns out it's just a special case I hadn't considered. When the same image appears multiple times in a context and you're using DRY, the sampler tries to apply a penalty to image tokens, and since they're not represented in the logits you get an out-of-bounds error. Should be fixed in dev with the latest commit.
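The fix described above amounts to not scattering penalties onto ids that have no slot in the logits. A minimal sketch of that idea (hypothetical helper and shapes for illustration, not the actual dev commit): mask out any ids at or above the vocab size and zero their penalties before the scatter.

```python
import torch

def apply_penalty_safe(logits: torch.Tensor,
                       indices: torch.Tensor,
                       penalties: torch.Tensor) -> torch.Tensor:
    # logits: (batch, seq, vocab). indices may contain special ids
    # (e.g. image tokens like 1_000_000_000) with no logit slot.
    vocab = logits.shape[-1]
    in_vocab = indices < vocab
    # Redirect out-of-vocab ids to index 0 and zero their penalty,
    # so the scatter_add_ is a no-op for them.
    safe_indices = torch.where(in_vocab, indices, torch.zeros_like(indices))
    safe_penalties = penalties * in_vocab.to(penalties.dtype)
    logits.scatter_add_(-1, safe_indices, safe_penalties)
    return logits
```

With this guard, a sequence containing repeated image tokens simply contributes no penalty for those ids instead of crashing.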
OS
Linux
GPU Library
CUDA 12.x
Python version
3.11
Pytorch version
2.5.1
Model
No response