TypeError: LlavaLlamaForCausalLM.forward() got an unexpected keyword argument 'cache_position' #29426

Closed
Naozumi520 opened this issue Mar 4, 2024 · 23 comments

@Naozumi520

Naozumi520 commented Mar 4, 2024

System Info

  • transformers version: 4.39.0.dev0
  • Platform: Windows-10-10.0.22621-SP0
  • Python version: 3.11.8
  • Huggingface_hub version: 0.21.3
  • Safetensors version: 0.4.2
  • Accelerate version: 0.27.2
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.2.1+cu121 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using GPU in script?:
  • Using distributed or parallel set-up in script?:

Who can help?

@ArthurZucker @amyeroberts

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

  1. Run LLaVA via llava.serve.cli with the library versions listed above.
  2. Ask the model a question.
  3. The following error is returned:
<|im_start|>user
: hi
<|im_start|>assistant
: Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "C:\Users\Naozu\Desktop\AI\2.Models\Nazuna20240304\LLaVA\llava\serve\cli.py", line 128, in <module>
    main(args)
  File "C:\Users\Naozu\Desktop\AI\2.Models\Nazuna20240304\LLaVA\llava\serve\cli.py", line 98, in main
    output_ids = model.generate(
                 ^^^^^^^^^^^^^^^
  File "C:\Users\Naozu\AppData\Local\Programs\Python\Python311\Lib\site-packages\torch\utils\_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\Naozu\Desktop\AI\2.Models\Nazuna20240304\LLaVA\llava\model\language_model\llava_llama.py", line 137, in generate
    return super().generate(
           ^^^^^^^^^^^^^^^^^
  File "C:\Users\Naozu\AppData\Local\Programs\Python\Python311\Lib\site-packages\torch\utils\_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\Naozu\AppData\Local\Programs\Python\Python311\Lib\site-packages\transformers\generation\utils.py", line 1597, in generate
    result = self.sample(
             ^^^^^^^^^^^^
  File "C:\Users\Naozu\AppData\Local\Programs\Python\Python311\Lib\site-packages\transformers\generation\utils.py", line 2711, in sample
    outputs = self(
              ^^^^^
  File "C:\Users\Naozu\AppData\Local\Programs\Python\Python311\Lib\site-packages\torch\nn\modules\module.py", line 1511, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\Naozu\AppData\Local\Programs\Python\Python311\Lib\site-packages\torch\nn\modules\module.py", line 1520, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: LlavaLlamaForCausalLM.forward() got an unexpected keyword argument 'cache_position'

Expected behavior

Everything should work just fine.

@ArthurZucker
Collaborator

Hey! I think you should open this issue on the LLaVA repo instead, or just update your version of transformers!

@Naozumi520
Author

Hmm, that's weird; I installed transformers from a fork of the repo, so it should be the latest version...

@ArthurZucker
Collaborator

Could you provide a transformers-only reproducer? Here the call is wrapped inside \LLaVA\llava\model\language_model\llava_llama.py, and I am not sure what is in there!

@YujieLu10

YujieLu10 commented Mar 5, 2024

A workaround: add cache_position=None to the forward() method of the LlavaLlamaForCausalLM class.
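
For reference, a minimal sketch of that change, assuming LLaVA's forward signature roughly mirrors LlamaForCausalLM (the parameter list here is illustrative; only cache_position is the addition):

from transformers import LlamaForCausalLM

class LlavaLlamaForCausalLM(LlamaForCausalLM):
    def forward(
        self,
        input_ids=None,
        attention_mask=None,
        position_ids=None,
        past_key_values=None,
        inputs_embeds=None,
        labels=None,
        use_cache=None,
        images=None,
        return_dict=None,
        cache_position=None,  # accept (and ignore) the kwarg that newer generate() passes
    ):
        ...  # original LLaVA forward body unchanged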

@LsTam91

LsTam91 commented Mar 7, 2024

I get the same error when I fine-tune my Llama-type models, since I updated the transformers library.

@ArthurZucker
Collaborator

Again, I need a reproducer to be able to push a fix. Also, cache_position should no longer be required by the model and is initialized if not passed, see:

if cache_position is None:
    if isinstance(past_key_values, StaticCache):
        raise ValueError("cache_position is a required argument when using StaticCache.")
    cache_position = torch.arange(
        past_seen_tokens, past_seen_tokens + inputs_embeds.shape[1], device=inputs_embeds.device
    )

@amankumarhal

I had the same issue when I tried with transformers version 4.38.x and 4.39.x. I installed transformers==4.37.2 and it worked.
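
For anyone who wants to pin the same version, e.g.:

pip install transformers==4.37.2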

@ArthurZucker
Collaborator

Thanks @amankumarhal, but again, without a reproducer I am having a really hard time fixing this.

@2015aroras
Contributor

2015aroras commented Apr 2, 2024

This reproduces for me with a different model using the following code. I imagine it reproduces with many models.

from transformers import TextGenerationPipeline
from transformers.models.auto import AutoModelForCausalLM, AutoTokenizer

model_path = "allenai/OLMo-1B"
model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
pipeline = TextGenerationPipeline(model=model, tokenizer=tokenizer)
output = pipeline("question: who wrote romeo and juliet? answer: ", max_new_tokens=30)

I believe the issue is caused by 23db187 at src/transformers/generation/utils.py line 1940. This introduced "cache_position" into model_kwargs (as I saw when stepping through the above repro with my debugger).
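
To illustrate the failure mode with a toy sketch (not the actual transformers code): generate() now puts cache_position into model_kwargs, so any forward() that neither declares that parameter nor accepts **kwargs fails with exactly this TypeError.

import torch

class OldStyleModel:
    # mimics a remote-code / LLaVA-style model whose forward() predates cache_position
    def forward(self, input_ids=None, attention_mask=None, past_key_values=None):
        return input_ids

model_kwargs = {
    "input_ids": torch.tensor([[1, 2, 3]]),
    "cache_position": torch.arange(3),  # injected by the newer generation code
}

# Raises: TypeError: OldStyleModel.forward() got an unexpected keyword argument 'cache_position'
OldStyleModel().forward(**model_kwargs)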

@ArthurZucker
Collaborator

Oh, this is using custom code (remote code). The issue should be opened and fixed on the OLMo repo. Otherwise, just wait for #29885 to be merged, which will add native support!

@vanitech

vanitech commented Apr 8, 2024

+1, encountered this error when trying to use OLMo.

@ChenRan2000

+1 encountered this error.

@ArthurZucker
Collaborator

OLMo is not part of transformers yet; it will be with #29885.

@vanitech

The workaround that got rid of the error for me was pinning an older version of transformers:
%pip install transformers==4.38

@ArthurZucker
Collaborator

OLMo is in transformers now.

@MaximilianKr

MaximilianKr commented Apr 19, 2024

Got the same error for OLMo-1B with Transformers 4.39.3

OLMoForCausalLM.forward() got an unexpected keyword argument 'cache_position'

edit: I was using conda, where Transformers 4.40.0 is not yet available. Reinstalling the most recent Transformers 4.40.0 via pip led to the following error message when importing hf_olmo:

ValueError: 'olmo' is already used by a Transformers config, pick another name.

@2015aroras
Contributor

Please use the -hf versions of OLMo (e.g. https://huggingface.co/allenai/OLMo-1.7-7B-hf) for Transformers 4.40.0 onwards (OLMo just got integrated into transformers!).
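
A minimal loading sketch for one of those -hf checkpoints with plain transformers (no trust_remote_code; model ID taken from the link above, transformers 4.40.0+ assumed):

from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_id = "allenai/OLMo-1.7-7B-hf"  # natively supported from transformers 4.40.0
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(generator("question: who wrote romeo and juliet? answer: ", max_new_tokens=30))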

@segalinc

I also opened an issue in the LLaVA repo: haotian-liu/LLaVA#1448


This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@Keyan-Hu

Transformers 4.37.2 OK!

@ChL-eng

ChL-eng commented Jun 12, 2024

Transformers 4.37.2 OK!

It works!

@ArthurZucker
Collaborator

Closing then! Thanks 🤗

@KevinXu-01

A workaround: add cache_position=None to the forward() method of the LlavaLlamaForCausalLM class.

It works for me. Thanks!!!
