Added check if reuse_cache supported by selected model_type by iermolae · Pull Request #675 · huggingface/optimum-habana

iermolae · 2024-01-31T11:14:21Z

Add a check if reuse_cache flag supported by selected model_type in GaudiGenerationMixin.generate()

Previously, if run_generation.py ran with other than "llama" model and --reuse_check there was an error:

AttributeError: 'GaudiMistralForCausalLM' object has no attribute 'allocate_kv_cache'

Now there is a more user-freindly message describing the problem:

AssertionError: reuse_cache only supported by llama at the moment

HuggingFaceDocBuilderDev · 2024-02-02T03:02:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

regisss

LGTM!

…ace#675)

huggingface#2254) (huggingface#675) Co-authored-by: Karol Brejna <karolbrejna@apache.org>

Added check if reuse_cache supported by selected model_type

a9341c5

iermolae requested review from bhargaveede, ssarkar2 and vivekgoe as code owners January 31, 2024 11:14

ssarkar2 approved these changes Jan 31, 2024

View reviewed changes

regisss added the run-test Run CI for PRs from external contributors label Feb 2, 2024

regisss approved these changes Feb 2, 2024

View reviewed changes

regisss merged commit 4077355 into huggingface:main Feb 2, 2024

jychen21 pushed a commit to jychen21/optimum-habana that referenced this pull request Feb 27, 2024

Added check if reuse_cache supported by selected model_type (huggingf…

a62f002

…ace#675)

HolyFalafel pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Mar 11, 2024

Added check if reuse_cache supported by selected model_type (huggingf…

4f8b2d2

…ace#675)

astachowiczhabana mentioned this pull request Jun 7, 2024

Added additional check to run with distributed enabled and world_size=1 (#96) HabanaAI/optimum-habana-fork#116

Merged

gplutop7 pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Oct 15, 2025

Fix CWE-561 Dead Code Vulnerability related to use_new_cache = False (

9348707

huggingface#2254) (huggingface#675) Co-authored-by: Karol Brejna <karolbrejna@apache.org>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added check if reuse_cache supported by selected model_type#675

Added check if reuse_cache supported by selected model_type#675
regisss merged 1 commit into
huggingface:mainfrom
iermolae:reuse_cache_check

iermolae commented Jan 31, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Feb 2, 2024

Uh oh!

regisss left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

iermolae commented Jan 31, 2024

Add a check if reuse_cache flag supported by selected model_type in GaudiGenerationMixin.generate()

Uh oh!

HuggingFaceDocBuilderDev commented Feb 2, 2024

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants