Skip to content

Sampling search UseKV cache till input seq len for prefill phase#161

Merged
2 commits merged into
HabanaAI:habana-mainfrom
puneeshkhanna:prefill_kvcache_sampling
Apr 15, 2024
Merged

Sampling search UseKV cache till input seq len for prefill phase#161
2 commits merged into
HabanaAI:habana-mainfrom
puneeshkhanna:prefill_kvcache_sampling

Conversation

@puneeshkhanna
Copy link
Copy Markdown

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

@ghost ghost merged commit 64efe5b into HabanaAI:habana-main Apr 15, 2024
astachowiczhabana pushed a commit that referenced this pull request Apr 19, 2024
* Sampling search UseKV cache till input seq len for prefill phase

* Remove redundant line
astachowiczhabana pushed a commit that referenced this pull request Apr 22, 2024
* Sampling search UseKV cache till input seq len for prefill phase

* Remove redundant line
astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024
* Sampling search UseKV cache till input seq len for prefill phase

* Remove redundant line
astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024
* Sampling search UseKV cache till input seq len for prefill phase

* Remove redundant line
puneeshkhanna pushed a commit to puneeshkhanna/optimum-habana-fork that referenced this pull request May 2, 2024
…anaAI#161)

* Sampling search UseKV cache till input seq len for prefill phase

* Remove redundant line
@astachowiczhabana
Copy link
Copy Markdown

huggingface#1028

astachowiczhabana pushed a commit that referenced this pull request Mar 11, 2025
* [Llama-vision] Add support for Fused RMS Norm

* fix constructor

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

* add reference in forward

* fix formatting

* Update optimum/habana/transformers/models/mllama/modeling_mllama.py

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

* organize imports

---------

Co-authored-by: Jay Gala <jaygala@habana.ai>
Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
astachowiczhabana pushed a commit that referenced this pull request Mar 31, 2025
* [Llama-vision] Add support for Fused RMS Norm

* fix constructor

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

* add reference in forward

* fix formatting

* Update optimum/habana/transformers/models/mllama/modeling_mllama.py

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

* organize imports

---------

Co-authored-by: Jay Gala <jaygala@habana.ai>
Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
zhanglirong1999 pushed a commit that referenced this pull request Apr 17, 2025
Co-authored-by: Jay Gala <jaygala@habana.ai>
Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
This pull request was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants