Sampling search UseKV cache till input seq len for prefill phase by puneeshkhanna · Pull Request #161 · HabanaAI/optimum-habana-fork

puneeshkhanna · 2024-04-12T08:32:19Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

* Sampling search UseKV cache till input seq len for prefill phase
* Remove redundant line


astachowiczhabana · 2024-06-11T07:09:45Z

huggingface#1028

* [Llama-vision] Add support for Fused RMS Norm
* fix constructor
  Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
* add reference in forward
* fix formatting
* Update optimum/habana/transformers/models/mllama/modeling_mllama.py
  Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
* organize imports

Co-authored-by: Jay Gala <jaygala@habana.ai>
Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

Sampling search UseKV cache till input seq len for prefill phase

4875b1e

puneeshkhanna requested review from bhargaveede, ssarkar2 and vivekgoe as code owners April 12, 2024 08:32

Remove redundant line

f13eea4

ghost approved these changes Apr 15, 2024


ghost merged commit 64efe5b into HabanaAI:habana-main Apr 15, 2024

astachowiczhabana pushed a commit that referenced this pull request Apr 19, 2024

Sampling search UseKV cache till input seq len for prefill phase (#161)

1e077eb

* Sampling search UseKV cache till input seq len for prefill phase
* Remove redundant line

astachowiczhabana pushed a commit that referenced this pull request Apr 22, 2024

Sampling search UseKV cache till input seq len for prefill phase (#161)

1b9b2ae

* Sampling search UseKV cache till input seq len for prefill phase
* Remove redundant line

astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024

Sampling search UseKV cache till input seq len for prefill phase (#161)

9ccd8a3

* Sampling search UseKV cache till input seq len for prefill phase
* Remove redundant line

astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024

Sampling search UseKV cache till input seq len for prefill phase (#161)

3e07be0

* Sampling search UseKV cache till input seq len for prefill phase
* Remove redundant line

zhanglirong1999 pushed a commit that referenced this pull request Apr 17, 2025

[Llama-vision] Add support for Fused RMS Norm (#161) (huggingface#1892)

dde8e29

Co-authored-by: Jay Gala <jaygala@habana.ai>
Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

This pull request was closed.

2 commits merged into HabanaAI:habana-main from puneeshkhanna:prefill_kvcache_sampling


2 participants
