Sarkar/mistral optimizations#804
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
regisss
left a comment
There was a problem hiding this comment.
I left several comments. Can you also provide a command that I can use for testing please?
command line to test (as tested on gaudi 2): #cfg 2: #cfg 3: #cfg 4 |
|
|
||
| def reorder_kv_cache(self, beam_idx: torch.LongTensor): | ||
| if self.past_key is None: | ||
| # if not hasattr(self, 'past_key'): |
There was a problem hiding this comment.
The commented line is still there
What does this PR do?
Pull in optimizations from llama for mistral
Fixes # (issue)
Before submitting