Skip to content

[SW-224874] Reduce index_copy to fp8 in llama2 - QDQ flow#2065

Merged
astachowiczhabana merged 1 commit into
huggingface:v1.19-releasefrom
HabanaAI:rtiefenbrunn/SW-224874_IndexCopy_fp8
Jun 25, 2025
Merged

[SW-224874] Reduce index_copy to fp8 in llama2 - QDQ flow#2065
astachowiczhabana merged 1 commit into
huggingface:v1.19-releasefrom
HabanaAI:rtiefenbrunn/SW-224874_IndexCopy_fp8

Conversation

@Tiefen-boop
Copy link
Copy Markdown
Contributor

No description provided.

Copy link
Copy Markdown
Collaborator

@mandy-li mandy-li left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@mandy-li mandy-li requested a review from regisss June 18, 2025 14:09
@Tiefen-boop
Copy link
Copy Markdown
Contributor Author

@regisss can you review and merge?

@astachowiczhabana astachowiczhabana merged commit 654700f into huggingface:v1.19-release Jun 25, 2025
1 check passed
@Tiefen-boop Tiefen-boop deleted the rtiefenbrunn/SW-224874_IndexCopy_fp8 branch July 1, 2025 07:27
astachowiczhabana pushed a commit to HabanaAI/optimum-habana-fork that referenced this pull request Jul 3, 2025
astachowiczhabana added a commit to HabanaAI/optimum-habana-fork that referenced this pull request Jul 8, 2025
astachowiczhabana added a commit to HabanaAI/optimum-habana-fork that referenced this pull request Jul 10, 2025
astachowiczhabana added a commit to HabanaAI/optimum-habana-fork that referenced this pull request Jul 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants