Skip to content

Don't include padding in first token latency time#297

Merged
astachowiczhabana merged 1 commit into
habana-mainfrom
dev/astachowicz/no_padding_in_first_token
Jul 16, 2024
Merged

Don't include padding in first token latency time#297
astachowiczhabana merged 1 commit into
habana-mainfrom
dev/astachowicz/no_padding_in_first_token

Conversation

@astachowiczhabana
Copy link
Copy Markdown

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Copy link
Copy Markdown

@afierka-intel afierka-intel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Didn't read.

@astachowiczhabana
Copy link
Copy Markdown
Author

Unmatched PR

xinyu-intel pushed a commit that referenced this pull request Mar 4, 2025
Change-Id: Ie1098cb7d75ee4c52290fde359769fd4cf2ece1b
astachowiczhabana added a commit that referenced this pull request May 26, 2025
…x7B (torch.compile) (v2) (#297)

* [SW-224449] Add an option to specialize on float inputs for Mixtral 8x7B (torch.compile) (#286)

* Add option to specialize_float in torch_dynamo for text-generation

* update README for --dynamo_specialize_float

* Update examples/text-generation/README.md

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

* Update examples/text-generation/run_generation.py

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>

---------

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
Co-authored-by: Adam Stachowicz <105052242+astachowiczhabana@users.noreply.github.com>

* Revert "[SW-224449] Add an option to specialize on float inputs for Mixtral 8…" (#296)

This reverts commit 88579fb1ca3a369793e3713576b155e85a39d0a6.

* Add an option to specialize on float inputs for Mixtral 8x7B (torch.compile) v2

---------

Co-authored-by: Yaser Afshar <yaser.afshar@intel.com>
Co-authored-by: Adam Stachowicz <105052242+astachowiczhabana@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants