Skip to content

Remove contingous call#849

Closed
libinta wants to merge 21 commits into
mainfrom
synapse_1.15addition3
Closed

Remove contingous call#849
libinta wants to merge 21 commits into
mainfrom
synapse_1.15addition3

Conversation

@libinta
Copy link
Copy Markdown
Collaborator

@libinta libinta commented Mar 29, 2024

What does this PR do?

The contingous call causes performance degradation for not reuse_cache case such as TGI.
Remove it for performance improvement.

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

regisss and others added 21 commits March 22, 2024 15:48
Co-authored-by: Dudi Lester <dlester@habana.ai>
Co-authored-by: Nir David <124874956+nirda7@users.noreply.github.com>
…817)

Co-authored-by: Dudi Lester <160421192+dudilester@users.noreply.github.com>
Co-authored-by: Yan Tomsinsky <73292515+Yantom1@users.noreply.github.com>
Co-authored-by: Yan Tomsinsky <73292515+Yantom1@users.noreply.github.com>
Co-authored-by: bgoldberg-habana <149692267+bgoldberg-habana@users.noreply.github.com>
Co-authored-by: Sun Choi <schoi@habana.ai>
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
…Gaudi2 (#779)

Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
Co-authored-by: Sun Choi <schoi@habana.ai>
Co-authored-by: Pankaj Dixit <pdixit@habana.ai>
Co-authored-by: Yeonsil Yoon <yyoon@habana.ai>
Co-authored-by: Vivek <vgoel@habana.ai>
Co-authored-by: Libin Tang <litang@habana.ai>
Co-authored-by: Shiv Kaul <skaul@habana.ai>
Co-authored-by: Libin Tang <litang@habana.ai>
Co-authored-by: Yeonsil Yoon <yyoon@habana.ai>
…e case

such as TGI. Remove it for performance improvement.
@libinta libinta requested a review from mandy-li as a code owner March 29, 2024 07:26
@libinta libinta requested a review from a user March 29, 2024 07:26
Copy link
Copy Markdown
Contributor

@szutenberg szutenberg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This change does not solve the issue with performance regression in the TGI MLPerf scenario.

My proposal is to revert 844908d which introduced the regression. I'll push PR shortly.

Base automatically changed from synapse_1.15 to main March 29, 2024 23:03
@regisss regisss self-requested a review as a code owner March 29, 2024 23:03
@regisss regisss closed this Mar 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants