Whisper Model Optimization #15473
Merged: hanbitmyths merged 46 commits into microsoft:main from kunal-vaishnavi:kvaishnavi/whisper on Apr 19, 2023
Commits
bc95327 Work in progress (kunal-vaishnavi)
d95d4a3 Work in progress 2 (kunal-vaishnavi)
dc0c918 Work in progress 3 (kunal-vaishnavi)
aa36390 Work in progress 4 (kunal-vaishnavi)
65ea436 Work in progress 5 (kunal-vaishnavi)
c3b2564 Work in progress 6 (kunal-vaishnavi)
8c30983 Work in progress 7 (kunal-vaishnavi)
b3d1e26 Work in progress 8 (kunal-vaishnavi)
2a24376 Work in progress 9 (kunal-vaishnavi)
8aea1da Work in progress 10 (kunal-vaishnavi)
5de0331 Work in progress 11 (kunal-vaishnavi)
dedd007 Work in progress 12 (kunal-vaishnavi)
a7bff6b Work in progress 13 (kunal-vaishnavi)
2fa2201 Cleaning up comments (kunal-vaishnavi)
53416bf Cleaning up more comments (kunal-vaishnavi)
ea23e01 Merge branch 'main' into dev (kunal-vaishnavi)
bf7f23f Merge branch 'microsoft:main' into dev (kunal-vaishnavi)
7e7b19f Fixing few issues after merging with main (kunal-vaishnavi)
4bf560a Fix multihead attention flag (kunal-vaishnavi)
5ef69a5 Changing attention fusion in decoder with past to multihead attention… (kunal-vaishnavi)
09235ba Fix separating present KV into present K and present V (kunal-vaishnavi)
911768c Adding test cases, fusion changes, and kernel changes (kunal-vaishnavi)
f8389eb Removing commented out code (kunal-vaishnavi)
96e061c Remove QKV format assert (kunal-vaishnavi)
c106f32 Remove condition for memory efficient attention (kunal-vaishnavi)
b2f3d99 Adding onnx test files (kunal-vaishnavi)
406a5d9 Merge branch 'main' into dev (kunal-vaishnavi)
4003653 Add ORT return if error (kunal-vaishnavi)
d1aaa56 Fix allocator naming and casting (kunal-vaishnavi)
9b341cf Fix casting and remove extra parameter (kunal-vaishnavi)
ee32f88 Fix CodeQL scan errors and convert value to float (kunal-vaishnavi)
7d36cae Fix test cases (kunal-vaishnavi)
33299d1 Fix more test cases (kunal-vaishnavi)
0e5d42c Add whisper folder to build (kunal-vaishnavi)
8c2b2a4 Adding format changes suggested by linter (kunal-vaishnavi)
2b002ab Remove extra parenthesis (kunal-vaishnavi)
0388430 Adding more format changes suggested by linter (kunal-vaishnavi)
2a94bdf Adding space and comma suggestions from linter (kunal-vaishnavi)
97aaedb Fix allocator initialization (kunal-vaishnavi)
dbda09d Remove commented out line (kunal-vaishnavi)
b65e668 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
52e34c8 Remove packed qkv and simplify calculating present kv (kunal-vaishnavi)
a75c121 Add changes suggested by new linter (kunal-vaishnavi)
70eab06 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
6868e94 Add changes suggested by new C++ linter (kunal-vaishnavi)
bc17d24 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)