Whisper Redesigned Solution #1229
Open
kunal-vaishnavi wants to merge 81 commits into main from kvaishnavi/whisper
+132,384
−1,027
Commits
c2c8745 Rename Whisper encoder input to audio features (kunal-vaishnavi)
1d5f4f0 Initial commit for new export (kunal-vaishnavi)
5bf4628 Fix KV cache initialization and runtime bugs (kunal-vaishnavi)
3cb936e Add another check for alignment heads input (kunal-vaishnavi)
b648f58 Dump logits in ORT GenAI (kunal-vaishnavi)
2a5b762 Fix cross QK update (kunal-vaishnavi)
e24db74 Fix finalize cross QK (kunal-vaishnavi)
e4c838e Save checkpoint for working solution (kunal-vaishnavi)
3a548a1 Clean up code (kunal-vaishnavi)
4d9af67 Remove unneeded template instantiations (kunal-vaishnavi)
1d9161d Fixes: update crossQK copy for first step; (mindest)
97be76a Enable getting model inputs to user (kunal-vaishnavi)
1bcd264 Add additional check for cache indirection (kunal-vaishnavi)
c35a73d Add audio processing unit test (kunal-vaishnavi)
1d5da61 Fix Whisper GenAI config (kunal-vaishnavi)
efd0199 Save checkpoint for working solution (kunal-vaishnavi)
fbebe68 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
ef955e7 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
e869d02 Squashed commit of the following: (kunal-vaishnavi)
32c48d2 Initial changes to work with main (kunal-vaishnavi)
e4a8b5f Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
323028a Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
7756a86 Resolving build errors after merging main (kunal-vaishnavi)
a167add Fix prompt length and get input (kunal-vaishnavi)
8782b47 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
c93a1ab Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
c0efa93 Fix build issues after syncing with main (kunal-vaishnavi)
27ba626 Add gpt2 to list of LLMs (kunal-vaishnavi)
2eb198b Cast from ORT float16 to uint16 and then uint16 to half (kunal-vaishnavi)
d4e7446 Remove const casting (kunal-vaishnavi)
8558cab Fix windows build errors (kunal-vaishnavi)
40a555a Update processing for audio features (kunal-vaishnavi)
4c3752c Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
5fa7300 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
198db2d Fix duplicate config names after merging main (kunal-vaishnavi)
3264244 Add comments to C API process methods (kunal-vaishnavi)
83a915d Move SetInputs from params to generator (kunal-vaishnavi)
c876bd2 Use SetExtraInputs for all states (kunal-vaishnavi)
095d452 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
67359da Fix build errors after merging main (kunal-vaishnavi)
59ae78d Align spacing for comment (kunal-vaishnavi)
c12d16b Add extra inputs back to decoder only state (kunal-vaishnavi)
eec4bba Always call SetExtraInputs (kunal-vaishnavi)
da6fc9a Add audio processing APIs in other languages (kunal-vaishnavi)
1e29004 Comment out multi-prompt APIs for now (kunal-vaishnavi)
a0d5be7 Fix Java build issues with new audio classes (kunal-vaishnavi)
c474d62 Add missing Objective-C interfaces for new audio classes (kunal-vaishnavi)
eb88379 Fix variable names in setting inputs for Java API (kunal-vaishnavi)
502afb3 Update Java unit tests (kunal-vaishnavi)
c7a865b Fix tensor unit test in Java (kunal-vaishnavi)
6edf8d3 Add C/C++ APIs to set batched input ids (kunal-vaishnavi)
aed1720 Start updating Whisper inference examples (kunal-vaishnavi)
eb285b9 Update Whisper examples and add Python pre-processing binding (kunal-vaishnavi)
7edc027 Update audio preprocessing unit tests (kunal-vaishnavi)
85689af Add changes suggested by clang-format and CodeQL (kunal-vaishnavi)
29cf80b Remove extra newline for clang-format (kunal-vaishnavi)
d1a7608 Add Python CI test for Whisper (kunal-vaishnavi)
fe27ea1 Fix cache indirection updating (kunal-vaishnavi)
8883492 Fix build warning in Windows CIs (kunal-vaishnavi)
1839220 Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
92b32d9 Remove commented out code (kunal-vaishnavi)
9bcc681 Use feature extraction instead of speech log mel (kunal-vaishnavi)
1a1ef86 Fix variable names based on PR feedback (kunal-vaishnavi)
9577e36 Fix import name for E2E unit tests (kunal-vaishnavi)
96251cd Update ORT extensions commit (kunal-vaishnavi)
8e40be7 Only transpose K caches when DMMHA is used (kunal-vaishnavi)
17cc672 Fix extra inputs usage for pipeline and GPT models (kunal-vaishnavi)
8fb2f1b Merge branch 'main' into kvaishnavi/whisper (kunal-vaishnavi)
0aa011b Move SetExtraInputs to the right state (kunal-vaishnavi)
2681f35 Access sessions field through model object (kunal-vaishnavi)
7f283a4 Rewrite batched preprocessing APIs (kunal-vaishnavi)
a7cbacc Use different C++ API call for one prompt in preprocessing (kunal-vaishnavi)
9989264 Remove vector usage for C-only environment in Java bindings (kunal-vaishnavi)
4ca81bd Cast pybind str to std string (kunal-vaishnavi)
04513a2 Remove OgaCheckResult from Java bindings (kunal-vaishnavi)
4a76e15 Fix typo in Java doc string (kunal-vaishnavi)
b316861 Fix NativeMethods function name (kunal-vaishnavi)
5141f00 Add changes suggested by linters (kunal-vaishnavi)
2e3560a Change how strdup is defined (kunal-vaishnavi)
d977340 Add changes from PR feedback (kunal-vaishnavi)
363ca7c Activate Whisper E2E CI tests (kunal-vaishnavi)