Prostt5 ctranslate2 splitting long sequences #327
Open
mpjw wants to merge 23 commits into steineggerlab:prostt5-ctranslate2 from
added 23 commits
August 6, 2024 21:13
…e length bug in the 3Di predictions
…ing splitting functionality
…ute once CUDA build is fixed
Adds functionality to run predictions on AA sequences that are longer than ProstT5 attention can handle, using a split-wise approach.
Sequences longer than --prostt5-split-length (default 6000; set to 0 to disable splitting) are split, and the predictions for the individual splits are concatenated.
Test cases include one file with a 500 split length and one with a 6000 split length.
Additionally, a conda environment was created for compiling this branch with CUDA.
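To illustrate the split-and-concatenate behavior described above, here is a minimal Python sketch. It is not the code from this PR: the function names `split_sequence` and `predict_long` are hypothetical, `predict` is a stand-in for the actual ProstT5 call, and the sketch assumes consecutive, non-overlapping chunks and one output character per input residue. The PR description does not say how split boundaries are chosen, so treat that part as an assumption.

```python
from typing import Callable, List


def split_sequence(seq: str, split_length: int) -> List[str]:
    """Cut seq into consecutive chunks of at most split_length residues.

    A split_length of 0 disables splitting, mirroring the documented
    meaning of --prostt5-split-length. Non-overlapping chunks are an
    assumption of this sketch, not a confirmed detail of the PR.
    """
    if split_length < 0:
        raise ValueError("split_length must be >= 0")
    if split_length == 0 or len(seq) <= split_length:
        return [seq]
    return [seq[i:i + split_length] for i in range(0, len(seq), split_length)]


def predict_long(seq: str,
                 predict: Callable[[str], str],
                 split_length: int = 6000) -> str:
    """Predict each chunk independently and concatenate the results.

    `predict` is a hypothetical stand-in for the model call; it is
    assumed to return one character per input residue.
    """
    return "".join(predict(chunk) for chunk in split_sequence(seq, split_length))


if __name__ == "__main__":
    # str.lower is used as a dummy predictor that preserves length.
    print(split_sequence("ACDEFGHIKL", 4))            # ['ACDE', 'FGHI', 'KL']
    print(predict_long("ACDEFGHIKL", str.lower, 4))   # acdefghikl
```

Because each chunk is predicted independently, residues near a split boundary lose context from the neighboring chunk. That is a property of this simple sketch, and may or may not apply to the implementation in the PR.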