Prostt5 ctranslate2 splitting long sequences #327

Open
mpjw wants to merge 23 commits into steineggerlab:prostt5-ctranslate2 from mpjw:prostt5-ctranslate2

mpjw commented Aug 14, 2024

Adds functionality to predict AA sequences that are longer than ProstT5's attention can handle, using a split-wise approach.
Sequences longer than --prostt5-split-length (default 6000; set to 0 to disable splitting) are split, and the predictions for the individual splits are concatenated.
The test cases include one file run with a split length of 500 and one with a split length of 6000.
Additionally, a conda environment was created for compiling this branch with CUDA.
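
For reviewers, the split-and-concatenate behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the code in this branch: the names `split_sequence`, `predict_long` and the `predict` callback are hypothetical, and cutting into consecutive fixed-size chunks is an assumption, since the description does not say how the split points are chosen.

```python
from typing import Callable, List


def split_sequence(seq: str, split_length: int) -> List[str]:
    """Cut seq into consecutive chunks of at most split_length residues.

    Assumption: plain fixed-size chunks with no overlap.
    A split_length of 0 disables splitting, mirroring the flag's description.
    """
    if split_length < 0:
        raise ValueError("split_length must be >= 0")
    if split_length == 0 or len(seq) <= split_length:
        return [seq]
    return [seq[i:i + split_length] for i in range(0, len(seq), split_length)]


def predict_long(seq: str,
                 predict: Callable[[str], str],
                 split_length: int = 6000) -> str:
    """Predict each chunk on its own and concatenate the results in order.

    `predict` is a stand-in for the real per-chunk model call (hypothetical).
    """
    return "".join(predict(chunk) for chunk in split_sequence(seq, split_length))
```

With a split length of 500, a 1200-residue input would be predicted as three chunks of 500, 500 and 200 residues. Because each chunk is predicted without context from its neighbours, predictions near the split boundaries may differ from what a single full-length pass would give.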

Marcus Wagner added 23 commits August 6, 2024 21:13