-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Labels
kraken2Issue related to bugs/instabilities in Kraken2.Issue related to bugs/instabilities in Kraken2.
Description
The memory usage is becoming an ongoing issue especially with the newer databases and the self classification step during DB construction. Once recent strategy is the implementation by sharded hashes in Kun-Peng:
https://github.com/eric9n/Kun-peng
Some initial tests looked good and the unique syncmer assignment also improved the MEDI classifications a bit. This issue tracks the addition of sharded hashing into the workflow.
Open Steps
- change download scripts to also download the decoys previously contained in the Kraken2 standard DB (bacteria, archaea, virus, plasmid, vectors)
- figure out how to layout the database to make it work with Kun-Peng
- benchmark the DB constructions (slower with Kun-Peng)
- decide on the level of fragmentation (shard size)
- benchmark the classification speed with the sharded hash
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
kraken2Issue related to bugs/instabilities in Kraken2.Issue related to bugs/instabilities in Kraken2.