Add modules and subworkflows for single-cell DNA processing.#79
Open
ljwharbers wants to merge 92 commits intonf-core:devfrom
Open
Add modules and subworkflows for single-cell DNA processing.#79ljwharbers wants to merge 92 commits intonf-core:devfrom
ljwharbers wants to merge 92 commits intonf-core:devfrom
Conversation
Performing merge for first release
Post-release DOI
1.1.0 Updates
Release 1.2.0
…lt. Now using chopper
…pt the fastq file
…ode_formats. Also change dlogic for whitelist gunzipping
Collaborator
|
I know we talked in slack, but just to put it here as well. One of the other items we'll need is test data for the DNA portions of the pipeline (and updates to the test.configs or perhaps even new test configs/profiles), with the downsampled and normal FASTQs going here: https://github.com/nf-core/test-datasets/tree/scnanoseq |
…isoform labels as well
Contributor
Author
|
I am sorry it turned into this beast of a PR... :') I have tested it and done some last-minute refactoring of the outdirs. Also added some test data that runs through without a problem on my end. Feel free to have a look and let me know if there is anything needed on my end @atrull314 |
…sing cDNA data types
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
The following key changes have been made to enable both cDNA and DNA sample processing with the same pipeline.
Modules added:
flexiplex/discovery,flexiplex/filter,flexiplex/assignandflexiformatter. The flexiplex modules are used to extract barcodes, filter and assign barcodes.flexiformatteris used to move the barcodes from the readname to the bam tags.Subworkflow added:
demultiplex_blaze.nfanddemultiplex_flexiplex.nfwhere the demultiplexing steps are executed.align_deduplicate_dna.nfwhich performsminimap2alignment intopicard MarkDuplicatesandBAM_SORT_STATS_SAMTOOLS.Input changes:
typewhich can be either dna or cdna. It is an optional column and defaults to cdna to maintain compatibility with older samplesheets.Config changes:
whitelistparameter is removed and replaced by two new parameterswhitelist_dnaandwhitelist_cdnato specifiy a DNA and cDNA whitelist, respectively.assets/are now.gzinstead of.zipto easier compatibility with blaze/flexiplex with one simple gunzip module if the file ends with.gz.demux_toolto specify whether to useflexiplexorblazefor cDNA samples demultiplexing.QC changes:
READ_COUNTS()is edited to be able to use flexiplex and blaze output as input.TODO:
Update readmes and ...?
PR checklist
nf-core pipelines lint).nextflow run . -profile test,docker --outdir <OUTDIR>).nextflow run . -profile debug,test,docker --outdir <OUTDIR>).docs/usage.mdis updated.docs/output.mdis updated.CHANGELOG.mdis updated.README.mdis updated (including new tool citations and authors/contributors).