This repository contains a Nextflow implementation of the CgpWGS_hg38 pipeline, mutect2 and gridss.
All tools used in this pipeline are contained in the CgpWGS_hg38 container. It can be excuted via Singularity or Docker.
- Nextflow >= 23.04 (tested version - This script makes use of DSL 2 synthax, which became the default at 22.03.0-edge)
- Reference files (As of now, the script will not check for these files and download them.)
Please do not install Singularity through conda.
Singularity must be owned by root in order to run this workflow. If you do not have access to a singularity installation owned by root try requesting to your system administrator.
- Setup a working directory and copy the
cgpWGS.nfandnextflow.configfiles inside it. - Edit the variables in 'config/params.config' to point to the appropriate directories (similar to the
nextflow.configof the original pipeline). NOTE: Paths must either be absolute, or include the variable$projectDirif using paths relative to the script directory. - Edit the variables in
config/process.configto adjust executor settings. - Create an input CSV table. The table must have 2 columns only: tumor file and normal file paths e.g. [./mytumor.bam,./mynormal.bam].
- Execute
nextflow run cgpWGS.nf -with-singularity <path-to-container>to execute the pipeline.
- You can execute nextflow with the flag
-with-reportto create an HTML resource usage report. This is useful to set resource configuration. - If your pipeline execution is interrupted/fails, you can execute nextflow with the flag
-resumeto resume where you left off.