mini-prep

This Nextflow workflow processes paired-end sequencing reads by:

Quality trimming and filtering with fastp
Removing contaminant sequences using bbmap
Taxonomically classifying the clean reads using kraken2
Aggregating the QC reports with multiqc

Prerequisites

BBMap database: must contain a /ref/index/ directory
Kraken2 database: must contain a hash.k2d file
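
Both requirements can be checked before launching the workflow. The following is a minimal sketch, not part of the pipeline: the function name `check_databases` is hypothetical, and it assumes that ref/index/ and hash.k2d sit directly under the directories passed as --contaminants_db and --kraken_db.

```python
from pathlib import Path


def check_databases(contaminants_db, kraken_db):
    """Return a list of problems; an empty list means both layouts look right."""
    problems = []
    # BBMap index directory (assumed to sit directly under the database path)
    if not (Path(contaminants_db) / "ref" / "index").is_dir():
        problems.append(f"{contaminants_db}: missing ref/index/ directory")
    # Kraken2 hash table file (assumed to sit directly under the database path)
    if not (Path(kraken_db) / "hash.k2d").is_file():
        problems.append(f"{kraken_db}: missing hash.k2d file")
    return problems
```

Calling `check_databases("path/to/bbmap_db", "path/to/kraken_db")` returns one message per missing item, so an empty list suggests the databases are laid out as expected.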

Usage

nextflow run main.nf --input_pattern "path/to/reads/*_{1,2}.fastq.gz" \
                     --contaminants_db "path/to/bbmap_db" \
                     --kraken_db "path/to/kraken_db"
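
The --input_pattern value is quoted so that the shell passes it on unexpanded. To preview which files such a pattern would pair, here is a rough Python sketch. It only illustrates the naming convention (sample_1.fastq.gz with sample_2.fastq.gz); how main.nf actually groups the files is not shown in this README, and `pair_reads` is a hypothetical helper.

```python
import re
from collections import defaultdict

# Matches names such as "sampleA_1.fastq.gz", mirroring *_{1,2}.fastq.gz
READ_RE = re.compile(r"^(?P<sample>.+)_(?P<mate>[12])\.fastq\.gz$")


def pair_reads(filenames):
    """Group file names into {sample: (read1, read2)}, skipping incomplete pairs."""
    mates = defaultdict(dict)
    for name in filenames:
        match = READ_RE.match(name)
        if match:
            mates[match["sample"]][match["mate"]] = name
    return {
        sample: (found["1"], found["2"])
        for sample, found in sorted(mates.items())
        if len(found) == 2
    }


pairs = pair_reads(["s1_1.fastq.gz", "s1_2.fastq.gz", "s2_1.fastq.gz", "notes.txt"])
print(pairs)  # {'s1': ('s1_1.fastq.gz', 's1_2.fastq.gz')}
```

In this sketch, samples with only one mate (s2 above) are dropped, which makes missing files easy to spot before a run.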

Workflow Steps

graph TD
    A[Input Reads] --> B[FASTP]
    B --> C[BBMAP]
    C --> D[Clean Reads]
    C --> E[Mapped Contaminants]
    E --> F[SAMTOOLS_STATS]
    D --> G[KRAKEN]
    B --> H[QC Reports]
    F --> H
    H --> I[MULTIQC]
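
The diagram can also be read as a dependency graph. As an illustration only (Nextflow resolves the execution order itself from its channels), the sketch below copies the edges above into a Python dictionary and prints one valid ordering using the standard-library graphlib module (Python 3.9+).

```python
from graphlib import TopologicalSorter

# Edges copied from the diagram: each key depends on the nodes in its set
depends_on = {
    "FASTP": {"Input Reads"},
    "BBMAP": {"FASTP"},
    "Clean Reads": {"BBMAP"},
    "Mapped Contaminants": {"BBMAP"},
    "SAMTOOLS_STATS": {"Mapped Contaminants"},
    "KRAKEN": {"Clean Reads"},
    "QC Reports": {"FASTP", "SAMTOOLS_STATS"},
    "MULTIQC": {"QC Reports"},
}

# static_order() yields every node after all of its dependencies
order = list(TopologicalSorter(depends_on).static_order())
print(order)
```

Several orderings are valid because the clean-read branch (KRAKEN) and the contaminant branch (SAMTOOLS_STATS) do not depend on each other.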

Outputs

Quality-filtered and contaminant-free reads
Taxonomic classification of clean reads
Comprehensive QC reports (FASTP, SAMTOOLS_STATS, MULTIQC)

Note

Materials prepared for a training session at the Quadram Institute.
