A Snakemake pipeline for quantitative metagenomics profiling using METEOR, converted from the original Nextflow pipeline.
METEOR is a platform for quantitative metagenomics profiling of complex ecosystems. It performs:
- Species-level taxonomic profiling (Bacteria, Archaea, Eukaryotes)
- Functional analysis
- Strain-level population structure inference
- Snakemake (≥6.0)
- Conda/Mamba
- Python ≥3.11
- Clone or download this pipeline
- Install Snakemake:
conda install -c conda-forge -c bioconda snakemake
Place paired-end FASTQ files in a directory with the naming pattern:
*_R1*.fastq.gzand*_R2*.fastq.gz- Or
*_R1*.fastqand*_R2*.fastq
Edit config.yaml:
- Set
input_dirto your FASTQ directory - Set
output_dirfor results - Choose a
catalogue_nameor provide a customcatalogue_path - Adjust
cpus,fastmode, and memory settings as needed
# Dry run to check the workflow
snakemake --configfile config.yaml --dry-run
# Run with 8 cores
snakemake --configfile config.yaml --cores 8 --use-conda
# Run on a cluster (example with SLURM)
snakemake --configfile config.yaml --cores 8 --use-conda --cluster "sbatch -t 60 -c {threads}"- Download/Prepare Catalogue: Downloads prebuilt microbial gene catalogue
- FASTQ Processing: Indexes and prepares FASTQ files
- Mapping: Maps reads against the gene catalogue
- Profiling: Generates taxonomic and functional profiles
- Merging: Combines profiles from all samples
- Strain Analysis: Performs strain-level analysis (if not in fast mode)
- Tree Building: Constructs phylogenetic trees from strain data
Results are written to the specified output_dir:
merged/: Combined taxonomic and functional profilestree/: Phylogenetic trees (if strain analysis performed)meteor_report.html: Summary report
Choose from prebuilt catalogues:
hs_10_4_gut: Human gut microbiomehs_8_4_oral: Human oral microbiomehs_2_9_skin: Human skin microbiomefc_1_3_gut: Cat gut microbiomeclf_1_0_gut: Dog gut microbiomemm_5_0_gut: Mouse gut microbiomessc_9_3_gut: Pig gut microbiome- And others (see config.yaml)
- Normal mode: Full taxonomic + functional analysis
- Fast mode (
fast: true): Taxonomic analysis only, reduced memory usage
- Normal mode: ~30GB+ RAM (depends on sample size)
- Fast mode: ~10GB RAM maximum
- Memory scales with number of reads per sample
- No FASTQ files found: Check file naming pattern and input directory
- Memory errors: Increase
minimum_memory_gbor enable fast mode - Catalogue download fails: Check internet connection and catalogue name
Remove intermediate files:
snakemake clean --configfile config.yamlThis Snakemake version maintains the same functionality as nf-meteor.nf:
| Feature | Nextflow | Snakemake |
|---|---|---|
| Catalogue download | ✓ | ✓ |
| FASTQ processing | ✓ | ✓ |
| Read mapping | ✓ | ✓ |
| Taxonomic profiling | ✓ | ✓ |
| Functional profiling | ✓ | ✓ |
| Profile merging | ✓ | ✓ |
| Strain analysis | ✓ | ✓ |
| Tree construction | ✓ | ✓ |
| Memory management | ✓ | ✓ |
| Fast mode | ✓ | ✓ |
If you use this pipeline, please cite:
- METEOR: Ghozlane et al. "Accurate profiling of microbial communities for shotgun metagenomic sequencing with Meteor2." Microbiome (2025)
- Snakemake: Köster & Rahmann. "Snakemake—a scalable bioinformatics workflow engine." Bioinformatics (2012)