METEOR Shotgun Metagenomics Snakemake Pipeline

A Snakemake pipeline for quantitative metagenomics profiling using METEOR, converted from the original Nextflow pipeline.

Overview

METEOR is a platform for quantitative metagenomics profiling of complex ecosystems. It performs:

Species-level taxonomic profiling (Bacteria, Archaea, Eukaryotes)
Functional analysis
Strain-level population structure inference

Requirements

Snakemake ≥6.0
Conda/Mamba
Python ≥3.11

Installation

Clone or download this pipeline

Install Snakemake:

conda install -c conda-forge -c bioconda snakemake

Usage

1. Prepare your data

Place paired-end FASTQ files in a single directory, named according to one of these patterns:

*_R1*.fastq.gz and *_R2*.fastq.gz (gzip-compressed)
*_R1*.fastq and *_R2*.fastq (uncompressed)
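
To check a directory listing before running the pipeline, the naming patterns above can be approximated with a small script. This is a minimal sketch, not the pipeline's own sample discovery: the regular expression, the function name pair_fastq, and the idea that the sample name is everything before _R1/_R2 are assumptions made for illustration.

```python
import re
from collections import defaultdict

# Approximates *_R1*.fastq[.gz] / *_R2*.fastq[.gz].
# Assumption: the sample name is the text before the last _R1 or _R2.
FASTQ_RE = re.compile(r"^(?P<sample>.+)_R(?P<read>[12]).*\.fastq(\.gz)?$")


def pair_fastq(filenames):
    """Return ({sample: {"R1": name, "R2": name}}, [samples missing a mate])."""
    found = defaultdict(dict)
    for name in sorted(filenames):
        match = FASTQ_RE.match(name)
        if match:  # files that do not follow the pattern are ignored
            found[match.group("sample")]["R" + match.group("read")] = name
    pairs = {s: reads for s, reads in found.items() if set(reads) == {"R1", "R2"}}
    unpaired = sorted(s for s in found if s not in pairs)
    return pairs, unpaired
```

Calling pair_fastq(os.listdir(input_dir)) returns the complete pairs plus a list of samples that have only one mate, which is usually the first thing to check when no FASTQ files are found.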

2. Configure the pipeline

Edit config.yaml:

Set input_dir to your FASTQ directory
Set output_dir for results
Choose a catalogue_name or provide a custom catalogue_path
Adjust cpus, fast (fast mode), and the memory settings (for example minimum_memory_gb) as needed
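
The settings above can be sanity-checked before launching a run. The sketch below works on a plain dictionary, as one would get after loading config.yaml. The key names come from this README, but the value types and the rules applied here are assumptions, and check_config is a hypothetical helper that is not part of the pipeline.

```python
def check_config(cfg):
    """Return a list of human-readable problems found in a config dictionary."""
    problems = []
    # input_dir and output_dir are both described as required settings.
    for key in ("input_dir", "output_dir"):
        if not cfg.get(key):
            problems.append(f"{key} is not set")
    # Either a prebuilt catalogue name or a custom catalogue path is needed.
    if not cfg.get("catalogue_name") and not cfg.get("catalogue_path"):
        problems.append("set catalogue_name or catalogue_path")
    # Assumption: cpus is a positive integer and fast is a boolean.
    cpus = cfg.get("cpus")
    if cpus is not None and (isinstance(cpus, bool) or not isinstance(cpus, int) or cpus < 1):
        problems.append("cpus must be a positive integer")
    if "fast" in cfg and not isinstance(cfg["fast"], bool):
        problems.append("fast must be true or false")
    return problems


# Example values are illustrative only.
example = {
    "input_dir": "reads",
    "output_dir": "results",
    "catalogue_name": "hs_10_4_gut",
    "cpus": 8,
    "fast": True,
}
print(check_config(example))
```

An empty list means none of these basic checks failed; it does not guarantee that the catalogue name is valid or that the directories exist.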

3. Run the pipeline

# Dry run to check the workflow
snakemake --configfile config.yaml --dry-run

# Run with 8 cores
snakemake --configfile config.yaml --cores 8 --use-conda

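# Run on a cluster (example with SLURM)
# Note: --cluster is available in Snakemake releases before 8; newer releases use executor plugins instead
snakemake --configfile config.yaml --cores 8 --use-conda --cluster "sbatch -t 60 -c {threads}"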

Pipeline Steps

Download/Prepare Catalogue: Downloads a prebuilt microbial gene catalogue
FASTQ Processing: Indexes and prepares the FASTQ files
Mapping: Maps reads against the gene catalogue
Profiling: Generates taxonomic and functional profiles
Merging: Combines the profiles from all samples
Strain Analysis: Performs strain-level analysis (skipped in fast mode)
Tree Building: Constructs phylogenetic trees from the strain data

Output

Results are written to the specified output_dir:

merged/: Combined taxonomic and functional profiles
tree/: Phylogenetic trees (if strain analysis performed)
meteor_report.html: Summary report

Configuration Options

Catalogue Options

Choose from prebuilt catalogues:

hs_10_4_gut: Human gut microbiome
hs_8_4_oral: Human oral microbiome
hs_2_9_skin: Human skin microbiome
fc_1_3_gut: Cat gut microbiome
clf_1_0_gut: Dog gut microbiome
mm_5_0_gut: Mouse gut microbiome
ssc_9_3_gut: Pig gut microbiome
And others (see config.yaml)

Processing Modes

Normal mode: Full taxonomic + functional analysis
Fast mode (fast: true): Taxonomic analysis only, reduced memory usage

Memory Requirements

Normal mode: roughly 30 GB of RAM or more, depending on sample size
Fast mode: at most about 10 GB of RAM
Memory use scales with the number of reads per sample
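
When sizing a cluster job, the figures above can serve as a starting point. The helper below is only a sketch: memory_request_gb is a hypothetical function, the baselines are the approximate values quoted in this section, and treating minimum_memory_gb as a lower bound on the request is an assumption about how that setting is meant to be used.

```python
def memory_request_gb(fast, minimum_memory_gb=0):
    """Return a starting memory request in GB for one sample."""
    # Approximate figures from this README: ~10 GB in fast mode, ~30 GB otherwise.
    baseline = 10 if fast else 30
    # Assumption: minimum_memory_gb acts as a floor on the request.
    return max(baseline, minimum_memory_gb)
```

Because memory use grows with the number of reads per sample, deeply sequenced samples in normal mode may need more than this baseline.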

Troubleshooting

Common Issues

No FASTQ files found: Check file naming pattern and input directory
Memory errors: Increase minimum_memory_gb or enable fast mode
Catalogue download fails: Check internet connection and catalogue name

Cleaning Up

Remove intermediate files:

snakemake clean --configfile config.yaml --cores 1

Comparison with Original Nextflow Pipeline

This Snakemake version maintains the same functionality as nf-meteor.nf:

Feature	Nextflow	Snakemake
Catalogue download	✓	✓
FASTQ processing	✓	✓
Read mapping	✓	✓
Taxonomic profiling	✓	✓
Functional profiling	✓	✓
Profile merging	✓	✓
Strain analysis	✓	✓
Tree construction	✓	✓
Memory management	✓	✓
Fast mode	✓	✓

Citation

If you use this pipeline, please cite:

METEOR: Ghozlane et al. "Accurate profiling of microbial communities for shotgun metagenomic sequencing with Meteor2." Microbiome (2025)
Snakemake: Köster & Rahmann. "Snakemake—a scalable bioinformatics workflow engine." Bioinformatics (2012)
