CalibratedChIP

Pipeline for analysing and calibrating ChIP-seq data with spike-in genome.

Emily Georgiades, Hughes Group (March 2020)

This pipeline has been re-written using scripts from Nadya Fursova (Robert Klose Lab, Biochem) and is designed to take fastq files as input and output downsampled bigwig files to view in UCSC.

Downsampling is achieved by using a 4% spike-in in ChIP.

Version requirements ⚙️

Bowtie2 v2.1.0
Sambamba v0.6.6
Samtools v1.3 (using htslib 1.3)
Python v3.7.4
macs2 v2.0.10
ucsctools v373

Preliminary requirements

1. Catenated genome

Take the two genomes of interest and rename chromosomes so that thet include species:

sed 's/>chr/>mm10_chr/g' /databank/igenomes/Mus_musculus/UCSC/mm10/Sequence/Bowtie2Index/genome.fa > ./mm10_genome.fa

sed 's/>chr/>dm6_chr/g' /databank/igenomes/Drosophila_melanogaster/UCSC/dm6/Sequence/Bowtie2Index/genome.fa > ./dm6_genome.fa

Catenate these two genomes:

cat /databank/igenomes/Mus_musculus/UCSC/mm10/Sequence/Bowtie2Index/genome.fa /databank/igenomes/Drosophila_melanogaster/UCSC/dm6/Sequence/Bowtie2Index/genome.fa > catenated_mm10_dm6.fa &

Then need to build bowtie2 index:

bowtie2-build /path/concatenated.fa output_prefix

See instructions on Homer webpage

2. paths_to_fastqs.txt

Needs to tab separated and without headers:

sampleName pathtoRead1 pathtoRead2

3. chrom.size files

These need to be saved in the same directory as the bowtie2 indexing.

They can be downloaded from UCSC e.g. hg19.chrom.sizes

SUMMARY

You should have a new directory containing the following (can sym link to scripts):

paths_to_fastqs.txt
downSampling_calc.py
calibratedChIP_pipeline.sh

Run: $ bash calibratedChIP_pipeline.sh -g genome -s spike-in genome -b bt2_dir -i yes/no -p path/public_dir

For help see: $ bash -h calibratedChIP_pipeline.sh

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
snakemake		snakemake
.gitignore		.gitignore
README.md		README.md
calibratedChIP_pipeline.sh		calibratedChIP_pipeline.sh
downSampling_calc.py		downSampling_calc.py
paths_to_fastqs.txt		paths_to_fastqs.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CalibratedChIP

Pipeline for analysing and calibrating ChIP-seq data with spike-in genome.

Emily Georgiades, Hughes Group (March 2020)

Version requirements ⚙️

Preliminary requirements

1. Catenated genome

2. paths_to_fastqs.txt

3. chrom.size files

SUMMARY

About

Uh oh!

Releases

Packages

Languages

EGeorgia/CalibratedChIP

Folders and files

Latest commit

History

Repository files navigation

CalibratedChIP

Pipeline for analysing and calibrating ChIP-seq data with spike-in genome.

Emily Georgiades, Hughes Group (March 2020)

Version requirements ⚙️

Preliminary requirements

1. Catenated genome

2. paths_to_fastqs.txt

3. chrom.size files

SUMMARY

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages