Skip to content

baillielab/MMC_computational

Repository files navigation

MMC computational analysis

Welcome to the downstream analysis arm of the Molecular Mechanisms Cluster (MMC) at the University of Edinburgh. If you have any questions, ideas or would like to collaborate, please reach out to silvia.shen[at]ed.ac.uk.

Aims

Our aims are as follows (taken from the FGI website).

"This research cluster will tackle the “missing link” between the genome and disease. Our aim is to identify molecules in our bodies that cause disease by looking at how tiny variations in the genome change these molecules in single cells. We will create a unique dataset by reading molecular signals in human cells donated by hundreds of patients, potentially leading to ideas for new, effective drugs. ... In the Molecular Mechanisms Cluster we hold ourselves to a simple measure of success: the number of disease genes explained (number of disease-associated common genetic variants significantly colocalising with molecular quantitative trait loci)."

In simple terms, our goals are:
⭐️ Identify QTLs
⭐️ Create open source database of QTLs
⭐️ Colocalise QTLs
⭐️ Further analysis of QTLs

For an overview of the whole process, please see the MMC powerpoint. For an overview of what we could do with the very exciting single-cell data, see figure below (the projects I am currently working on are highlighted in red).

Initial analysis

Following processing of raw data, single-cell data will be pre-processed, visualised and cellular structure will be identified. For this pipeline, please see the single cell pipeline.

Cell type analysis

Please see the cell-type analysis plan for what we plan to do.

  • What known cell types are there in our samples?
  • What cells are communicating with each other? (receptor-ligand pairs)
  • How does the abundance of cell-types change with environmental perturbations?

eQTL mapping

Please see the single-cell eQTL mapping strategy on proposed options for single-cell eQTL mapping.
For pipeline design, please see the pipeline specification doc.

Future analysis ideas

Please feel free to add any and all ideas you would like to implement (or see implemented by the team/me)!

  • Variance QTL mapping
  • Dynamic cell state/continuous cell state or phenotype mapping (perhaps go straight into this?)
  • Allelic imbalance analysis (see key literature)
  • Something with reads or tails of reads?
  • Producing a sc-eQTL atlas (similar to cattle gtex)
  • Which genes have the most variation across a single cell type? (and which SNPs/regions is this associated with) --> varQTLs
  • Single-cell integration with polygenic risk scores? Paper

Image

Resources

General single-cell stuff

Single-cell QTL mapping resources

Tutorials

Scanpy and Theis lab tutorials here.

Acknowledgements

Kathryn Campbell, Dominique McCormick: data engineers and data scientists working on raw data processing and work with ODAP (HPC system). Konrad Rawlik, Kenneth Baillie: MMC leadership.

About

Molecular mechanisms cluster single-cell QTL mapping analysis

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors