Welcome to the downstream analysis arm of the Molecular Mechanisms Cluster (MMC) at the University of Edinburgh. If you have any questions, ideas or would like to collaborate, please reach out to silvia.shen[at]ed.ac.uk.
Aims
Our aims are as follows (taken from the FGI website).
"This research cluster will tackle the “missing link” between the genome and disease. Our aim is to identify molecules in our bodies that cause disease by looking at how tiny variations in the genome change these molecules in single cells. We will create a unique dataset by reading molecular signals in human cells donated by hundreds of patients, potentially leading to ideas for new, effective drugs. ... In the Molecular Mechanisms Cluster we hold ourselves to a simple measure of success: the number of disease genes explained (number of disease-associated common genetic variants significantly colocalising with molecular quantitative trait loci)."
In simple terms, our goals are:
⭐️ Identify QTLs
⭐️ Create open source database of QTLs
⭐️ Colocalise QTLs
⭐️ Further analysis of QTLs
For an overview of the whole process, please see the MMC powerpoint. For an overview of what we could do with the very exciting single-cell data, see figure below (the projects I am currently working on are highlighted in red).
Following processing of raw data, single-cell data will be pre-processed, visualised and cellular structure will be identified. For this pipeline, please see the single cell pipeline.
Please see the cell-type analysis plan for what we plan to do.
- What known cell types are there in our samples?
- What cells are communicating with each other? (receptor-ligand pairs)
- How does the abundance of cell-types change with environmental perturbations?
Please see the single-cell eQTL mapping strategy on proposed options for single-cell eQTL mapping.
For pipeline design, please see the pipeline specification doc.
Please feel free to add any and all ideas you would like to implement (or see implemented by the team/me)!
- Variance QTL mapping
- Dynamic cell state/continuous cell state or phenotype mapping (perhaps go straight into this?)
- Allelic imbalance analysis (see key literature)
- Something with reads or tails of reads?
- Producing a sc-eQTL atlas (similar to cattle gtex)
- Which genes have the most variation across a single cell type? (and which SNPs/regions is this associated with) --> varQTLs
- Single-cell integration with polygenic risk scores? Paper
- The Theis lab's book on single-cell best practices and the accompanying paper and tutorial and github.
- Aarun Lun's thoughts on single cell analysis on github
- Sanger's Hemberg group resources: book
- scanpy tutorials
- Cuomo's review paper: Single-cell genomics meets human genetics. Great initial overview of main methods and challenges.
- Maria's review paper: The Power of Single-Cell RNA Sequencing in eQTL Discovery. Great starting point, v high level.
- Zhang and Zhao's review paper: eQTL studies: from bulk tissues to single cells. Has more equations and is more detailed on the QTL mapping bit.
Scanpy and Theis lab tutorials here.
Kathryn Campbell, Dominique McCormick: data engineers and data scientists working on raw data processing and work with ODAP (HPC system). Konrad Rawlik, Kenneth Baillie: MMC leadership.
