Skip to content

PayneLab/pancancerProteinMRNA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

309 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Activating cancer hallmarks through changes in mRNA/protein regulation

As a diverse family of diseases, cancer is unified by a set of common dysfunctions, such as limitless growth potential and an insensitivity to anti-growth signals. These shared overarching biological processes have been termed the hallmarks of cancer. To better understand the root cause of cellular dysregulation, intense molecular characterization of tumors have utilized DNA, RNA and protein measurement techniques to produce proteogenomic data. In large cancer cohort studies, genomic and proteogenomic data have frequently identified many cancer hallmarks, including cell cycle and cell signaling. However, altered metabolism - a known cancer hallmark - is not as clearly identified in mutation screens of simple differential expression analyses. Here we introduce a new computational method to identify changes in cellular regulation by focusing on the mRNA/protein relationship. We create a metric, Δ_corr, to capture when the mRNA/protein correlation changes significantly between tumor and normal tissues, and show that it is distinct from differential expression and also not associated with DNA mutation profiles. Our method clearly highlights altered metabolic pathways across multiple tumor types. Δ_corr gives researchers a new perspective on dysfunction of tumor cells and introduces a novel method for proteogenomic data integration.

Table of Contents

Requirements

  • pip install -r requirements.txt
  • pip install -e ./pcprutils

Δ_corr

Δ_corr is the difference between the healthy and diseased correlations of RNA and protein abundances. The correlation scripts and tables can be found in the data folder here.
The csv files contain Δ_corr tables calculated with the scripts in the folder.
The scripts were run in BYU's supercomputer. To run this localy:
Run download_pancan.py and pass as argument a token for downloading non published datasets.
Use the command in the found in the cancer bash files to calculate the tables. However, note that the it may take some time to finish the run.

Transmutation Effects

The transmutation effects scripts and tables can be found in the data folder here.
The csv files contain the transmutation effect tables calculated with the scripts in the folder.
The scripts were run in BYU's supercomputer. To run this localy:
Run download_pancan.py and pass as argument a token for downloading non published datasets.
Use the command in the cancer bash files. Again, note that the it may take some time to finish running.

Figures/Table

Figures and table produced by the scripts can be found here.
All the scripts that produce the images and table can be found here.

pcprutils

Collection of useful functions to load cancers, fetch proteomic and transcriptomic data, and calculate correlations and permutation scores.

Legacy Version Notice

The code in this repo uses an older version of CPTAC. This may cause dependency issues or warnings with some packages. It is recomended to install in a clean environment. If there is issues installing the older packages from the requirements, try using pip 23.0 as this version of pip managed to install them. If you want to run everything from scratch, CPTAC will request from you Box authorization. Please contact us if you need authorization.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages