-
Notifications
You must be signed in to change notification settings - Fork 0
License
zhejilab/PepScore
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
PepScore Calculation Contact: Zhe Ji (zhe.ji@northwestern.edu) The program is to calculate peptide stable probability based on encoding open reading frame (ORF) features. Requirements: Perl program installation; R program installation. 1. Calculate the false discovery rate (FDR) of ORF lengths considerating transcript lengths. usage: perl CalculateFDR.ORFlength.pl -g SequenceFile -t lengthsOfInterest -o outputDir -s startCodon -g SequenceFile: input the random sequence file in the fasta format -t lengthsOfInterest: Tab delimited text file with two columns. The first column shows the transcript length, and the second shows the ORF length (e.g. 1000 36) -o outputDir: output director -r repeat [optional]: the number of times generating random transcript sequences, default: 1000 -s startCodon [optional]: start codon types, default: ATG -l orfLengthCutoff [optional]: cutoff of minimum candidate ORF length, default: 6 Example command line: perl CalculateFDR.ORFlength.pl -g rand.genome.fa -t transcript.length.txt -o outputDir 2. Calculate the phyloCSF score of an ORF. usage: perl CalculatePhylocsf.pl -f orfFile -p phylocsfDir -o outputFile -f orfFile: ORF structure in the genePred format; -p phylocsfDir: the folder link containing phylocsf score files; -o outputFile: output file. You can download the phyloCSF score files from the following website: https://data.broadinstitute.org/compbio1/PhyloCSFtracks/ The downloaded files are the bw format. And convert the files to the bedGraph format using the "bigWigToBedGraph" tool from the UCSC Genome Browser, which can be downloaded from "https://hgdownload.soe.ucsc.edu/admin/exe/". Be sure the phylocsfDir contains the following files: PhyloCSF+1.bedGraph, PhyloCSF+2.bedGraph, PhyloCSF+3.bedGraph, PhyloCSF-1.bedGraph, PhyloCSF-2.bedGraph, PhyloCSF-3.bedGraph 3. Run "PepScore.R" to calculate PepScore. As shown in the example folder, the input file should contain the following minimum column information for each ORF in a row. len.fdr: the ORF length FDR calculated by "CalculateFDR.ORFlength.pl"; pfam: The Pfam prediction result (https://www.ebi.ac.uk/interpro/). 1: with a domain; 0: without a domain; tmhmm: The TMHMM prediction result (https://services.healthtech.dtu.dk/service.php?TMHMM-2.0). 1: with a domain; 0: without a domain; phylocsf: the averaged PhyloCSF score across the ORF region, which can be calculated by CalculatePhylocsf.pl; orf.len: the ORF length (nt).
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published