Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
d8df6b1
Fix bug on custom path
VergaJU Jul 7, 2023
38ba530
Merge pull request #1 from VergaJU/fix_path
GuerreroVazquez Feb 5, 2024
14e56ee
Merge pull request #2 from Kathyren/master
GuerreroVazquez Feb 13, 2024
aa9d308
networks data
GuerreroVazquez Jun 25, 2024
e195e5b
networks data_adding TF data
GuerreroVazquez Jun 25, 2024
dd42e41
Update main_pipeline.py
GuerreroVazquez Jun 25, 2024
ff1748f
Add direction to network and evaluate nodes and edges
GuerreroVazquez Jun 26, 2024
0f60c4c
add the pathways to the nodes
GuerreroVazquez Jun 26, 2024
2fbc653
add_pathways_to_nodes
GuerreroVazquez Jun 26, 2024
a4443ab
add TF to network
GuerreroVazquez Jun 26, 2024
8e06f4d
add_DDS_data
GuerreroVazquez Jun 27, 2024
4b0ed2d
Weigth mirna must be negative
GuerreroVazquez Jun 27, 2024
f402cbb
network_style cytoscape
GuerreroVazquez Jul 2, 2024
65a5e37
Update main_pipeline.py
GuerreroVazquez Jul 2, 2024
f4a16c1
random_walk
GuerreroVazquez Jul 2, 2024
5c47c5e
notebooks
GuerreroVazquez Jul 4, 2024
ae51be1
generat_more_pr_filter_network
GuerreroVazquez Jul 4, 2024
9a3a3a9
Update .gitignore
GuerreroVazquez Jul 4, 2024
b9bc7e7
select_next_valid_node
GuerreroVazquez Jul 4, 2024
8f9c988
Update test_network.py
GuerreroVazquez Jul 4, 2024
be150e1
Update requirements.txt
GuerreroVazquez Jul 4, 2024
401cbb7
walking_network_to_pick_mirs
GuerreroVazquez Jul 4, 2024
aa2f11c
n
GuerreroVazquez Jul 16, 2024
19baaea
prepate funtons for noteboods
GuerreroVazquez Jul 19, 2024
abe3e82
process
GuerreroVazquez Jul 20, 2024
697eeba
Update Tissue-DE-Network.ipynb
GuerreroVazquez Jul 23, 2024
494b580
a lot
GuerreroVazquez Aug 2, 2024
0e4f0e5
Update main_pipeline.py
GuerreroVazquez Aug 2, 2024
9332e33
Update jupyter_functions.py
GuerreroVazquez Nov 14, 2024
cf4a198
looking at the network weights noteboks
GuerreroVazquez Jan 10, 2025
f67b6a0
missed from previous commit
GuerreroVazquez Jan 10, 2025
b6c9330
network processing does not need the main, but it does need a test
GuerreroVazquez Jan 10, 2025
140d7ab
New network analysis
GuerreroVazquez Jan 14, 2025
0c70e83
Network before refactor
GuerreroVazquez Feb 25, 2025
9a89688
Update .gitignore
GuerreroVazquez Feb 25, 2025
a7728be
Update .gitignore
GuerreroVazquez Feb 25, 2025
14dd02f
Delete graph1_Analysis_Sarcopenia.cyjs.png
GuerreroVazquez Feb 25, 2025
a2841e9
Create README.md
GuerreroVazquez Jun 16, 2025
3223792
Merge pull request #4 from GuerreroVazquez/GuerreroVazquez-patch-1
GuerreroVazquez Jun 16, 2025
e150673
Update README.md
GuerreroVazquez Jun 16, 2025
9e6a3f7
Merge pull request #5 from GuerreroVazquez/GuerreroVazquez-patch-2
GuerreroVazquez Jun 16, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
19 changes: 19 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -21,3 +21,22 @@ network/graph1_STRING_Magagnes20092default node.csv
.idea/misc.xml
.idea/Muscle_wasting.iml
*.bkp
*.sif
network/Sessions/*
*.png
*.gif
*.png
network/.ipynb_checkpoints/*
network/.ipynb_checkpoints/mirnas_influence_01-checkpoint.ipynb
network/.ipynb_checkpoints/mirnas_influence_02-checkpoint.ipynb
network/.ipynb_checkpoints/mirnas_influence_03_90-checkpoint.ipynb
.virtual_documents/*
network/data/inmune_genes_symbol.txt
network/.virtual_documents/Untitled.ipynb
.idea/Muscle_wasting.iml
.ipynb_checkpoints/Untitled-checkpoint.ipynb
*.tsv
*.txt
database_analysis/sql_operations.py
*.jpg
*.png
8 changes: 7 additions & 1 deletion .idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

104 changes: 104 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,104 @@
<h1>Muscle_wasting</h1>

<p>A bioinformatics pipeline for transcriptomic analysis of muscle wasting conditions. Integrates differential expression, transcription factor analysis, pathway enrichment, and network generation in a unified workflow.</p>

<h2>🚀 Features</h2>
<ul>
<li><strong>run_DEA</strong> – Perform Differential Expression Analysis (DEA) and extract Differentially Dysregulated Sites (DDSs).</li>
<li><strong>run_DDs</strong> – Identify Differentially Expressed Genes (DEGs).</li>
<li><strong>run_TF</strong> – Predict and analyze transcription factors driving the DEGs.</li>
<li><strong>run_pathway</strong> – Conduct pathway enrichment on DEGs or TF targets.</li>
<li><strong>run_network_creation</strong> – Build regulatory networks based on TF–gene relationships.</li>
<li><strong>main_pipeline</strong> – Execute the full workflow in sequence.</li>
</ul>

<h2>📁 Repository Structure</h2>
<pre><code>Muscle_wasting/
├── common_tools/ # Shared scripts and utilities
├── database_analysis/ # Database-driven analyses
├── data/ # Raw/processed input data
├── network/ # Network construction modules
├── paper_mining/ # PubMed paper mining scripts
├── cytoscape/ # Cytoscape-specific tools
├── Databases/ # External data resources
├── ncbi/ # NCBI query modules
├── Tests/ # Unit and integration tests
├── Constants.py # Global constants and config
├── main.py # CLI entrypoint; defines main_pipeline
├── requirements.txt # Python dependencies
└── DE_genes.ods # Example DEG output file
</code></pre>

<h2>🛠️ Installation</h2>
<p>Install required packages:</p>
<pre><code>pip install -r requirements.txt</code></pre>

<p>Main dependencies:</p>
<ul>
<li>pandas</li>
<li>numpy</li>
<li>scipy</li>
<li>statsmodels (or DESeq2/edgeR via rpy2)</li>
<li>networkx</li>
<li>gseapy or enrichr</li>
<li>py4cytoscape</li>
<li>pytest</li>
</ul>

<h2>⚙️ Usage</h2>

<h3>As a library:</h3>
<pre><code>from main import run_DEA, run_DDs, run_TF, run_pathway, run_network_creation

dds = run_DEA(input_counts, sample_metadata)
degs = run_DDs(dds)
tfs = run_TF(degs)
pathways = run_pathway(degs)
network = run_network_creation(tfs, degs)
</code></pre>

<h3>Via the command-line:</h3>
<pre><code>python main.py \
--input counts.csv \
--metadata meta.csv \
--output_dir results/
</code></pre>

<p>This runs the full <code>main_pipeline</code>, sequentially performing:</p>
<ol>
<li>DEA & DDS detection</li>
<li>DEG calling</li>
<li>TF analysis</li>
<li>Pathway enrichment</li>
<li>Network construction</li>
</ol>

<p>Results are stored in <code>results/</code> as CSV and graph files.</p>

<h2>🧪 Testing</h2>
<p>Run tests from the repo root:</p>
<pre><code>pytest</code></pre>

<h2>📄 Output</h2>
<ul>
<li>DDS list</li>
<li>DEG list</li>
<li>Predicted TFs and regulators</li>
<li>Enriched pathway reports</li>
<li>Network export formats (e.g. SIF, GraphML)</li>
</ul>

<h2>📚 Citation</h2>
<p>If you publish results using this pipeline, please cite the original <a href="https://github.com/GuerreroVazquez/Muscle_wasting">GuerreroVazquez/Muscle_wasting</a> repository.</p>

<h2>❓ Questions & Contributing</h2>
<p>For bugs, feature requests, or contributions, please open an issue or pull request.</p>

<hr />

<h3>Author</h3>
<p>
Karen Guerrero‑Vazquez<br />
📧 <a href="mailto:Karen.GuerreroVazquez@gmail.com">GuerreroVazquez.Karen@gmail.com</a><br />
GitHub: <a href="https://github.com/GuerreroVazquez">GuerreroVazquez</a>
</p>
6 changes: 6 additions & 0 deletions Untitled.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
{
"cells": [],
"metadata": {},
"nbformat": 4,
"nbformat_minor": 5
}
121 changes: 121 additions & 0 deletions cytoscape/Data/Apps_details.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,121 @@
{
"gene_mania": {
"source": "gene_mania",
"node_keys": [
"id",
"shared_name",
"score",
"node_type",
"query_term",
"log_score",
"name",
"annotations",
"annotation_name",
"gene_name",
"SUID",
"selected"
],
"edge_keys": [
"id",
"source",
"target",
"shared_name",
"highlight",
"normalized_max_weight",
"shared_interaction",
"raw_weights",
"name",
"interaction",
"SUID",
"networks",
"selected",
"data_type"
],
"main_name": "gene_name"
},
"tf": {
"source": "tf",
"node_keys": [
"shared_name",
"name"
],
"edge_keys": [
"shared_name",
"shared_interaction",
"name",
"interaction",
"weight",
"direction"
],
"main_name": "name"
},
"stringdb": {
"edge_keys": [
"id",
"source",
"target",
"shared_name",
"stringdb_textmining",
"shared_interaction",
"stringdb_score",
"name",
"interaction",
"SUID",
"stringdb_coexpression",
"selected"
],
"main_name": "name",
"node_keys": [
"id",
"compartment_golgi_apparatus",
"stringdb_namespace",
"tissue_urine",
"compartment_nucleus",
"stringdb_structures",
"tissue_saliva",
"stringdb_species",
"tissue_liver",
"compartment_extracellular",
"tissue_stomach",
"selected",
"compartment_endosome",
"tissue_heart",
"compartment_lysosome",
"stringdb_enhancedLabel_Passthrough",
"stringdb_canonical_name",
"tissue_eye",
"compartment_cytoskeleton",
"tissue_nervous_system",
"tissue_pancreas",
"target_family",
"tissue_bone",
"stringdb_node_type",
"name",
"tissue_muscle",
"stringdb_STRING_style",
"compartment_mitochondrion",
"target_development_level",
"tissue_lung",
"compartment_cytosol",
"tissue_skin",
"compartment_plasma_membrane",
"shared_name",
"stringdb_interactor_score",
"tissue_kidney",
"SUID",
"_id",
"compartment_endoplasmic_reticulum",
"stringdb_description",
"tissue_thyroid_gland",
"display_name",
"tissue_adrenal_gland",
"tissue_gall_bladder",
"stringdb_sequence",
"tissue_intestine",
"stringdb_database_identifier",
"tissue_blood",
"tissue_spleen"
],
"source": "stringdb"
}
}
108 changes: 108 additions & 0 deletions cytoscape/Data/Apps_details.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,108 @@
stringdb:
edge_keys:
- id
- source
- target
- shared_name
- stringdb_textmining
- shared_interaction
- stringdb_score
- name
- interaction
- SUID
- stringdb_coexpression
- selected
main_name: name
node_keys:
- id
- compartment_golgi_apparatus
- stringdb_namespace
- tissue_urine
- compartment_nucleus
- stringdb_structures
- tissue_saliva
- stringdb_species
- tissue_liver
- compartment_extracellular
- tissue_stomach
- selected
- compartment_endosome
- tissue_heart
- compartment_lysosome
- stringdb_enhancedLabel_Passthrough
- stringdb_canonical_name
- tissue_eye
- compartment_cytoskeleton
- tissue_nervous_system
- tissue_pancreas
- target_family
- tissue_bone
- stringdb_node_type
- name
- tissue_muscle
- stringdb_STRING_style
- compartment_mitochondrion
- target_development_level
- tissue_lung
- compartment_cytosol
- tissue_skin
- compartment_plasma_membrane
- shared_name
- stringdb_interactor_score
- tissue_kidney
- SUID
- _id
- compartment_endoplasmic_reticulum
- stringdb_description
- tissue_thyroid_gland
- display_name
- tissue_adrenal_gland
- tissue_gall_bladder
- stringdb_sequence
- tissue_intestine
- stringdb_database_identifier
- tissue_blood
- tissue_spleen

gene_mania:
edge_keys:
- id
- source
- target
- shared_name
- highlight
- normalized_max_weight
- shared_interaction
- raw_weights
- name
- interaction
- SUID
- networks
- selected
- data_type
main_name: gene_name
node_keys:
- id
- shared_name
- score
- node_type
- query_term
- log_score
- name
- annotations
- annotation_name
- gene_name
- SUID
- selected

tf:
edge_keys:
- shared_name
- name
- interaction
- weight
- direction
main_name: name
node_keys:
- shared_name
- name
13 changes: 13 additions & 0 deletions cytoscape/Data/properties_2_keep.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
all:
- id
- source
- target
gene_mania:
- annotation_name
- score
- gene_name
stringdb:
- tissue_urine
tf:
- shared_name
- name
Loading