Implement modular benchmarking pipeline for patient graph evaluation by Vicbi · Pull Request #14 · DaneshjouLab/MonopathDAGs

Vicbi · 2025-05-10T00:19:47Z

Implement modular benchmarking pipeline for patient graph evaluation

♻️ Current Situation & Problem

This PR introduces a new modular benchmarking pipeline for evaluating patient trajectory graphs derived from clinical case reports. It addresses the need to validate and compare different representation techniques (LLM-based reconstructions, BERTScore, structural checks, and trajectory embeddings).

⚙️ Release Notes

Added LLMReconstructor using DSPy-compatible APIs for narrative generation from graph data.
Integrated BERTScoreEvaluator to compare reconstructed vs. original text.
Introduced TopologyValidator for validating DAG structure, timestamp order, and connected components.
Added TrajectoryEmbedder based on Bio_ClinicalBERT with pooling strategy for per-patient vectorization.
Added CLI scripts (main.py, batch_run.py) to run the pipeline on single or multiple graphs.
New visualization utilities support t-SNE, heatmap, and cluster plotting.
HTML input support for case report text.

📚 Documentation

Each module includes inline docstrings and logging for clarity and debugging.
Configurable via config.py to easily switch between LLM backends and embedding models.
Visuals and results are saved in output/ and output/plots.
Usage scripts added for both single-graph (main.py) and batch mode (batch_run.py).
README to be updated in a follow-up PR.

📝 Code of Conduct & Contributing Guidelines

By submitting this pull request, you agree to follow our Coding Guidelines:

I agree to follow the Coding Guidelines.

…uster boundaries

Implement modular benchmarking pipeline for patient graph evaluation

fb24c4a

Vicbi requested a review from gtcha2 May 10, 2025 00:22

Vicbi added 10 commits May 9, 2025 19:33

Update

b792b8e

Run benchmark on test set of 88 cases in

bec50fe

Update in

09f3b51

Add string similarity metrics, remove preliminary testing results

591989c

Add benchmarking results in

6aa85eb

Clean unecessary results

e919fd7

Update clustering pipeline

3cf64a9

cleanup: benchmark code and results

0db3799

Get BERTScore results in histograms

40eaf82

Add t-SNE plots with cancer type coloring, metastasis markers, and cl…

4dbf510

…uster boundaries

gtcha2 merged commit 9bb7b55 into main May 15, 2025
2 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Implement modular benchmarking pipeline for patient graph evaluation#14

Implement modular benchmarking pipeline for patient graph evaluation#14
gtcha2 merged 11 commits intomainfrom
similarity_benchmark_vicky

Vicbi commented May 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

Vicbi commented May 10, 2025

Implement modular benchmarking pipeline for patient graph evaluation

♻️ Current Situation & Problem

⚙️ Release Notes

📚 Documentation

📝 Code of Conduct & Contributing Guidelines

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants