Active Define: The Clinical Semantic Layer

](assets/dashboard_demo.png)

Active Define transforms define.xml from a passive documentation artifact into an executable semantic graph.

Instead of hardcoding SQL or Python filters for every clinical trial (e.g., WHERE VSTESTCD='DIABP'), this engine reads the XML metadata contract and compiles validation logic dynamically.

🏗 Architecture

graph LR
    A[define.xml] -->|Compiler| B(Semantic Graph)
    C[Raw Data .xpt] -->|Engine| D{Auto-Filter}
    B --> D
    D --> E[Validated Subsets]

🚀 Project Structure

This project follows a production-grade modular architecture:

active-define/
│
├── data/                   # Raw inputs (define.xml, vs.xpt)
├── src/                    # Core Engine Logic
│   └── active_define/      # The Python Package (Parser, Transpiler)
├── scripts/                # Utility Tools
│   ├── setup.py            # Auto-provisions CDISC Pilot 01 test data
│   ├── visualize.py        # Generates Mermaid.js logic diagrams
│   └── export.py           # Compiles XML to clean JSON
├── notebooks/              # Interactive Demos
├── run_demo.py             # CLI Entry Point
└── compiled_metadata.json  # Artifact: The compiled logic schema

🛠 Usage

1. Setup

Install dependencies and download the official CDISC Pilot 01 dataset.

pip install -r requirements.txt
python scripts/setup.py

2. Run the Engine

Execute the semantic graph against the raw Vital Signs data.

python run_demo.py

Output: The engine will identify 6 unique Vital Sign definitions in the XML and automatically slice the 29,000+ row dataset into validated cohorts without hardcoded filters.

3. Generate Artifacts

Visualize the logic extracted from the XML.

python scripts/visualize.py

🧠 The "Why"

In traditional Clinical Programming, we manually write code that duplicates the logic in define.xml. Active Define proves that metadata can be Infrastructure as Code—driving the ingestion process automatically.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
notebooks		notebooks
scripts		scripts
src/active_define		src/active_define
.gitignore		.gitignore
README.md		README.md
app.py		app.py
compiled_metadata.json		compiled_metadata.json
requirements.txt		requirements.txt
run_demo.py		run_demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Active Define: The Clinical Semantic Layer

🏗 Architecture

🚀 Project Structure

🛠 Usage

1. Setup

2. Run the Engine

3. Generate Artifacts

🧠 The "Why"

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

mnouira02/active-define-core

Folders and files

Latest commit

History

Repository files navigation

Active Define: The Clinical Semantic Layer

🏗 Architecture

🚀 Project Structure

🛠 Usage

1. Setup

2. Run the Engine

3. Generate Artifacts

🧠 The "Why"

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages