Agentic Engineering Design

Multi-agent LLM framework for conceptual systems engineering and design.

This repository implements the framework described in the paper "Agentic Large Language Models for Conceptual Systems Engineering and Design" published in the Journal of Mechanical Design. It provides a structured multi-agent workflow that guides LLM agents through requirements extraction, functional decomposition, and simulator code generation for engineering design tasks.

Key Features

Multi-Agent System (MAS): 9-role orchestrated workflow for comprehensive design
Two-Agent System (2AS): Simplified Generator-Reflector loop for ablation studies
Design-State Graph (DSG): JSON-serializable representation bundling requirements, physical embodiments, and Python physics models
Modular Architecture: Plug-and-play LLM backends (OpenAI, Ollama, custom endpoints)
Tool Integration: Web search, arXiv search, Python REPL, graph manipulation tools

Quick Start

Option A — Conda (recommended)

git clone https://github.com/SoheylM/agentic-eng-design.git
cd agentic-eng-design
python bootstrap_env.py          # creates env, installs deps + pre-commit hooks
conda activate agentic-eng-design
cp .env.example .env             # edit with your API keys

Option B — pip only

git clone https://github.com/SoheylM/agentic-eng-design.git
cd agentic-eng-design
pip install -e .[dev]            # editable install with dev tools
pre-commit install               # set up pre-commit hooks
cp .env.example .env             # edit with your API keys

Note: requirements.txt is kept for backwards compatibility. All dependency management lives in pyproject.toml.

Configure LLM Backend

Edit llm_models.py to use your preferred LLM backend:

OpenAI API: Set openai_api_key and openai_api_base
Local vLLM: Set openai_api_base="http://localhost:8000/v1"
Local Ollama: Set openai_api_base="http://localhost:11434/v1"
Local SGLang: Set openai_api_base="http://localhost:8002/v1"

Demo Workflow

Step 1: Run Experiments

For Water System (Solar-Powered Water Filtration):

python run_pipeline.py --system water --llm reasoning --temp 1.0 --workflow mas --runs 1

For UAM System (eVTOL Aircraft):

python run_pipeline.py --system uam --llm reasoning --temp 1.0 --workflow mas --runs 1

For Full Experimental Study:

python run_pipeline.py  # Runs all combinations

Step 2: Visualize Results

For Water System:

python visualization/visualize_third_best_dsg.py

For UAM System:

python visualization/visualize_uam_dsg.py

Step 3: Display Metrics

Quick Terminal Display (for demos):

python display_metrics.py <batch_id>
# Example: python display_metrics.py 20250615_185047

Generate Detailed Reports:

python eval_saved.py --batch-id <batch_id>
# Example: python eval_saved.py --batch-id 20250615_185047

Evaluate All Batches:

python eval_all.py

Metrics Explanation

The framework evaluates Design-State Graphs (DSGs) using 7 metrics:

M1 (JSON Validity): Percentage of valid JSON outputs
M2 (Requirements Coverage): Percentage of system requirements addressed
M3 (Embodiment Presence): Percentage of nodes with physical embodiments
M4 (Code Compatibility): Percentage of generated Python code that executes successfully
M5 (Workflow Completion): Percentage of runs that complete successfully
M6 (Runtime): Average execution time in seconds
M7 (Node Count): Average number of nodes in the final DSG

Note: M2 automatically adapts to the system type (water vs UAM) based on the Cahier des Charges requirements.

IDETC Demo Workflow

For conference demonstrations, follow this sequence:

Show Existing Results (Water System):

python visualization/visualize_third_best_dsg.py
python display_metrics.py 20250615_185047

Run Live Demo (UAM System):

python run_pipeline.py --system uam --llm reasoning --temp 1.0 --workflow mas --runs 1

Show Live Results:

python visualization/visualize_uam_dsg.py
python display_metrics.py <new_batch_id>

This demonstrates the framework's capability to handle different engineering design problems with automatic requirement parsing.

Repository Structure

agentic-eng-design/
├── agents/                 # Individual agent implementations
├── workflows/              # MAS and 2AS workflow definitions
├── visualization/          # DSG visualization and analysis tools
├── experiment_results/     # Experimental outputs and metrics
├── config.py              # Environment configuration
├── data_models.py         # Pydantic data models
├── llm_models.py          # LLM client setup
└── tools.py               # Agent tool definitions

Development

This project uses modern Python tooling for code quality:

Tool	Purpose	Command
Ruff	Linting & formatting	`ruff check .` / `ruff format .`
MyPy	Static type checking	`mypy .`
Pre-commit	Git hooks for auto-checks	`pre-commit run --all-files`
Pytest	Testing	`pytest tests/ -v`

Pre-commit hooks run automatically on git commit. To run all checks manually:

pre-commit run --all-files

Configuration lives in pyproject.toml (ruff, mypy, pytest) and .pre-commit-config.yaml.

Citation

If you use this work, please cite:

@article{10.1115/1.4070328,
    author = {Massoudi, Soheyl and Fuge, Mark},
    title = {Agentic Large Language Models for Conceptual Systems Engineering and Design},
    journal = {Journal of Mechanical Design},
    volume = {148},
    number = {5},
    pages = {051405},
    year = {2026},
    month = {01},
    doi = {10.1115/1.4070328},
    url = {https://doi.org/10.1115/1.4070328},
}

License

This project is licensed under the MIT License - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic Engineering Design

Key Features

Quick Start

Option A — Conda (recommended)

Option B — pip only

Configure LLM Backend

Demo Workflow

Step 1: Run Experiments

Step 2: Visualize Results

Step 3: Display Metrics

Metrics Explanation

IDETC Demo Workflow

Repository Structure

Development

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 292 Commits
agents		agents
experiment_logs		experiment_logs
experiment_results		experiment_results
langsmith_traces		langsmith_traces
runs		runs
tests		tests
visualization		visualization
workflows		workflows
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
bootstrap_env.py		bootstrap_env.py
config.py		config.py
data_models.py		data_models.py
display_metrics.py		display_metrics.py
environment.yml		environment.yml
eval_all.py		eval_all.py
eval_saved.py		eval_saved.py
experiment_config.py		experiment_config.py
graph_utils.py		graph_utils.py
llm_models.py		llm_models.py
prompts.py		prompts.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
run_pipeline.py		run_pipeline.py
tools.py		tools.py
utils.py		utils.py
validation.py		validation.py

Folders and files

Latest commit

History

Repository files navigation

Agentic Engineering Design

Key Features

Quick Start

Option A — Conda (recommended)

Option B — pip only

Configure LLM Backend

Demo Workflow

Step 1: Run Experiments

Step 2: Visualize Results

Step 3: Display Metrics

Metrics Explanation

IDETC Demo Workflow

Repository Structure

Development

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages