🧠 SPhyR

A Spatial Physical Reasoning Benchmark

🤗 SPhyR on HuggingFace

You can also explore or download the dataset directly from Hugging Face:

🔗 SPhyR on Hugging Face

🔁 How to Re-Generate the Dataset

Follow these steps to recreate the dataset from scratch.

🛠️ Step 1: Installation

Create a Conda Environment
Make sure you have Miniconda or Anaconda installed.
```
conda create -n "sphyr" python -y
conda activate sphyr
```
Install Poetry & Project Dependencies
Poetry is used for dependency management.
```
pip install poetry
poetry install
```

🦏 Step 2: Rhinoceros 8.0 & Grasshopper Setup

Download Rhinoceros 8.0
Rhino includes the Grasshopper visual programming environment.
📥 Download here
Install Millipede Plugin
Move the Millipede plugin to Grasshopper's special components folder:
```
src/sphyr/dataset_creation/topology_optimization_data/2D/rhino_grasshopper/libraries/millipede
```
You can access the special folder in Grasshopper via:
File > Special Folders > Components Folder
Open the Rhino & Grasshopper Files
- Rhino File:
  src/sphyr/dataset_creation/topology_optimization_data/2D/rhino_grasshopper/SPhyR_2D.3dm
- Grasshopper Script:
  src/sphyr/dataset_creation/topology_optimization_data/2D/rhino_grasshopper/SPhyR_2D.gh
✅ Once opened, run the Grasshopper script by toggling the boolean on the top-left of the canvas.

💡 Tip: If you'd rather skip this step, precomputed results are available:
- Raw Data: src/sphyr/dataset_creation/topology_optimization_data/2D/raw_data
- Plots/Frames: src/sphyr/dataset_creation/topology_optimization_data/2D/frames

📦 Step 3: Convert to JSON (HuggingFace Dataset Format)

Run the following Python script to convert raw simulation output to a format suitable for evaluation on HuggingFace:

python src/sphyr/dataset_creation/raw_data_2D_to_huggingface_datasets.py

This script processes the .csv simulation outputs into structured .json entries.

📊 Additional Information

🧪 Results Overview

Benchmarks for 100 samples are available for the following models:

Claude 3.7 Sonnet
Claude Opus 4
DeepSeek-R1
Gemini 1.5 Pro
Gemini 2.5 Pro
GPT-3.5 Turbo
GPT-4.1
GPT-4o
Perplexity Sonar
Perplexity Sonar Reasoning

📁 You can find these results inside the results directory.

3D Topology Optimization Data

We have included a preliminary sub-set of 3D data and corresponding plots, but we plan to release a full set in the future. 3D Data can be found here: src/sphyr/dataset_creation/topology_optimization_data/3D.

Citation

BibTeX:

@misc{siedler2025sphyr,
  title        = {SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution},
  author       = {Philipp D. Siedler},
  year         = {2025},
  eprint       = {2505.16048},
  archivePrefix= {arXiv},
  primaryClass = {cs.AI},
  doi          = {10.48550/arXiv.2505.16048},
  url          = {https://arxiv.org/abs/2505.16048}
}

APA:

Siedler, P. D. (2025). SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution. arXiv. https://doi.org/10.48550/arXiv.2505.16048

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
.vscode		.vscode
docs		docs
results		results
src/sphyr		src/sphyr
tests/metrics		tests/metrics
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
highlighted_output.tex		highlighted_output.tex
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 SPhyR

🤗 SPhyR on HuggingFace

🔁 How to Re-Generate the Dataset

🛠️ Step 1: Installation

🦏 Step 2: Rhinoceros 8.0 & Grasshopper Setup

📦 Step 3: Convert to JSON (HuggingFace Dataset Format)

📊 Additional Information

🧪 Results Overview

3D Topology Optimization Data

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

philippds/SPhyR

Folders and files

Latest commit

History

Repository files navigation

🧠 SPhyR

🤗 SPhyR on HuggingFace

🔁 How to Re-Generate the Dataset

🛠️ Step 1: Installation

🦏 Step 2: Rhinoceros 8.0 & Grasshopper Setup

📦 Step 3: Convert to JSON (HuggingFace Dataset Format)

📊 Additional Information

🧪 Results Overview

3D Topology Optimization Data

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages