`railroad` : PDDL-like concurrent multi-agent, probabilistic planning

Multi-Agent Task Planning, supporting concurrency and probabilistic effects.

The railroad planning framework is meant to support concurrent multi-robot task planning under uncertainty. Operators are PDDL-like and defined in Python, so that learned estimators can be used to specify timing, probabilities, and costs. Planning is C++-based for efficiency, and use MCTS with an uncertainty-aware h-sum heuristic (still a work in progress) as its value function.

Developed by the Robot Anticipatory Intelligence & Learning (RAIL) Group @ GMU, led by Prof. Gregory J. Stein.

Key properties

States store both active fluents and upcoming effects: actions add effects to a queue, which update the active fluents as time advances.
Concurrency: state transitions advance time until an agent is marked (free {agent_name}), letting multiple robots act concurrently.
Probabilistic state transitions: effects can be probabilistic
Planning via MCTS: planning via Monte Carlo Tree Search over joint action spaces

Quickstart via the `uv` package manager

mkdir railroad-env && cd railroad-env && uv venv
uv pip install 'git+https://github.com/RAIL-group/railroad.git@gjstein/code-cleanup#subdirectory=packages/railroad'
uv run railroad example multi-object-search

Quick Example: Two-Robot Object Search

Run this example in a Google Colab notebook.

Two robots concurrently move and search to find a Knife and a Cup in a five-room space.

import numpy as np
from railroad.core import Fluent as F, get_action_by_name, State, Operator, Effect
import railroad.operators
from railroad.environment import SymbolicEnvironment
from railroad.planner import MCTSPlanner
from railroad.dashboard import PlannerDashboard

locations = {
    "den":     np.array([5, 5]),
    "kitchen": np.array([0, 0]),
    "bedroom": np.array([10, 0]),
    "office":  np.array([0, 8]),
    "garage":  np.array([10, 8]),
}
objects_by_type = {
    "robot":    {"robot1", "robot2"},
    "location": set(locations),
    "object":   {"Knife", "Cup"},
}
# Ground truth object locations (unknown to robots initially)
true_object_locations = {"kitchen": {"Cup"}, "garage": {"Knife"}}

def move_time(robot, loc_from, loc_to):
    return float(np.linalg.norm(locations[loc_from] - locations[loc_to]))

move = Operator(
    name="move",
    parameters=[("?r", "robot"), ("?from", "location"), ("?to", "location")],
    preconditions=[F("at ?r ?from"), F("free ?r")],
    effects=[  # not free at t=0, free again at destination after move_time
        Effect(time=0, resulting_fluents={F("not free ?r"), F("not at ?r ?from")}),
        Effect(time=(move_time, ["?r", "?from", "?to"]),
               resulting_fluents={F("free ?r"), F("at ?r ?to")}),
    ],
)

@railroad.operators.numeric  # decorator to allow algebraic "1 - prob"
def object_find_prob(robot: str, loc: str, obj: str) -> float:
    objects_here = true_object_locations.get(loc, set())
    return 0.9 if obj in objects_here else 0.2

search = Operator(
    name="search",
    parameters=[("?r", "robot"), ("?loc", "location"), ("?obj", "object")],
    preconditions=[F("at ?r ?loc"), F("free ?r"), F("not found ?obj"),
                   F("not revealed ?loc"), F("not searched ?loc ?obj")],
    effects=[  # after 5s, location is searched, revealing if object is there
        Effect(time=0, resulting_fluents={F("not free ?r")}),
        Effect(time=5.0, resulting_fluents={F("free ?r"), F("searched ?loc ?obj")},
               prob_effects=[  # find object with p=object_find_prob
                   ((object_find_prob, ["?r", "?loc", "?obj"]),
                    [Effect(time=0, resulting_fluents={F("found ?obj"), F("at ?obj ?loc")})]),
                   ((1 - object_find_prob, ["?r", "?loc", "?obj"]), []),
               ]),
    ],
)

# Both robots start free in the den
initial_state = State(0.0, {
    F("free robot1"), F("free robot2"),
    F("at robot1 den"), F("at robot2 den"),
    F("revealed den"),
})

goal = F("found Knife") & F("found Cup")

env = SymbolicEnvironment(
    state=initial_state, objects_by_type=objects_by_type,
    operators=[move, search],
    true_object_locations=true_object_locations,
)

def fluent_filter(f):
    return any(kw in f.name for kw in ["at", "holding", "found"])

with PlannerDashboard(goal, env, fluent_filter=fluent_filter) as dashboard:
    # Plan-act loop: replan whenever a robot becomes free
    for _ in range(20):
        if goal.evaluate(env.state.fluents):
            break

        actions = env.get_actions()
        planner = MCTSPlanner(actions)
        action_name = planner(env.state, goal, max_iterations=10000, c=200)
        action = get_action_by_name(actions, action_name)
        env.act(action)
        dashboard.update(planner, action_name)

The planner dispatches both robots in parallel. Sample dashboard output:

Actions Taken (5)
         |0.0                       12.1|
  robot1 |1                4            |
  robot2 |2             3           5   |

  1. move robot1 den kitchen
  2. move robot2 den garage
  3. search robot2 garage Knife
  4. search robot1 kitchen Cup
  5. move robot2 garage office

Goal:
  AND(✓(found Knife), ✓(found Cup))

Total cost: 12.1 (seconds)

Built-in Examples

uv run railroad example <name>

multi-object-search -- Search for and collect multiple objects with multiple robots
clear-table -- Clear objects from a table (demonstrates negative goals)
find-and-move-couch -- Cooperative task requiring two robots (demonstrates wait operators)
heterogeneous-robots -- Drone, rover, and crawler with different speeds and capabilities
- Add --interruptible-moves to allow rerouting robots mid-transit

Key Concepts

Fluent -- A fact about the world: F("at robot1 kitchen"), F("free robot1")
State -- The current set of fluents, the time, and any upcoming effects
Operator -- An action template with typed parameters: move ?robot ?from ?to
Effect -- A state change that happens at a specified time; can be probabilistic
Goal -- A target condition to achieve, built from fluents with &, |, and ~

Goal Expressions

Goals compose with Python operators:

from railroad.core import Fluent as F

F("found Knife") & F("found Fork")           # AND -- both must be true
F("at robot1 kitchen") | F("at robot1 bed")   # OR  -- at least one
~F("at Knife table")                          # NOT -- must not hold

# Combine freely
goal = (F("found Knife") | F("found Spoon")) & ~F("at Cup table")

Running via this Repo

Requires Python 3.13+ and uv:

git clone https://github.com/RAIL-group/railroad.git
cd railroad
uv run railroad example multi-object-search   # builds automatically on first run

Architecture

Railroad is organized as a monorepo. The core planning engine lives in packages/railroad/:

packages/railroad/
  include/               # C++ headers (A* search, MCTS, FF heuristic)
  src/railroad/
    _bindings.cpp        # pybind11 bridge
    core.py              # Fluent, State, Action, Operator, Effect, Goal
    planner.py           # MCTSPlanner (wraps C++ MCTS with automatic preprocessing)
    operators/           # Helper constructors for move, search, pick, place, wait
    environment/
      environment.py     # Abstract Environment base class
      symbolic.py        # SymbolicEnvironment for simulation and testing
      skill.py           # ActiveSkill protocol
      procthor/          # Optional AI2-THOR/ProcTHOR 3D simulator integration
    examples/            # Built-in runnable examples
    bench/               # Benchmarking framework with MLflow + Plotly Dash

Additional packages:

packages/environments/ -- Extra environment backends (e.g. PyRoboSim)

Development

uv run ty check              # type-check (fast, run first)
uv run pytest                # full test suite
uv run pytest -vk <filter>   # run specific tests

uv run automatically detects changes to source files (including C++) and rebuilds as needed.

Name		Name	Last commit message	Last commit date
Latest commit History 646 Commits
.github/workflows		.github/workflows
packages		packages
resources/pyrobosim_worlds		resources/pyrobosim_worlds
scripts		scripts
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
README.old.md		README.old.md
conftest.py		conftest.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`railroad` : PDDL-like concurrent multi-agent, probabilistic planning

Key properties

Quickstart via the `uv` package manager

Quick Example: Two-Robot Object Search

Built-in Examples

Key Concepts

Goal Expressions

Running via this Repo

Architecture

Development

About

Uh oh!

Releases

Packages

Contributors 5

Uh oh!

Languages

License

RAIL-group/railroad

Folders and files

Latest commit

History

Repository files navigation

railroad : PDDL-like concurrent multi-agent, probabilistic planning

Key properties

Quickstart via the uv package manager

Quick Example: Two-Robot Object Search

Built-in Examples

Key Concepts

Goal Expressions

Running via this Repo

Architecture

Development

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Uh oh!

Languages

`railroad` : PDDL-like concurrent multi-agent, probabilistic planning

Quickstart via the `uv` package manager

Packages