The universal framework for AI-native generalist robotics
Dimensional is an open-source framework for building agentive generalist robots. DimOS allows off-the-shelf Agents to call tools/functions and read sensor/state data directly from ROS.
The framework enables neurosymbolic orchestration of Agents as generalized spatial reasoners/planners and Robot state/action primitives as functions.
The result: cross-embodied "Dimensional Applications" that are exceptional at generalization and robust at symbolic action execution.
We are shipping a first look at the DimOS x Unitree Go2 integration, allowing off-the-shelf Agents() to "call" Unitree ROS2 Nodes and WebRTC action primitives, including:
- Navigation control primitives (move, reverse, spinLeft, spinRight, etc.)
- WebRTC control primitives (FrontPounce, FrontFlip, FrontJump, etc.)
- Camera feeds (image_raw, compressed_image, etc.; see the sketch after this list)
- IMU data
- State information
- Lidar / PointCloud primitives 🚧
- Any other generic Unitree ROS2 topics
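For example, reading the camera feed is just subscribing to a stream. A minimal sketch, assuming `get_ros_video_stream()` returns an RxPY Observable of decoded camera frames (the frame type and IP handling here are illustrative, not a documented contract):

```python
import os
import time

from dimos.robot.unitree.unitree_go2 import UnitreeGo2

robot = UnitreeGo2(ip=os.getenv("ROBOT_IP"))

# Subscribe to the robot's camera stream and log each frame as it arrives.
# We assume frames arrive as numpy-like arrays; adapt the callback to your needs.
subscription = robot.get_ros_video_stream().subscribe(
    on_next=lambda frame: print("frame received:", getattr(frame, "shape", type(frame))),
    on_error=lambda err: print("stream error:", err),
)

time.sleep(10)          # let a few frames arrive
subscription.dispose()  # stop listening
```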
DimOS Agents
- Agent() classes with planning, spatial reasoning, and Robot.Skill() function calling abilities.
- Integrate with any off-the-shelf model: OpenAIAgent, GeminiAgent 🚧, DeepSeekAgent 🚧, HuggingfaceAgent 🚧, etc.
- Modular agent architecture for easy extensibility and chaining of Agent output --> Subagents input.
- Agent spatial / language memory for location grounded reasoning and recall. 🚧
DimOS Infrastructure
- A reactive data streaming architecture using RxPY to manage real-time video (or other sensor input), outbound commands, and inbound robot state between the DimOS interface, Agents, and ROS2 (see the sketch after this list).
- Robot command queue (ROSCommandQueue) to handle complex multi-step action sequences sent to the Robot.
- Simulation bindings (Genesis, Isaac Sim, etc.) to test your agentive application before deploying to a physical robot.
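To make the reactive pattern concrete, here is a minimal, framework-agnostic sketch using plain RxPY (reactivex). It is not DimOS API; it only illustrates the sensor --> agent --> command pipeline that DimOS wires up for you:

```python
import time

import reactivex as rx
from reactivex import operators as ops

# Stand-in "sensor" stream: emits an observation once per second
# (in DimOS this would be camera/IMU/state data flowing out of ROS2).
sensor_stream = rx.interval(1.0).pipe(
    ops.map(lambda tick: {"tick": tick, "obstacle_ahead": tick % 3 == 0})
)

# Stand-in "planner" stage: maps observations to commands
# (in DimOS this is an Agent deciding which Robot.Skill to call).
command_stream = sensor_stream.pipe(
    ops.map(lambda obs: "spinLeft" if obs["obstacle_ahead"] else "move")
)

# Stand-in "executor" stage: consumes commands
# (in DimOS this is the robot command queue dispatching to ROS2).
command_stream.subscribe(lambda cmd: print(f"queueing command: {cmd}"))

time.sleep(5)  # keep the process alive long enough to see a few emissions
```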
DimOS Interface / Development Tools
- Local development interface to control your robot, orchestrate agents, visualize camera/lidar streams, and debug your dimensional agentive application.
⚠️ Recommended way to start. You will need:
- Docker and Docker Compose installed
- A Unitree Go2 robot accessible on your network
- The robot's IP address
- OpenAI API Key
Configure your environment variables in `.env`:

```
OPENAI_API_KEY=<OPENAI_API_KEY>
ROBOT_IP=<ROBOT_IP>
CONN_TYPE=webrtc
WEBRTC_SERVER_HOST=0.0.0.0
WEBRTC_SERVER_PORT=9991
DISPLAY=:0
```

```bash
xhost +local:root  # If running locally and you want the RViz GUI
docker compose -f docker/unitree/agents_interface/docker-compose.yml up --build
```

The interface will start at http://localhost:3000.
To install and run DimOS locally without Docker, you will need:
- A Unitree Go2 robot accessible on your network
- The robot's IP address
- OpenAI API Key
```bash
sudo apt install python3-venv
# Clone the repository
git clone --recurse-submodules https://github.com/dimensionalOS/dimos-unitree.git
cd dimos-unitree
# Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate
sudo apt install portaudio19-dev python3-pyaudio
# Install dependencies
pip install -r requirements.txt
# Copy and configure environment variables
cp default.env .env
```

Requirements for the Unitree Go2 ROS2 SDK:
- Ubuntu 22.04
- ROS2 Distros: Iron, Humble, Rolling
See Unitree Go2 ROS2 SDK for additional installation instructions.
```bash
mkdir -p ros2_ws
cd ros2_ws
git clone --recurse-submodules https://github.com/abizovnuralem/go2_ros2_sdk.git src
sudo apt install ros-$ROS_DISTRO-image-tools
sudo apt install ros-$ROS_DISTRO-vision-msgs
sudo apt install python3-pip clang portaudio19-dev
cd src
pip install -r requirements.txt
cd ..
# Ensure clean python install before running
source /opt/ros/$ROS_DISTRO/setup.bash
rosdep install --from-paths src --ignore-src -r -y
colcon build
```

Source the workspace and launch the Unitree Go2 ROS2 SDK driver:

```bash
# Change path to your Go2 ROS2 SDK installation
source /ros2_ws/install/setup.bash
source /opt/ros/$ROS_DISTRO/setup.bash
export ROBOT_IP="robot_ip"  # for multiple robots, separate the IPs with commas
export CONN_TYPE="webrtc"
ros2 launch go2_robot_sdk robot.launch.py
```

Run a DimOS test application:

```bash
# Change path to your Go2 ROS2 SDK installation
source /ros2_ws/install/setup.bash
python test/test_planning_agent_web_interface.py
```

To run the DimOS web interface:

```bash
cd dimos/web/dimos_interface
yarn install
yarn dev  # you may need to run sudo if previously built via Docker
```

Project structure (non-production directories excluded):
```
.
├── dimos/
│ ├── agents/ # Agent implementation and behaviors
│ ├── robot/ # Robot control and hardware interface
│ │ └── unitree/ # Unitree Go2 specific control and skills implementations
│ ├── stream/ # WebRTC and data streaming
│ ├── web/ # DimOS development interface and API
│ │ └── dimos_interface/ # DimOS web interface
│ ├── simulation/ # Robot simulation environments
│ ├── utils/ # Utility functions and helpers
│ └── types/ # Type definitions and interfaces
├── tests/ # Test files
└── docker/ # Docker configuration files and compose definitions
```
Run a simple DimOS application that connects an OpenAIAgent to the robot's skills and camera stream:

```python
import os
import time

from dimos.robot.unitree.unitree_go2 import UnitreeGo2
from dimos.robot.unitree.unitree_skills import MyUnitreeSkills
from dimos.robot.unitree.unitree_ros_control import UnitreeROSControl
from dimos.agents.agent import OpenAIAgent
robot_ip = os.getenv("ROBOT_IP")

# Initialize robot
robot = UnitreeGo2(ip=robot_ip,
                   ros_control=UnitreeROSControl(),
                   skills=MyUnitreeSkills())

# Initialize agent
agent = OpenAIAgent(
    dev_name="UnitreeExecutionAgent",
    input_video_stream=robot.get_ros_video_stream(),
    skills=robot.get_skills(),
    system_query="Jump when you see a human! Front flip when you see a dog!",
    model_name="gpt-4o"
)
while True: # keep process running
    time.sleep(1)
```

Let's build a simple DimOS application with Agent chaining. We define a planner as a `PlanningAgent` that takes user input and devises a complex multi-step plan. This plan is passed step-by-step to an executor agent that queues `AbstractRobotSkill` commands to the `ROSCommandQueue`.
Our reactive Pub/Sub data streaming architecture allows chaining of Agent_0 --> Agent_1 --> ... --> Agent_n via the `input_query_stream` parameter on each Agent, which takes an Observable emitted by the previous Agent in the chain.
Via this method you can chain together any number of Agents() to create complex dimensional applications.
```python
# Imports as in the previous example, plus PlanningAgent and RobotWebInterface
web_interface = RobotWebInterface(port=5555)

robot = UnitreeGo2(ip=robot_ip,
                   ros_control=UnitreeROSControl(),
                   skills=MyUnitreeSkills())
# Initialize master planning agent
planner = PlanningAgent(
    dev_name="UnitreePlanningAgent",
    input_query_stream=web_interface.query_stream,  # takes user input from the DimOS interface
    skills=robot.get_skills(),
    model_name="gpt-4o",
)
# Initialize execution agent
executor = OpenAIAgent(
    dev_name="UnitreeExecutionAgent",
    input_query_stream=planner.get_response_observable(),  # takes planner output as input
    skills=robot.get_skills(),
    model_name="gpt-4o",
    system_query="""
    You are a robot execution agent that can execute tasks on a virtual
    robot. ONLY OUTPUT THE SKILLS TO EXECUTE.
    """
)
while True: # keep process running
    time.sleep(1)
```
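Extending the chain is just another `input_query_stream` hookup. Here is a sketch of adding a hypothetical third agent downstream of the executor, assuming `OpenAIAgent` exposes `get_response_observable()` the same way `PlanningAgent` does (the agent's role and prompt are purely illustrative):

```python
# Hypothetical third stage: summarizes what the executor decided to run.
# Chain: planner --> executor --> reporter
reporter = OpenAIAgent(
    dev_name="UnitreeReportingAgent",
    input_query_stream=executor.get_response_observable(),  # takes executor output as input
    model_name="gpt-4o",
    system_query="Summarize in one sentence the skills that were just executed."
)
```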
Call action primitives directly from Robot() for prototyping and testing:

```python
robot = UnitreeGo2(ip=robot_ip)
# Call a Unitree WebRTC action primitive
robot.webrtc_req(api_id=1016) # "Hello" command
# Call a ROS2 action primitive
robot.move(distance=1.0, speed=0.5)
```

Create basic custom skills by inheriting from `AbstractRobotSkill` and implementing the `__call__` method:

```python
class Move(AbstractRobotSkill):
    distance: float = Field(..., description="Distance to move in meters")

    def __init__(self, robot: Optional[Robot] = None, **data):
        super().__init__(robot=robot, **data)

    def __call__(self):
        super().__call__()
        return self._robot.move(distance=self.distance)


class JumpAndFlip(AbstractRobotSkill):
    def __init__(self, robot: Optional[Robot] = None, **data):
        super().__init__(robot=robot, **data)

    def __call__(self):
        super().__call__()
        jump = Jump(robot=self._robot)
        flip = Flip(robot=self._robot)
        return (jump() and flip())
```

Tests:

- `tests/run_go2_ros.py`: Tests `UnitreeROSControl(ROSControl)` initialization in `UnitreeGo2(Robot)` via direct function calls `robot.move()` and `robot.webrtc_req()`.
- `tests/simple_agent_test.py`: Tests a simple zero-shot `OpenAIAgent` class example.
- `tests/unitree/test_webrtc_queue.py`: Tests `ROSCommandQueue` via 20 back-to-back WebRTC requests to the robot.
- `tests/test_planning_agent_web_interface.py`: Tests a simple two-stage `PlanningAgent` chained to an `ExecutionAgent` with a backend FastAPI interface.
- `tests/test_unitree_agent_queries_fastapi.py`: Tests a zero-shot `ExecutionAgent` with a backend FastAPI interface.
For detailed documentation, please visit our documentation site (Coming Soon).
We welcome contributions! See our Bounty List for open requests for contributions. If you would like to suggest a feature or sponsor a bounty, open an issue.
This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
Huge thanks to:
- The Roboverse Community for their Unitree-specific help. Check out their Discord.
- @abizovnuralem for his work on the Unitree Go2 ROS2 SDK we integrate with for DimOS.
- @legion1581 for his work on the Unitree Go2 WebRTC Connect, from which we've pulled the `Go2WebRTCConnection` class and other types for seamless WebRTC-only integration with DimOS.
- @tfoldi for the `webrtc_req` integration via the Unitree Go2 ROS2 SDK, which allows for seamless usage of Unitree WebRTC control primitives with DimOS.
- GitHub Issues: For bug reports and feature requests
- Email: build@dimensionalOS.com
- Agent() failure to execute Nav2 action primitives (move, reverse, spinLeft, spinRight) is almost always due to the internal ROS2 collision avoidance, which will sometimes incorrectly detect obstacles or be overly sensitive. Look for `[behavior_server]: Collision Ahead - Exiting DriveOnHeading` in the ROS logs. We recommend restarting ROS2 or moving the robot away from nearby objects to resolve this.
- `docker-compose up --build` does not fully initialize the ROS2 environment due to `std::bad_alloc` errors. This can occur during continuous Docker development if `docker-compose down` is not run consistently before rebuilding, and/or if you are on a machine with less RAM, as ROS is very memory intensive. We recommend clearing your Docker cache/images/containers with `docker system prune` and rebuilding.


