[Enhancement] Add Snake Environment - Classic Single-Agent RL Game #202

AlirezaShamsoshoara · 2025-11-17T07:08:57Z

Add Snake Environment - Classic Single-Agent RL Game

Summary

This PR adds a new Snake Environment to OpenEnv, providing a classic snake game implementation based on marlenv's Snake-v1. The environment offers a clean, OpenEnv-compatible interface for reinforcement learning research and experimentation with a well-known game mechanic.

What's New

Core Environment (`src/envs/snake_env/`)

- Client API (client.py): HTTP client for connecting to snake environment servers
- Data Models (models.py): Type-safe action and observation models
  - SnakeAction: Discrete actions (turn left/right or directional movement)
  - SnakeObservation: Rich observations including grid state, score, and metadata
- Server Implementation (server/):
  - FastAPI-based HTTP server
  - Wraps marlenv Snake-v1 with OpenEnv interface
  - Configurable grid size, vision range, and reward functions
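To illustrate what "type-safe" models could mean here, the following is a minimal sketch using plain dataclasses. Only the `action` field and `episode_score` appear elsewhere in this PR; every other field name is hypothetical, and the real models.py may well build on OpenEnv base classes instead.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SnakeAction:
    """Hypothetical stand-in for the action model: one discrete integer."""

    action: int

    def __post_init__(self) -> None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(self.action, bool):
            raise TypeError("action must be an integer, not a bool")
        # Accept "1" as well as 1, e.g. for values arriving from a web form.
        if isinstance(self.action, str):
            self.action = int(self.action.strip())
        if not isinstance(self.action, int):
            raise TypeError(
                f"action must be an integer, got {type(self.action).__name__}"
            )
        if self.action < 0:
            raise ValueError("action must be non-negative")


@dataclass
class SnakeObservation:
    """Hypothetical observation model; field names are illustrative."""

    grid: List[List[int]] = field(default_factory=list)
    episode_score: float = 0.0
    episode_steps: int = 0
    alive: bool = True
```

Validating in `__post_init__` keeps malformed actions from ever reaching the game logic, which is the main point of typing the models in the first place.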

Example Code

examples/snake_simple.py: Comprehensive example demonstrating:
- Docker and local server modes
- Automated gameplay with random/simple policies
- Multi-episode training loops with statistics
- Real-time visualization with matplotlib
- Performance tracking and analysis

Infrastructure

- Docker Support: Full containerization with automated builds
  - Dockerfile for snake environment
  - Updated CI/CD workflow for automated image builds
- Documentation: Extensive README with:
  - Quick start guide
  - API reference
  - Configuration options
  - Troubleshooting tips

Key Features

Gameplay

- Grid-based environment: Configurable grid size (default: 20×20)
- Classic mechanics: Navigate, eat fruits, grow, avoid walls and self-collision
- Two control modes:
  - Snake mode: Relative actions (turn left/right)
  - Human mode: Global directions (up/down/left/right)
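The difference between the two control modes can be sketched as follows. This is an illustration of the idea only, not the marlenv implementation: the action codes for turning left and right are assumptions (only 0 as "go straight" appears in the usage example), and `is_reversal` is a hypothetical helper.

```python
# Headings in clockwise order, so a right turn is +1 and a left turn is -1.
HEADINGS = ["up", "right", "down", "left"]

# Assumed relative action codes; 1 and 2 are illustrative.
STRAIGHT, TURN_LEFT, TURN_RIGHT = 0, 1, 2


def apply_relative_action(heading: str, action: int) -> str:
    """Snake mode: turn relative to the current heading."""
    idx = HEADINGS.index(heading)
    if action == TURN_LEFT:
        idx -= 1
    elif action == TURN_RIGHT:
        idx += 1
    elif action != STRAIGHT:
        raise ValueError(f"unknown relative action: {action}")
    return HEADINGS[idx % len(HEADINGS)]


def is_reversal(heading: str, requested: str) -> bool:
    """Human mode: detect a request to move directly backwards."""
    opposite = (HEADINGS.index(heading) + 2) % len(HEADINGS)
    return HEADINGS.index(requested) == opposite
```

Relative actions can never ask the snake to reverse into itself, whereas global directions need a check such as `is_reversal` (or an equivalent rule in the game) to handle that case.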

Configurability

Adjustable grid dimensions
Optional partial observability (vision range)
Customizable reward function for different learning objectives
Configurable episode length

Observations

Each step provides:

Full grid state (2D array)
Encoded observation (supports vision range)
Episode statistics (score, steps, fruits collected)
Alive status
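As a rough picture of how a vision range turns the full grid into a partial observation, the sketch below crops a square window centred on the snake's head. The cell codes, the padding value, and the function name are all assumptions made for illustration; the encoding actually produced by marlenv may differ.

```python
from typing import List

WALL = 1  # assumed cell code used to pad cells outside the board


def crop_vision(
    grid: List[List[int]],
    head_row: int,
    head_col: int,
    vision_range: int,
    pad_value: int = WALL,
) -> List[List[int]]:
    """Return the (2 * vision_range + 1)-sized square window around the head."""
    if vision_range < 0:
        raise ValueError("vision_range must be non-negative")
    rows, cols = len(grid), len(grid[0])
    window = []
    for r in range(head_row - vision_range, head_row + vision_range + 1):
        row = []
        for c in range(head_col - vision_range, head_col + vision_range + 1):
            inside = 0 <= r < rows and 0 <= c < cols
            # Cells beyond the board edge are filled with the padding value.
            row.append(grid[r][c] if inside else pad_value)
        window.append(row)
    return window
```

With this kind of cropping the observation size depends only on the vision range, not on the grid size, which is convenient when the same policy is evaluated on boards of different dimensions.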

Deployment Options

Docker: One-command deployment with SnakeEnv.from_docker_image()
Local server: Direct server connection for development
Web interface: Optional browser-based interaction

Example Usage

from envs.snake_env import SnakeAction, SnakeEnv

# Start from Docker
client = SnakeEnv.from_docker_image("snake-env:latest")

# Reset and play
result = client.reset()
while not result.done:
    action = SnakeAction(action=0)  # Go straight
    result = client.step(action)
    print(f"Score: {result.observation.episode_score}")

client.close()

Files Changed

New Files

- src/envs/snake_env/ - Complete environment implementation
  - __init__.py, client.py, models.py
  - server/app.py, server/snake_environment.py, server/Dockerfile
  - README.md, pyproject.toml, openenv.yaml
- examples/snake_simple.py - Interactive example with visualization (378 lines)

Modified Files

.github/workflows/docker-build.yml - Added snake-env to build matrix
.gitignore - Updated to exclude uv.lock files more broadly

Technical Details

Dependencies

marlenv>=1.0.0 - Core snake game implementation
gym==0.24.1 - Required by marlenv
numpy>=1.24.0 - Grid and array operations
Standard OpenEnv dependencies (FastAPI, Pydantic, Uvicorn)

Architecture

The implementation follows OpenEnv's standard pattern:

Environment server wraps marlenv Snake-v1
HTTP API exposes reset/step/state endpoints
Type-safe client provides Pythonic interface
Docker containerization for easy deployment
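For readers who want to see what the typed client hides, here is a sketch of the raw HTTP requests behind reset/step/state, built with the standard library only. The base URL and port, the use of POST for reset and step and GET for state, and the nested `{"action": {...}}` payload shape are assumptions rather than a documented contract; the requests are constructed but not sent.

```python
import json
from typing import Any, Dict, Optional
from urllib.request import Request


def build_request(
    base_url: str, endpoint: str, payload: Optional[Dict[str, Any]] = None
) -> Request:
    """Build a GET request (no payload) or a JSON POST request (with payload)."""
    url = f"{base_url.rstrip('/')}/{endpoint.lstrip('/')}"
    if payload is None:
        return Request(url, method="GET")
    body = json.dumps(payload).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


BASE_URL = "http://localhost:8000"  # assumed local server address

reset_req = build_request(BASE_URL, "reset", {})
step_req = build_request(BASE_URL, "step", {"action": {"action": 0}})
state_req = build_request(BASE_URL, "state")
```

Each request could then be sent with `urllib.request.urlopen(req)` against a running server. In practice the SnakeEnv client is the better entry point, since it also parses responses into typed observations.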

Reward Customization

Supports flexible reward shaping:

reward_dict = {
    'fruit': 1.0,      # Eating fruit
    'lose': -1.0,      # Death penalty
    'time': 0.0,       # Per-step reward/penalty
    'kill': 0.0,       # Multi-agent (unused in single-agent)
    'win': 0.0,        # Multi-agent (unused in single-agent)
}
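To make the role of each key concrete, the sketch below shows one plausible way a per-step reward could be assembled from such a dictionary in the single-agent case. How marlenv actually combines the terms is not stated in this PR, so the additive rule and the `step_reward` function are assumptions.

```python
from typing import Dict, Optional

DEFAULT_REWARDS: Dict[str, float] = {
    "fruit": 1.0,
    "lose": -1.0,
    "time": 0.0,
    "kill": 0.0,
    "win": 0.0,
}


def step_reward(
    ate_fruit: bool,
    died: bool,
    reward_dict: Optional[Dict[str, float]] = None,
) -> float:
    """Sum the per-step term with any event terms triggered on this step."""
    # User-supplied values override the defaults key by key.
    rewards = {**DEFAULT_REWARDS, **(reward_dict or {})}
    total = rewards["time"]
    if ate_fruit:
        total += rewards["fruit"]
    if died:
        total += rewards["lose"]
    return total
```

For example, passing `{"time": -0.25}` would penalise every step and so discourage aimless wandering, while leaving the fruit and death terms at their defaults.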

Testing

The example script demonstrates:

Docker container startup and connection
Local server connection
Reset and step operations
State tracking across episodes
Reward collection and scoring
Multi-episode statistics gathering
Real-time visualization

To test:

# Docker mode
python examples/snake_simple.py --mode docker --play-mode auto

# Local server mode (terminal 1)
cd src/envs/snake_env && uv run --project . server

# Local server mode (terminal 2)
python examples/snake_simple.py --mode local --play-mode multi --episodes 10
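The multi-episode statistics mentioned above amount to a small aggregation over per-episode results. A minimal sketch, assuming each episode yields a final score and a step count, is shown here; the function name and the returned keys are hypothetical and are not taken from snake_simple.py.

```python
from statistics import mean
from typing import Dict, List


def summarize_episodes(scores: List[float], steps: List[int]) -> Dict[str, float]:
    """Aggregate per-episode scores and step counts into summary statistics."""
    if not scores or len(scores) != len(steps):
        raise ValueError("scores and steps must be non-empty and equally long")
    return {
        "episodes": len(scores),
        "mean_score": mean(scores),
        "best_score": max(scores),
        "mean_steps": mean(steps),
    }


# Example with made-up numbers from three episodes.
summary = summarize_episodes([1.0, 3.0, 2.0], [10, 30, 20])
```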

Future Enhancements

Potential additions for future PRs:

Pre-trained baseline agents
Additional reward shaping examples
Multi-agent support (multiple snakes)
Advanced visualization options
Performance benchmarks

Credits

Based on marlenv by ML2 (KC-ML2)

Lines of Code: ~1,246 additions across 14 files
Documentation: Comprehensive README with examples and troubleshooting
CI/CD: Automated Docker builds integrated

github-actions · 2025-11-17T07:09:33Z

✅ Validation succeeded for snake_env

Your env passes the vibe check. However, most environments should go straight to the hub, where they will automatically be added to the official Env Hub collection on a nightly basis. Environments in the official specification repo are only meant to demonstrate usage of a specific spec feature for educational purposes. Re-run locally with:

openenv validate --verbose src/envs/snake_env

[OK] snake: Ready for multi-mode deployment

Supported deployment modes:
  [YES] docker
  [YES] openenv_serve
  [YES] uv_run
  [YES] python_module

Usage examples:
  cd snake_env && uv run server
  cd snake_env && openenv build
  cd snake_env && openenv push

You can deploy the environment to Hugging Face Spaces by running openenv push.

Success run: https://github.com/meta-pytorch/OpenEnv/actions/runs/19521160596

AlirezaShamsoshoara · 2025-11-17T07:24:27Z

cc @init27 @HamidShojanazeri @Darktex (Since I cannot add a reviewer directly)

burtenshaw · 2025-11-17T10:19:35Z

Thanks for the great contribution @AlirezaShamsoshoara! Can you deploy it to the HF Hub with openenv push and drop a link on this PR?

AlirezaShamsoshoara · 2025-11-17T22:33:24Z

Thank you @burtenshaw for taking a look at this! I just ran openenv push from the snake_env directory and changed a few files to make the environment compatible with the deployment and the web interface.
The environment is available here and seems to be running fine:

https://huggingface.co/spaces/Crashbandicoote2/snake_env

burtenshaw · 2025-11-19T10:53:22Z

@AlirezaShamsoshoara great. Could you update docs/environments.md? Then I think we can merge it.

AlirezaShamsoshoara · 2025-11-20T00:48:06Z

@burtenshaw Thanks for the comment! I just rebased and added the snake env to docs/environments.md.
Please let me know if anything else is left on this.

AlirezaShamsoshoara added 5 commits November 16, 2025 22:34

update the gitignore file to exclude broader range of uvlock

6f81d87

update the docker build yaml file

c5a705e

update the gitignore for uv.lock

357ed56

add the Snake Environment from marlenv snake

8b7a2fe

add a simple example for Snake env

103f445

meta-cla bot added the CLA Signed label Nov 17, 2025

add uv.lock

9568859

burtenshaw added the New Environment label Nov 17, 2025

AlirezaShamsoshoara added 4 commits November 17, 2025 14:25

update the models.py to only consider integer values and convert from…

f3df37b

… str

fix the uvicorn dependency and fix the server package

a6e5294

update the Dockerfile for the deployment

d08369d

update the README for colors and deployment and add Dockerfile backup

b00e411

burtenshaw and others added 13 commits November 19, 2025 16:26

fixes on the env to get working.

9fe411a

basic browser gym example

48e2b67

remove miniwob check

9239fc8

add custom port to browsergym app

39a1648

add incremental rewards

0452185

add help example for browsergym

eb1be54

update latest changes in grpo

6c9b2b6

improve guide on grpo setup

8cdf813

integrate miniwob tasks with server

6cfd6d3

update inference example based on environment changes

79a4f5e

remove excess fixes in core server

18de390

Delete grpo.py

6daebdc

Update feeback count func

66f0a93

burtenshaw and others added 29 commits November 19, 2025 16:34

add environment grid with links to hf

17df645

move docs yaml back to parent

a6c438e

fix gh action to match

c4816da

re-fix docs location

2abcce4

fix action

3f0bee4

pin docs dir in repo

6646c65

another fix on docs_dir

bb44ac8

fix autopython config for docs

23e5e76

update environments with existing envs

fcf1205

use mkdocs style tip blocks

0615672

fix warning on home page

a10d7d7

fix discord logo in button

e6caa1f

fix colab link

b61a9ec

update dockrfile and make expander

0b73e9d

prose in index

47b733e

fixes in quickstart

7f31137

deploy docs from main

9754780

added openenv hub files for browser-gym

ee8930b

Delete src/envs/browsergym_env/uv.lock

1fba777

Small nits in README

d040011

Reverse reward calculation

0391a5f

Add meta extension to prevent yaml from rendering in docs

7fb103a

Update environment-builder.md

de5a0b3

[HOTFIX] Build error in project toml

269e5d2

Add docs link to README

584a682

removing the additional space

a34488a

add the snake.md to the environments

790ab5e

add the snake env to the docs / environments for the spaces

7e030f2

Merge branch 'main' into alidev_snake_env_01

094818c
