Apex_rl

A reinforcement learning library focused on pragmatic, extensible training loops.

Documentation: https://apex-rl-doc.readthedocs.io/

Installation

Clone and install from source:

git clone https://github.com/Atticlmr/Apex_rl.git
cd Apex_rl
pip install -e .

or use uv

git clone https://github.com/Atticlmr/Apex_rl.git
cd Apex_rl
uv pip install -e .

Status

Algorithm	Status	Notes
PPO	✅ Available	On-policy runner, continuous + discrete actions
DQN	✅ Available	Replay buffer, OffPolicyRunner, Double DQN, Dueling DQN
SAC	🚧 Planned	Next Next release

Development Plan

Near-term roadmap:

SAC for continuous-control off-policy training
Temporal neural network training support for partially observable tasks
Recurrent and sequence-model policies such as LSTM, GRU, and Transformer-based agents
Sequence-aware training pipelines for rollout collection, hidden-state handling, and truncated backpropagation through time
Benchmarks for temporal models on POMDP-style control tasks

Quick Start

PPO

import gymnasium as gym

from apexrl.agent.on_policy_runner import OnPolicyRunner
from apexrl.envs.gym_wrapper import GymVecEnv
from apexrl.models import MLPCritic, MLPDiscreteActor

env = GymVecEnv([lambda: gym.make("CartPole-v1") for _ in range(4)], device="cpu")

runner = OnPolicyRunner(
    env=env,
    algorithm="ppo",
    actor_class=MLPDiscreteActor,
    critic_class=MLPCritic,
)
runner.learn(total_timesteps=20_000)

DQN / Dueling DQN

import gymnasium as gym
import torch

from apexrl.agent.off_policy_runner import OffPolicyRunner
from apexrl.algorithms.dqn import DQNConfig
from apexrl.envs.gym_wrapper import GymVecEnv
from apexrl.models import MLPQNetwork

env = GymVecEnv([lambda: gym.make("CartPole-v1") for _ in range(4)], device="cpu")

cfg = DQNConfig(
    double_dqn=True,
    dueling=True,
    learning_starts=1_000,
)

runner = OffPolicyRunner(
    env=env,
    cfg=cfg,
    q_network_class=MLPQNetwork,
    device=torch.device("cpu"),
)
runner.learn(total_timesteps=50_000)

Smoke Benchmarks

Run the lightweight benchmark suite with:

/Users/air/workspace/abc/bin/python benchmarks/run_smoke_benchmarks.py --iterations 1 --num-envs 1

Current smoke tasks:

CartPole-v1 with PPO
CartPole-v1 with DQN
CartPole-v1 with Dueling DQN
Acrobot-v1 with DQN
Acrobot-v1 with Dueling DQN
Pendulum-v1 with PPO
MountainCarContinuous-v0 with PPO

License

Apache-2.0

Citation

If you use this library in your research, please cite:

@software{li2025apexrl,
  author = {Li, Mingrui},
  title = {Apex\_rl: A Reinforcement Learning Library},
  url = {https://github.com/Atticlmr/Apex_rl},
  year = {2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.github		.github
benchmarks		benchmarks
docs @ 4541a04		docs @ 4541a04
src/apexrl		src/apexrl
test		test
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE-HEADER.txt		LICENSE-HEADER.txt
README.md		README.md
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Apex_rl

Installation

Status

Development Plan

Quick Start

PPO

DQN / Dueling DQN

Smoke Benchmarks

License

Citation

About

Licenses found

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Apex_rl

Installation

Status

Development Plan

Quick Start

PPO

DQN / Dueling DQN

Smoke Benchmarks

License

Citation

About

Resources

License

Licenses found

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages