Modifications on UniZero based on the LightZero implementation

LightZero is a lightweight, efficient, and easy-to-understand open-source toolkit combining Monte Carlo Tree Search (MCTS) and deep Reinforcement Learning (RL).
This repository includes our UniZero implementation and the research on local, adaptive, and Gaussian attention mechanisms for model-based RL in our paper: Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning

Updated: 2025-04-09 LightZero-v0.2.0

⚙️ Installation

git clone https://github.com/daniallegue/learning-to-focus.git
cd LightZero
pip install -e .

Note: Linux and macOS are officially supported. Windows support is coming soon.

🚀 Running UniZero

Train a UniZero agent on Atari Pong:

    python -m zoo.atari.config.atari_unizero_vanilla_config \
      --env PongNoFrameskip-v4 \
      --seed 1

Train a Local UniZero with window size 7 agent on Atari Pong:

    python -m zoo.atari.config.atari_unizero_local_config \
      --env PongNoFrameskip-v4 \
      --seed 1 \
      --local_window_size 7

Train a Gaussian UniZero on Atari Pong (Params set on the config)

    python -m zoo.atari.config.atari_unizero_gaussian_config \
      --env PongNoFrameskip-v4 \
      --seed 1

Train an Adaptive UniZero with initial span 6 on Atari Pong

    python -m zoo.atari.config.atari_unizero_gaussian_config \
      --env PongNoFrameskip-v4 \
      --seed 1 \
      --init_adaptive_span 6

For other UniZero configs, see:

zoo/atari/config/atari_unizero_vanilla_config.py
zoo/atari/config/atari_unizero_adaptive_config.py
zoo/atari/config/atari_unizero_gaussian_config.py
zoo/atari/config/atari_unizero_local_config.py

🔬 Attention Mechanisms Research

We implement and evaluate three attention variants within the UniZero world model:

Local Attention (fixed window)
Adaptive Span (soft, learnable window via ramp mask)
Gaussian Adaptive Attention (soft bell-shaped mask with learnable center μₕ and width σₕ)

Code organization:

Transformer & Attention lzero/model/unizero_world_models/modeling/ (self-attention implementations, span parametrizations, Gaussian masks)
UniZero Policy lzero/policy/unizero.py
Analysis & Plotting analysis/ (learning-curve plots, attention-map visualizations, ablation scripts)
Utilities lzero/model/unizero_world_models/visualize_utils.py lzero/model/unizero_world_models/attention_maps.py (frame-sequence and reward/value/policy visualizers)

📄 Citation

To cite our paper, please use:

@inproceedings{allegue2025learning,
    title={Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning},
    author={Daniel De Dios Allegue and Jinke He and Frans A Oliehoek},
    booktitle={NeurIPS 2025 Workshop on Embodied World Models for Decision Making},
    year={2025},
}

If you use UniZero or its attention research, please cite:

@article{pu2024unizero,
  title={UniZero: Generalized and Efficient Planning with Scalable Latent World Models},
  author={Pu, Yuan and Niu, Yazhe and Ren, Jiyuan and Yang, Zhenjie and Li, Hongsheng and Liu, Yu},
  journal={arXiv preprint arXiv:2406.10667},
  year={2024}
}

🏷️ License

This project is licensed under the Apache License 2.0.

(Back to top)

```

Name		Name	Last commit message	Last commit date
Latest commit History 246 Commits
.github/workflows		.github/workflows
analysis		analysis
assets		assets
docs		docs
lzero		lzero
zoo		zoo
.coveragerc		.coveragerc
.gitignore		.gitignore
.gitmodules		.gitmodules
.gitpod.Dockerfile		.gitpod.Dockerfile
.gitpod.yml		.gitpod.yml
.nojekyll		.nojekyll
.readthedocs.yaml		.readthedocs.yaml
.style.yapf		.style.yapf
Apptainer.def		Apptainer.def
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
LightZero.png		LightZero.png
Makefile		Makefile
README.md		README.md
README.zh.md		README.zh.md
cloc.sh		cloc.sh
format.sh		format.sh
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-build.txt		requirements-build.txt
requirements-doc.txt		requirements-doc.txt
requirements-test.txt		requirements-test.txt
requirements.lock.txt		requirements.lock.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Modifications on UniZero based on the LightZero implementation

⚙️ Installation

🚀 Running UniZero

🔬 Attention Mechanisms Research

📄 Citation

🏷️ License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Modifications on UniZero based on the LightZero implementation

⚙️ Installation

🚀 Running UniZero

🔬 Attention Mechanisms Research

📄 Citation

🏷️ License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages