lupuandr
  • FLAIR, University of Oxford / FAIR team at Meta AI
  • Oxford, UK

Highlights

  • Pro

Organizations

@fairinternal

Pinned

  1. Target-UCB Public

    Simple implementation of the Target-UCB algorithm.

    Python 2

  2. luchris429/purejaxrl Public

    Really Fast End-to-End Jax RL Implementations

    Python 1k 81

  3. FLAIROx/JaxMARL Public

    Multi-Agent Reinforcement Learning with JAX

    Python 727 134

  4. montrealrobotics/DeepRLInTheWorld Public

    From search engines to science to robotics, this repository is meant to showcase the use of reinforcement learning in the world.

    280 29

  5. facebookresearch/off-belief-learning Public archive

    Implementation of the Off-Belief Learning algorithm.

    Python 49 8

  6. FLAIROx/behaviour-distillation Public

    Code for Behaviour Distillation (ICML 2024)

    Python 6 1