muzero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1422
6 days ago

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Python
311
2 years ago

A structured implementation of MuZero

Python
205
3 years ago

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train both algorithms, pit them against each other, and investigate the reliability of learned MuZero MDP models.

Jupyter Notebook
160
4 years ago

Datasets for computer Go

C++
153
1 year ago

A simple implementation of the MuZero algorithm for the Connect 4 game

Jupyter Notebook
96
5 years ago

PyTorch implementation of Stochastic MuZero for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

Python
69
2 years ago

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in NetHack/MiniHack environments.

Python
41
3 years ago

A PyTorch implementation of DeepMind's MuZero agent

Python
35
2 years ago

PyTorch implementation of MuZero Unplugged for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

Python
31
2 months ago

Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023

Python
29
2 years ago

[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm

C++
21
3 months ago

Generalized AI, written in Python 3, to perform a multitude of tasks

Jupyter Notebook
21
2 years ago

PyTorch implementation of MuZero for Gym environments. It supports any Discrete, Box, and Box2D configuration for the action space and observation space.

Python
18
3 years ago

Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)

Jupyter Notebook
17
1 year ago

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

Python
16
10 months ago