Repository navigation

#

muzero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1441
2 天前

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Python
312
2 年前

A structured implementation of MuZero

Python
205
3 年前

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Jupyter Notebook
162
5 年前

datasets for computer go

C++
156
1 年前
C++
104
2 个月前

A simple implementation of MuZero algorithm for connect4 game

Jupyter Notebook
96
5 年前

Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

Python
72
2 年前

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

Python
41
3 年前

A PyTorch implementation of DeepMind's MuZero agent

Python
36
2 年前

Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

Python
32
3 个月前

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

Python
30
2 年前

[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm

C++
21
5 个月前

Generalized AI to perform a multitude of tasks written in python3

Jupyter Notebook
21
2 年前

Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.

Python
18
3 年前

Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)

Jupyter Notebook
17
2 年前

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

Python
16
1 年前