Repository navigation

#

muzero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1342
3 天前

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Python
311
2 年前

A structured implementation of MuZero

Python
205
3 年前

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Jupyter Notebook
157
4 年前

datasets for computer go

C++
152
10 个月前

A simple implementation of MuZero algorithm for connect4 game

Jupyter Notebook
97
5 年前

Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

Python
64
1 年前

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

Python
41
3 年前

A PyTorch implementation of DeepMind's MuZero agent

Python
33
1 年前

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

Python
28
2 年前

Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

Python
27
2 年前

Generalized AI to perform a multitude of tasks written in python3

Jupyter Notebook
21
1 年前

Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.

Python
17
2 年前

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

Python
16
6 个月前

Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)

Jupyter Notebook
16
1 年前
Python
12
2 个月前