Repository navigation

muzero

Website
Wikipedia

MuZero

muzero reinforcement-learning alphazero PyTorch Python self-learning monte-carlo-tree-search 深度学习 deep-reinforcement-learning 神经网络 rl tensorboard gym mcts alphago 机器学习

Python

2617

639

8 个月前

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

alphazero atari continuous-control monte-carlo-tree-search muzero PyTorch reinforcement-learning mcts board-game gym self-play

Python

1342

151

3 天前

huawei-noah / xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

impala dqn ppo muzero reinforcement-learning-algorithms

Python

311

2 年前

johan-gras / MuZero

A structured implementation of MuZero

muzero world-models reinforcement-learning Tensorflow

Python

205

3 年前

kaesve / muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

muzero alphazero reinforcement-learning Tensorflow tensorflow2 mcts tf2 深度学习 deep-reinforcement-learning

Jupyter Notebook

157

4 年前

yenw / computer-go-dataset

datasets for computer go

Go alphago alphazero muzero

C++

152

10 个月前

Zeta36 / muzero

A simple implementation of MuZero algorithm for connect4 game

muzero Python PyTorch deepmind Jupyter Notebook

Jupyter Notebook

5 年前

rlglab / minizero

MiniZero: An AlphaZero and MuZero Training Framework

alphazero deep-reinforcement-learning mcts muzero monte-carlo-tree-search atari Go hex reinforcement-learning

C++

2 个月前

DHDev0 / Stochastic-muzero

Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

机器学习 offline-reinforcement-learning deep-reinforcement-learning gym-environments lstm monte-carlo-tree-search muzero PyTorch rl transformer multilayer-perceptron

Python

1 年前

Hwhitetooth / jax_muzero

An implementation of MuZero in JAX.

reinforcement-learning 深度学习 deep-reinforcement-learning model-based-reinforcement-learning muzero jax

Python

2 年前

hr0nix / omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

jax model-based-reinforcement-learning muzero reinforcement-learning flax mcts

Python

3 年前

tuero / muzero-cpp

A C++ pytorch implementation of MuZero

C++PyTorch 机器学习 reinforcement-learning mcts alphazero muzero libtorch

C++

1 年前

michaelnny / muzero

A PyTorch implementation of DeepMind's MuZero agent

alphazero muzero PyTorch reinforcement-learning

Python

1 年前

sail-sg / rosmo

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

atari muzero offline-reinforcement-learning reinforcement-learning jax model-based-reinforcement-learning

Python

2 年前

DHDev0 / Muzero-unplugged

Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.

深度学习 deep-reinforcement-learning gym lstm 机器学习神经网络 Python PyTorch reinforcement-learning transformer arxiv gym-environments monte-carlo-tree-search muzero rl

Python

2 年前

bellerb / chappie.ai

Generalized AI to perform a multitude of tasks written in python3

机器学习人工智能 muzero mcts chess-ai PyTorch attention-mechanism transformer Python

Jupyter Notebook

1 年前

DHDev0 / Muzero

Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.

arxiv 深度学习 deep-reinforcement-learning 机器学习 monte-carlo-tree-search muzero 神经网络 Python PyTorch reinforcement-learning rl gym gym-environments lstm transformer

Python

2 年前