Repository navigation

#

mcts

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6684
5 天前

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook
4106
4 个月前

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python
3460
1 年前

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1342
3 天前

Latest Advances on System-2 Reasoning

Python
940
3 天前

Easily train AlphaZero-like agents on any environment you want!

Python
429
1 年前
Python
280
7 年前

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

Python
267
4 天前

Visualization of MCTS algorithm applied to Tic-tac-toe.

JavaScript
234
4 年前

A pytorch tutorial for DRL(Deep Reinforcement Learning)

Jupyter Notebook
211
2 年前

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

Python
203
2 个月前

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Jupyter Notebook
157
4 年前

Reinforcement learning models in ViZDoom environment

Python
133
3 年前