Repository navigation

#

mcts

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6820
2 天前

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook
4217
8 个月前

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python
3504
1 年前

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1422
6 天前

Latest Advances on System-2 Reasoning

Python
1223
2 个月前

Easily train AlphaZero-like agents on any environment you want!

Python
431
2 年前
Python
280
7 年前

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

Python
266
15 天前

Visualization of MCTS algorithm applied to Tic-tac-toe.

JavaScript
250
4 年前

A pytorch tutorial for DRL(Deep Reinforcement Learning)

Jupyter Notebook
218
2 年前

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

Python
210
6 个月前

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Jupyter Notebook
160
4 年前

The decision-making of multiple vehicles at intersection bases on level-k game and MCTS

C++
135
7 个月前