Repository navigation

#rl

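Llama Chinese community: a real-time roundup of the latest Llama learning materials, building the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use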

Python
14546
13 days ago

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook
10710
5 months ago
thu-ml/tianshou
Python
8412
1 month ago

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python
3459
1 year ago

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++
3389
6 years ago
DLR-RM/rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python
2376
15 days ago

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state-of-the-art reinforcement learning algorithms

Python
2345
2 years ago

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

Python
1429
2 years ago
araffin/rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Python
1168
3 years ago
Python
1125
12 days ago

Latest Advances on System-2 Reasoning

Python
939
3 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Python
863
5 days ago

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook
825
8 months ago

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python
818
2 years ago

Simplified implementations of all RL algorithms

Jupyter Notebook
730
10 days ago

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

Python
725
14 hours ago