Repository navigation
rl
- Website
- Wikipedia
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
An elegant PyTorch deep reinforcement learning library.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
Distributed RL System for LLM Reasoning
Implementation of papers in 100 lines of code.
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
Python library for Reinforcement Learning.