#gym

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python
11683
8 days ago
Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python
10301
5 days ago

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python
7990
3 months ago

🏋 Modern open-source fitness coaching platform. Create workout plans, track progress, and access a comprehensive exercise database.

TypeScript
6541
13 days ago

Self hosted FLOSS fitness/workout, nutrition and weight tracker

Python
5191
1 day ago
Farama-Foundation/PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python
3133
1 month ago
DLR-RM/rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python
2568
1 month ago
Farama-Foundation/Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Python
2332
1 month ago
utiasDSL/gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

Python
1647
5 hours ago

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1440
2 days ago

Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.

Python
1276
25 days ago

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python
1260
2 years ago
sail-sg/envpool
C++
1196
1 year ago
araffin/rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

Python
1184
3 years ago

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python
1166
2 days ago

Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros

Python
1098
1 year ago

Source codes for the book "Reinforcement Learning: Theory and Python Implementation"

HTML
985
2 months ago