offline-reinforcement-learning

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python
1244
2 years ago
Jupyter Notebook
707
3 years ago

An elegant PyTorch offline reinforcement learning library for researchers.

Python
354
1 month ago

Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for MuJoCo control tasks in OpenAI Gym

Python
278
3 years ago

A collection of offline reinforcement learning algorithms.

Python
195
9 months ago

Clean single-file implementation of offline RL algorithms in JAX

Python
150
8 months ago

A simple and easy-to-use autonomous driving environment for reinforcement learning, based on the CARLA simulator.

Python
145
3 months ago

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Python
140
1 year ago

Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets

Python
124
9 months ago

Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023): https://arxiv.org/abs/2208.10291

Python
109
2 years ago

[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"

Python
108
6 months ago

Unified Implementations of Offline Reinforcement Learning Algorithms

Python
89
4 months ago

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Python
75
3 years ago

PyTorch implementation of Stochastic MuZero for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

Python
69
2 years ago

[ICLR 2025] The official implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilitate efficient and flexible skill expansion and composition, iteratively evolve agents' capabilities, and efficiently address new challenges

Python
61
6 months ago

[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"

Python
57
2 years ago

Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC

Jupyter Notebook
55
2 years ago