Repository navigation

#

continuous-control

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Python
3732
3 年前

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1340
2 天前
Jupyter Notebook
673
2 年前
Python
441
7 年前

PyTorch Implementation of REINFORCE for both discrete & continuous control

Python
265
8 年前
Python
250
6 年前

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python
84
1 年前

PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)

Python
55
2 年前

Catalyst.RL: A Distributed Framework for Reproducible RL Research

Python
39
6 年前

A workbench for online model-free Reinforcement Learning on continuous control problems

C++
37
2 年前

PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method

Jupyter Notebook
29
4 年前

Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.

Python
28
4 年前

Proximal Policy Optimization (Continuous Version) in PyTorch.

Python
27
4 年前

Neural Ordinary Differential Equations for Reinforcement Learning

Python
23
2 年前