Repository navigation

#

continuous-control

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Python
3817
3 年前

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python
1422
6 天前
C++
838
3 个月前
Jupyter Notebook
707
3 年前
Python
447
7 年前

PyTorch Implementation of REINFORCE for both discrete & continuous control

Python
267
8 年前
Python
249
7 年前

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

Python
88
2 年前

PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)

Python
55
3 年前

Catalyst.RL: A Distributed Framework for Reproducible RL Research

Python
39
6 年前

A workbench for online model-free Reinforcement Learning on continuous control problems

C++
37
2 年前

Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.

Python
30
5 年前

Proximal Policy Optimization (Continuous Version) in PyTorch.

Python
29
3 个月前

PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method

Jupyter Notebook
28
5 年前

Neural Ordinary Differential Equations for Reinforcement Learning

Python
24
2 年前