continuous-control
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
The Fastest Deep Reinforcement Learning Library
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
PyTorch implementation of Soft Actor-Critic (SAC)
PyTorch implementation of Trust Region Policy Optimization
PyTorch Implementation of REINFORCE for both discrete & continuous control
Code for the paper "Evolved Policy Gradients"
End-to-end motion planner using Deep Deterministic Policy Gradient (DDPG) in Gazebo
TensorFlow implementation of Generative Adversarial Imitation Learning (GAIL)
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 2021)
Implementation of A3C for MuJoCo Gym environments
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
Catalyst.RL: A Distributed Framework for Reproducible RL Research
A workbench for online model-free Reinforcement Learning on continuous control problems
PyTorch implementation of Normalized Advantage Function (NAF), a Q-learning algorithm for continuous control problems, with prioritized experience replay (PER) and n-step returns
Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.
Proximal Policy Optimization (Continuous Version) in PyTorch.
Neural Ordinary Differential Equations for Reinforcement Learning