#actor-critic

High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python
7991
3 months ago

Implementations from the free course Deep Reinforcement Learning with TensorFlow and PyTorch

Jupyter Notebook
3875
2 years ago

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Python
3835
3 years ago

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python
1299
6 years ago

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Python
1195
4 years ago

Contains high-quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Jupyter Notebook
1078
4 years ago

Deep Reinforcement Learning For Sequence to Sequence Models

Python
769
3 years ago

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.

Jupyter Notebook
766
7 years ago

This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)

Python
684
5 years ago

Simple A3C implementation with PyTorch + multiprocessing

Python
654
3 years ago

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python
623
2 years ago

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.

Python
561
5 years ago