Repository navigation

#

advantage-actor-critic

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python
6845
12 天前

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Python
3733
3 年前

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Jupyter Notebook
1067
4 年前

Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch

Python
380
2 年前
Jupyter Notebook
279
4 年前

Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"

Jupyter Notebook
207
4 年前
Python
198
3 年前

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

Python
145
3 年前

[ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot

Python
37
2 年前

Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex

TeX
37
7 年前

The friendly robot that beats you in Yahtzee 🤖 🎲

Python
21
4 个月前

MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.

Jupyter Notebook
20
7 年前

Deep reinforcement learning package for torch7

Lua
16
9 年前