Repository navigation

#

pytorch-rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python
1239
5 年前

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

Python
669
7 年前
Python
586
5 年前

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python
340
2 年前
Jupyter Notebook
285
5 年前

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Python
236
7 年前

A high-performance Atari A3C agent in 180 lines of PyTorch

Python
171
4 年前

A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.

Python
166
6 年前

Pytorch solutions for UC Berkeley's cs285 assignments

Python
140
4 年前

A repository for implementation of deep reinforcement learning lectured at Samsung

Python
108
4 年前

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

Jupyter Notebook
100
6 年前

Pytorch starter code for UC Berkeley's cs285 assignments

Python
72
4 年前

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Python
65
6 年前