Repository navigation

actor-critic

Website
Wikipedia

MorvanZhou / Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

reinforcement-learning 教程 q-learning sarsa sarsa-lambda deep-q-network a3c ddpg policy-gradient dqn double-dqn dueling-dqn deep-deterministic-policy-gradient actor-critic Tensorflow proximal-policy-optimization ppo 机器学习

Python

9322

5020

2 年前

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandb reinforcement-learning PyTorch Python gym 机器学习 deep-reinforcement-learning 深度学习 atari ale a2c proximal-policy-optimization ppo advantage-actor-critic actor-critic phasic-policy-gradient

Python

7991

859

3 个月前

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

policy-gradient PyTorch actor-critic-algorithm alphago deep-reinforcement-learning a2c dqn sarsa ppo a3c resnet 算法深度学习 reinforce actor-critic sac td3

Python

4460

893

3 年前

simoninithomas / Deep_reinforcement_learning_Course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

deep-reinforcement-learning 深度学习 Tensorflow ppo a2c actor-critic deep-q-network deep-q-learning PyTorch Unity

Jupyter Notebook

3875

1223

2 年前

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

PyTorch reinforcement-learning 深度学习 deep-reinforcement-learning actor-critic advantage-actor-critic a2c ppo proximal-policy-optimization hessian atari mujoco roboschool continuous-control ale

Python

3835

841

3 年前

rlcode / reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

reinforcement-learning 深度学习 deep-reinforcement-learning 机器学习 policy-gradient deep-q-network dqn actor-critic a3c

Python

3585

739

3 年前

ikostrikov / pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

Python reinforcement-learning PyTorch 深度学习 actor-critic a3c deep-reinforcement-learning

Python

1299

280

6 年前

chainer / chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

chainer reinforcement-learning 深度学习机器学习 Python dqn actor-critic

Python

1195

224

4 年前

qfettes / DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Python PyTorch reinforcement-learning deep-reinforcement-learning deep-q-network double-dqn dueling-dqn rainbow actor-critic advantage-actor-critic a2c ppo

Jupyter Notebook

1078

327

4 年前

jingweiz / pytorch-rl

Deep Reinforcement Learning with pytorch & visdom

dqn a3c PyTorch visdom deep-reinforcement-learning reinforcement-learning 深度学习 actor-critic acer

Python

802

144

5 年前

yaserkl / RLSeq2Seq

Deep Reinforcement Learning For Sequence to Sequence Models

reinforcement-learning actor-critic policy-gradient 自然语言处理

Python

769

163

3 年前

omerbsezer / Reinforcement_learning_tutorial_with_demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..

reinforcement-learning 教程机器学习 q-learning sarsa policy-gradient deep-reinforcement-learning imitation-learning meta-learning actor-critic pomdps dynamic-programming a3c

Jupyter Notebook

766

177

7 年前

TianhongDai / reinforcement-learning-algorithms

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

deep-reinforcement-learning ddpg ppo proximal-policy-optimization 深度学习 actor-critic 算法 dqn flappy-bird a2c atari2600 dueling-dqn PyTorch soft-actor-critic sac

Python

684

109

5 年前

MorvanZhou / pytorch-A3C

Simple A3C implementation with pytorch + multiprocessing

PyTorch a3c gym 神经网络 multiprocessing actor-critic

Python

654

144

3 年前