Repository navigation

advantage-actor-critic

Website
Wikipedia

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

wandb reinforcement-learning PyTorch Python gym 机器学习 deep-reinforcement-learning 深度学习 atari ale a2c proximal-policy-optimization ppo advantage-actor-critic actor-critic phasic-policy-gradient

Python

6845

743

12 天前

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

PyTorch reinforcement-learning 深度学习 deep-reinforcement-learning actor-critic advantage-actor-critic a2c ppo proximal-policy-optimization hessian atari mujoco roboschool continuous-control ale

Python

3733

835

3 年前

qfettes / DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

Python PyTorch reinforcement-learning deep-reinforcement-learning deep-q-network double-dqn dueling-dqn rainbow actor-critic advantage-actor-critic a2c ppo

Jupyter Notebook

1067

329

4 年前

Kismuz / btgym

Scalable, event-driven, deep-learning-friendly backtesting library

reinforcement-learning deep-reinforcement-learning gym-environment openai-gym backtesting-trading-strategies algorithmic-trading-library time-series a3c Tensorflow unreal advantage-actor-critic policy-gradient statistical-arbitrage Hacktoberfest

Python

996

259

4 年前

cpnota / autonomous-learning-library

A PyTorch library for building deep reinforcement learning agents.

reinforcement-learning reinforcement-learning-algorithms deep-reinforcement-learning soft-actor-critic proximal-policy-optimization deep-q-learning advantage-actor-critic deep-deterministic-policy-gradient sac a2c ddpg ppo dqn dqn-pytorch

Python

652

1 年前

ChenglongChen / pytorch-DRL

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

PyTorch deep-reinforcement-learning multi-agent deep-q-network actor-critic advantage-actor-critic a2c proximal-policy-optimization ppo deep-deterministic-policy-gradient ddpg rl dqn reinforcement-learning

Python

569

106

7 年前

Omegastick / pytorch-cpp-rl

PyTorch C++ Reinforcement Learning

PyTorch C++reinforcement-learning reinforcement-learning-algorithms a2c ppo pytorch-rl pytorch-cpp-frontend libtorch actor-critic advantage-actor-critic proximal-policy-optimization continuous-control

C++

522

5 年前

PacktPublishing / Hands-On-Intelligent-Agents-with-OpenAI-Gym

Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch

deep-reinforcement-learning openai-gym carla-simulator dqn PyTorch advantage-actor-critic actor-critic

Python

380

151

2 年前

bentrevett / pytorch-rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

PyTorch pytorch-tutorial pytorch-implmention pytorch-implementation reinforcement-learning reinforcement-learning-algorithms rl pytorch-tutorials pytorch-rl policy-gradient actor-critic a2c advantage-actor-critic

Jupyter Notebook

279

4 年前

inoryy / tensorflow2-deep-reinforcement-learning

Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"

Tensorflow tensorflow2 Keras deep-reinforcement-learning advantage-actor-critic a2c

Jupyter Notebook

207

4 年前

lcswillems / torch-ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

PyTorch reinforcement-learning actor-critic deep-reinforcement-learning multi-process a2c a3c ppo advantage-actor-critic proximal-policy-optimization recurrent-neural-networks

Python

198

3 年前

CherryPieSexy / imitation_learning

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

reinforcement-learning ppo imitation-learning PyTorch a2c 深度学习 deep-reinforcement-learning proximal-policy-optimization advantage-actor-critic policy-gradient

Python

145

3 年前

jcwleo / curiosity-driven-exploration-pytorch

Curiosity-driven Exploration by Self-supervised Prediction

icm reinforcement-learning PyTorch advantage-actor-critic proximal-policy-optimization

Python

137

2 年前

Urinx / ReinforcementLearning

Reinforcing Your Learning of Reinforcement Learning

reinforcement-learning alphago-zero mcts q-learning policy-gradient doom tic-tac-toe space-invaders ppo advantage-actor-critic dqn alphago ddpg

Python

6 年前

rpatrik96 / pytorch-a2c

A well-documented A2C written in PyTorch

Python PyTorch pytorch-implementation pytorch-tutorial a2c actor-critic advantage-actor-critic 深度学习深度神经网络 deep-reinforcement-learning reinforcement-learning reinforcement-learning-algorithms stable-baselines baselines openai-gym

Python

6 年前

med-air / DEX

[ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot

advantage-actor-critic ddpg pytorch-rl reinforcement-learning sac

Python

2 年前

popovicidaniela / Master-Thesis

Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex

reinforcement-learning a3c actor-critic reinforcement-learning-algorithms Tensorflow autonomous-driving deep-reinforcement-learning 深度学习深度神经网络 tensorboard multi-threading multithreading Python NumPy asynchronous advantage-actor-critic LaTeX

TeX

7 年前

dionhaefner / yahtzotron

The friendly robot that beats you in Yahtzee 🤖 🎲

reinforcement-learning advantage-actor-critic jax

Python

4 个月前

monoelh / deep-reinforcement-learning_DDQN_PPO_HER

MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.

openai-gym NumPy game deep-reinforcement-learning deep-q-network ppo advantage-actor-critic

Jupyter Notebook

7 年前

Po-Hsun-Su / dprl

Deep reinforcement learning package for torch7

deep-reinforcement-learning advantage-actor-critic dqn

Lua

9 年前