policy-gradient

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook
11011
23 days ago
thu-ml/tianshou
Python
8412
1 month ago
kengz/SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Python
1277
2 months ago

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python
1188
4 years ago

Deep Reinforcement Learning For Sequence to Sequence Models

Python
767
2 years ago

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.

Jupyter Notebook
753
6 years ago

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

Python
646
7 years ago

Keras implementation of popular Deep RL algorithms (A3C, DDQN, DDPG, Dueling DDQN)

Python
542
5 years ago

Structural implementation of RL key algorithms

Python
512
2 years ago

DEEp Reinforcement learning framework

Python
484
1 year ago