ddpg-pytorch

Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.

Python
1270
2 years ago

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

Python
145
9 days ago

A modular library for off-policy reinforcement learning with a focus on SafeRL and distributed computing

Python
133
9 months ago

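A reinforcement learning algorithm library containing code for the current mainstream reinforcement learning algorithms (value-based and policy-based); all of the code has been debugged and runs.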

Python
76
1 year ago
Python
67
1 day ago

Simulation code for the paper: Joint Cooperation Clustering and Content Caching in Cell-Free Massive MIMO Networks

Python
37
3 years ago

An implementation of DDPG using PyTorch for algorithmic trading on the Chinese SH50 stock market.

Python
29
5 years ago

The smartest combination of Deep Q-Learning, Policy Gradients, Actor-Critic, and DDPG using PyTorch

Jupyter Notebook
13
5 years ago

PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation tasks.

Python
11
3 years ago

Common deep reinforcement learning algorithms implemented using PyTorch, including DQN, DDPG, DDQN, PPO, and MADDPG.

Python
11
1 month ago

PyTorch application of the DDPG reinforcement learning algorithm to OpenAI LunarLander

Python
5
3 years ago

Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm on the PyBullet Ant agent.

Jupyter Notebook
3
4 years ago

Deep Deterministic Policy Gradients in PyTorch with simulation in PyBullet

Python
3
3 years ago

This repository contains an implementation of Deep Deterministic Policy Gradient (DDPG), a reinforcement learning algorithm designed for environments with continuous action spaces. It features an actor-critic architecture, experience replay, and exploration strategies, and is tested on environments such as MountainCarContinuous. More information is available on the Medium blog.

Jupyter Notebook
3
3 months ago

PyTorch application of the DDPG and PPO reinforcement learning algorithms to Unity 3D-Ball

Python
2
3 years ago