
#exploration-exploitation

A collection of classic and cutting-edge industry papers and resources on recommendation and advertising / Must-read Papers on Recommendation System and CTR Prediction

1007
1 year ago

Python implementations of contextual bandit algorithms

Python
771
3 months ago

Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).

Python
464
2 years ago

PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".

Python
66
6 years ago

Code for the NeurIPS 2022 paper "Exploiting Reward Shifting in Value-Based Deep RL"

Python
29
1 year ago

Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL

Jupyter Notebook
25
3 years ago

Personalized and Interactive Music Recommendation with a Bandit Approach

Jupyter Notebook
10
6 years ago

Comparison of two methods for handling the exploration-exploitation dilemma in multi-armed bandits

Python
10
4 years ago

The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025

Python
7
6 months ago

Covers reinforcement learning concepts, use cases, and learning approaches

Jupyter Notebook
7
22 days ago

Official implementation of LECO (NeurIPS'22)

Python
7
2 years ago

The official code release for "Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo", ICLR 2024

Python
6
1 year ago

Deep Intrinsically Motivated Exploration in Continuous Control

Python
5
1 year ago

A short implementation of the bandit algorithms ETC, UCB, MOSS, and KL-UCB

Python
5
3 years ago

The official code release for "More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling", Reinforcement Learning Conference (RLC) 2024

Python
4
10 months ago