Repository navigation

#

contextual-bandits

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

C++
8552
6 个月前

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python
2895
1 个月前

Python implementations of contextual bandits algorithms

Python
771
3 个月前

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

Python
664
1 年前

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

Python
231
7 个月前

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

Python
186
5 年前

🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX

Python
66
3 年前

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

Python
66
4 年前

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

Python
40
6 年前

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

Python
39
7 年前

implement basic and contextual MAB algorithms for recommendation system

Jupyter Notebook
36
3 年前

Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire

Java
29
2 年前

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

Cuda
27
4 年前

Study of the paper 'Neural Thompson Sampling' published in October 2020

Jupyter Notebook
21
3 年前

Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

Jupyter Notebook
14
8 年前