contextual-bandits

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

C++
8596
10 months ago

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Python
2949
2 months ago

Python implementations of contextual bandits algorithms

Python
797
2 months ago

Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation

Python
675
1 year ago

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

Python
252
1 year ago

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

Python
187
6 years ago

🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX

Python
68
3 years ago

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

Python
66
4 years ago

Implementations of basic and contextual MAB algorithms for recommendation systems

Jupyter Notebook
42
4 years ago

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

Python
40
7 years ago

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

Python
39
7 years ago

Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire

Java
30
3 years ago

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

Cuda
28
4 years ago

Study of the paper 'Neural Thompson Sampling' published in October 2020

Jupyter Notebook
23
3 years ago

Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

Jupyter Notebook
14
8 years ago