contextual-bandits

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

C++ machine-learning online-learning contextual-bandits reinforcement-learning active-learning learning-to-search

C++

8609

1934

1 year ago

tensorflow / agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

reinforcement-learning Tensorflow contextual-bandits dqn

Python

2955

738

4 months ago

david-cortes / contextualbandits

Python implementations of contextual bandits algorithms

contextual-bandits reinforcement-learning exploration-exploitation

Python

804

149

4 months ago

st-tech / zr-obp

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

datasets contextual-bandits research

Python

682

1 year ago

fidelity / mabwiser

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

contextual-bandits machine-learning recsys

Python

260

1 year ago

alison-carrera / onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

neural-network neural-architecture-search pytorch-implementation machine-learning contextual-bandits reinforcement-learning-algorithms reinforcement-learning PyTorch

Python

188

6 years ago

alison-carrera / mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm Simulation ucb algorithm ranking-algorithm rank contextual-bandits reinforcement-learning reinforcement-learning-algorithms

Python

133

3 years ago

Nth-iteration-labs / contextual

Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies

bandit Simulation statistics contextual-bandits bandit-learning reinforcement-learning exploitation exploration evaluation machine-learning

5 years ago

instadeepai / catx

🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX

contextual-bandits jax deep-learning Python

Python

3 years ago

banditml / banditml

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

contextual-bandits PyTorch personalization neural-networks reinforcement-learning

Python

4 years ago

Heewon-Hailey / multi-armed-bandits-for-recommendation-systems

Implementations of basic and contextual MAB algorithms for recommendation systems

Python scikit-learn NumPy matplotlib recommendation-system contextual-bandits

Jupyter Notebook

4 years ago

lil-lab / blocks

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

natural-language-processing machine-learning natural-language-understanding reinforcement-learning contextual-bandits

Python

7 years ago

pemami4911 / sinkhorn-policy-gradient.pytorch

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

deep-learning combinatorial-optimization reinforcement-learning contextual-bandits

Python

7 years ago

thunfischtoast / LinUCB

Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire

Java bandit-learning contextual-bandits

Java

3 years ago

doerlbh / MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

speaker-diarization Bukkit speaker-recognition online-learning contextual-bandits self-supervised-learning

Cuda

4 years ago

RonyAbecidan / Neural-Thompson-Sampling

Study of the paper 'Neural Thompson Sampling' published in October 2020

neural-network contextual-bandits

Jupyter Notebook

3 years ago

mmalekzadeh / privacy-preserving-bandits

Privacy-Preserving Bandits (MLSys'20)

differential-privacy machine-learning online-machine-learning reinforcement-learning contextual-bandits privacy-preserving-machine-learning federated-learning recommender-system recommendation bandit-learning

Jupyter Notebook

3 years ago

improve-ai / python-ranker

Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions

ab-testing artificial-intelligence contextual-bandits machine-learning personalization Python recommender-system xgboost reinforcement-learning multivariate-testing

Python

2 years ago

thoughtworks / simplebandit

Lightweight contextual bandit library for TypeScript/JavaScript

contextual-bandits recommender personalization recommendation-system recommender-systems

TypeScript

2 years ago

jtcho / FairMachineLearning

Implementation of provably Rawlsian fair ML algorithms for contextual bandits.

machine-learning contextual-bandits Python Jupyter Notebook NumPy

Jupyter Notebook

8 years ago