150 results for “topic:multi-armed-bandit”
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
🔬 Research framework for single-player and multi-player 🎰 Multi-Armed Bandit (MAB) algorithms, implementing state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson sampling...) and multi-player (MusicalChair, MEGA, rhoRand, MCTopM/RandTopM, etc.) settings. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
Papers about recommendation systems that I am interested in
A simple, extensible library for developing AutoML systems
Simple A/B testing library for Clojure
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Demo project using multi-armed bandit algorithm
Python application to setup and run streaming (contextual) bandit experiments.
A multi-armed bandit library for Python
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
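Thompson sampling, named in the description above, can be sketched in a few lines for Bernoulli-reward arms: keep a Beta posterior per arm, sample from each posterior, and pull the argmax. This is a minimal illustrative sketch, not this library's API; the arm means, step count, and seed are assumptions.

```python
import random

def thompson_sampling(means, steps=1000, seed=1):
    """Beta-Bernoulli Thompson sampling sketch.
    `means` are the hidden Bernoulli success probabilities of the arms."""
    rng = random.Random(seed)
    alpha = [1] * len(means)  # prior Beta(1, 1): alpha counts successes + 1
    beta = [1] * len(means)   # beta counts failures + 1
    pulls = [0] * len(means)
    for _ in range(steps):
        # Sample one plausible mean per arm from its posterior, pull the best.
        samples = [rng.betavariate(alpha[i], beta[i]) for i in range(len(means))]
        arm = max(range(len(means)), key=samples.__getitem__)
        reward = 1 if rng.random() < means[arm] else 0  # Bernoulli reward
        alpha[arm] += reward
        beta[arm] += 1 - reward
        pulls[arm] += 1
    return pulls
```

Over time the posterior of the best arm concentrates, so most pulls go to it, while worse arms are still sampled occasionally (built-in exploration).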
Python library for Multi-Armed Bandits
Simple implementation of the CGP-UCB algorithm.
R package for Multi-Armed Bandit Simulation Study
More about the exploration-exploitation tradeoff with harder bandits
Offline evaluation of multi-armed bandit algorithms
COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) - and strategies for HCS context
Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions
A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.
A curated list on papers about combinatorial multi-armed bandit problems.
Multi-armed bandit algorithms implemented with TensorFlow, with 11 policies
Secondary development based on TorchSharp for deep learning and reinforcement learning
Software for the experiments reported in the RecSys 2019 paper "Multi-Armed Recommender System Bandit Ensembles"
A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms, including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit, designed for reinforcement learning applications.
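Of the algorithms listed above, disjoint LinUCB is easy to sketch: each arm keeps a design matrix A and response vector b, and the score is the ridge estimate's prediction plus an exploration bonus. This is an illustrative sketch under assumed names and data shapes, not the library's actual interface.

```python
import numpy as np

def linucb_choose(A_list, b_list, contexts, alpha=1.0):
    """One disjoint-LinUCB decision. For each arm: theta = A^{-1} b,
    score = theta . x + alpha * sqrt(x^T A^{-1} x); pick the argmax."""
    scores = []
    for A, b, x in zip(A_list, b_list, contexts):
        A_inv = np.linalg.inv(A)
        theta = A_inv @ b                     # ridge-regression estimate
        bonus = alpha * np.sqrt(x @ A_inv @ x)  # upper-confidence width
        scores.append(float(theta @ x + bonus))
    return int(np.argmax(scores))
```

After observing reward r for the chosen arm with context x, the caller would update that arm's statistics with `A += np.outer(x, x)` and `b += r * x`.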
Easily Score & Rank Codable Objects with ML
Author's implementation of the paper Correlated Age-of-Information Bandits.
Implementation of the X-armed Bandits algorithm, as detailed in the paper "X-armed Bandits" (Bubeck et al., 2011).
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
In this GitHub project you will find part of the material I use to teach the introductory Reinforcement Learning module
Implementation of the greedy, ε-greedy, and Upper Confidence Bound (UCB) algorithms on the multi-armed bandit problem.
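The two strategies named in the entry above can be sketched in a few lines each on Bernoulli arms: ε-greedy pulls the best-looking arm except for a random fraction ε of exploratory pulls, while UCB1 adds a confidence bonus that shrinks as an arm is pulled more. The arm means, step counts, and seeds below are illustrative assumptions, not taken from that repository.

```python
import math
import random

def epsilon_greedy(means, epsilon=0.1, steps=1000, seed=0):
    """Epsilon-greedy sketch: explore with prob. epsilon, else exploit."""
    rng = random.Random(seed)
    counts = [0] * len(means)
    values = [0.0] * len(means)  # running average reward per arm
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(len(means))                        # explore
        else:
            arm = max(range(len(means)), key=values.__getitem__)   # exploit
        reward = 1.0 if rng.random() < means[arm] else 0.0  # Bernoulli reward
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
    return counts, values

def ucb1(means, steps=1000, seed=0):
    """UCB1 sketch: pull each arm once, then maximize mean + sqrt(2 ln t / n_i)."""
    rng = random.Random(seed)
    n = len(means)
    counts, values = [0] * n, [0.0] * n
    for t in range(steps):
        if t < n:
            arm = t  # initialization: pull every arm once
        else:
            arm = max(range(n),
                      key=lambda i: values[i] + math.sqrt(2 * math.log(t) / counts[i]))
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts
```

With two arms of true means 0.2 and 0.8, both strategies concentrate their pulls on the better arm; ε-greedy keeps exploring at a fixed rate, whereas UCB1's exploration decays on its own.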