"topic:mab" — Search

This project implements famous MAB algorithms and evaluates them on the basis of their performance - EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson.

Jupyter Notebook41Updated 5 years ago

algorithmsevaluationgridsearchmabmulti-armed-banditspython3

vmarchaud/ts-mab

Typescript implementation of a multi-armed bandit

TypeScript30Updated 5 years ago

mabthompson-samplingtypescript

tuhinsharma121/pybandit-archiveArchived

A Python library for all popular multi-armed bandit algorithms.

Jupyter Notebook20Updated 2 years ago

maboptimization-algorithms

ReinerJasin/Multi-Armed-Bandit

Implementation of the Multi-Armed Bandit where each arm returns continuous numerical rewards. Covers Epsilon-Greedy, UCB1, and Thompson Sampling with detailed explanations.

Jupyter Notebook20Updated 10 months ago

contextual-banditsepsilon-greedylinearucblinucbmabmultiarmed-banditsthompson-samplingucbupper-confidence-bounds

pm3310/mab-covid19

Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively

Jupyter Notebook21Updated 5 years ago

awscoronaviruscovid-19mabmulti-armed-banditspythonsagemaker

rrayy-25809/MAB_music_recommend

멀티 암드 밴딧 기반 음악 장르 추천 프로그램

Python10Updated 9 months ago

mabseleniumsklearnstreamlit

VladMarianCimpeanu/OLA_project

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

Jupyter Notebook13Updated 3 years ago

mabmontecarlo-simulationmulti-armed-banditonline-learning-applicationspricingreinforcement-learningthompson-samplingucb1

aijunbai/bandit

Algorithms for multi-armed bandit (MAB) problems

C++10Updated 10 years ago

mab

jiseongHAN/reinforcement

My Little Reinforcement Learning

Python10Updated 4 years ago

ddqndqnmabppo-pytorchpytorchreinforcereinforcement-learning

avorozhtsov/shipit

Exploitation vs Exploration problem stated as A/B-testing with maximum profit per unit time.

Mathematica00Updated 2 years ago

ab-testingcontinuous-testingexploration-exploitationmabpeaking

Bachfischer/COMP90051-StatML-Assignment-2

Source code for Assignment 2 of COMP90051 (Semester 2 2020)

Jupyter Notebook00Updated 5 years ago

mabmulti-armed-banditucb

Skelf-Research/compere

Intelligent pairwise comparisons. Better rankings with fewer votes.

Python00Updated 1 month ago

algorithmsmabpython

JoelJa835/MAB_Algorithms

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

Python00Updated 2 years ago

banditse-greedymabreinforcement-learning-algorithmsucb

sshaplygin/as-cache

Adaptive selection cache with mab

Go00Updated 3 weeks ago

2q-cacheadaptive-cachearc-cachegolanglfu-cachelfuda-cachelru-cachemabmab-cachestatistics