#

bandit-algorithms

Here are 89 public repositories matching this topic...

SMPyBandits

SMPyBandits / SMPyBandits

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on

python open-source research internet-of-things simulations multi-arm-bandits multi-armed-bandit learning-theory bandit-algorithms cognitive-radio

Updated Apr 30, 2024
Jupyter Notebook

c-bata / goptuna

A hyperparameter optimization framework, inspired by Optuna.

bayesian-optimization evolution-strategies blackbox-optimization bandit-algorithms

Updated Aug 30, 2024
Go

WilliamLwj / PyXAB

PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms

data-science machine-learning algorithm reinforcement-learning optimization machine-learning-algorithms hyperparameter-optimization hyperparameter-tuning optimization-algorithms online-learning automl blackbox-optimization bandit-algorithms lipschitz-bandit x-armed-bandit continuous-armed-bandit

Updated Oct 24, 2024
Python

KKeishiro / Yahoo_recommendation

Yahoo! news article recommendation system by linUCB

recommendation-system contextual-bandit bandit-algorithms linucb

Updated Feb 1, 2018
Python

gdmarmerola / interactive-intro-rl

Big Data's open seminars: An Interactive Introduction to Reinforcement Learning

machine-learning reinforcement-learning bandit-algorithms

Updated Jun 7, 2021
Jupyter Notebook

sshkhr / Practical_RL

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

reinforcement-learning tensorflow deep-reinforcement-learning pytorch policy-gradient evolutionary-algorithms markov-decision-processes td-learning monte-carlo-sampling bandit-algorithms

Updated Dec 22, 2021
Jupyter Notebook

Alanthink / banditpylib

A lightweight python library for bandit algorithms

bandit-algorithms

Updated Jul 21, 2022
Python

niffler92 / Bandit

Bandit algorithms

simulation thompson-sampling multiarm-bandit contextual-bandit bandit-algorithms linucb

Updated Oct 12, 2017
Python

kulinshah98 / Multi-Armed-Bandit-Algorithms

Python implementation of UCB, EXP3 and Epsilon greedy algorithms

epsilon-greedy multi-armed-bandits upper-confidence-bounds bandit-algorithms stochastic-bandit-algorithms adversarial-bandit-algorithms exp3-algorithm

Updated Oct 4, 2018
Python

doerlbh / MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

paper speaker-recognition online-learning speaker-diarization contextual-bandits bandit-algorithms interspeech self-supervised-learning acml interspeech2020 online-speaker-diarization

Updated Sep 20, 2021
Cuda

gdmarmerola / advanced-bandit-problems

More about the exploration-exploitation tradeoff with harder bandits

machine-learning multi-armed-bandit bandit-algorithms

Updated May 12, 2019
Jupyter Notebook

mmalekzadeh / privacy-preserving-bandits

Privacy-Preserving Bandits (MLSys'20)

machine-learning reinforcement-learning recommender-system recommendation bandit-learning differential-privacy contextual-bandits bandit-algorithm federated-learning bandit-algorithms privacy-preserving-machine-learning online-machine-learning criteo-dataset differentially-private privacy-preserving-bandits

Updated Dec 8, 2022
Jupyter Notebook

ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

A curated list on papers about combinatorial multi-armed bandit problems.

thompson-sampling multi-armed-bandit combinatorial-optimization bandit-algorithms combinatorial-bandit

Updated May 10, 2021

sparsh-ai / reco-bandit

Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning

recommender-system contextual-bandits bandit-algorithms

Updated Jul 1, 2021
Jupyter Notebook

rssalessio / reading-list

This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.

learning machine-learning statistics reinforcement-learning deep-learning optimization reading-list bandit-algorithms

Updated Apr 3, 2025

DURUII / Replica-AUCB

🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

multi-armed-bandit bandits mab cmab bandit-algorithms aution aucb

Updated Dec 17, 2023
Python

singhsidhukuldeep / contextual-bandits

A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms—including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit—designed for reinforcement learning applications

python machine-learning reinforcement-learning algorithms epsilon-greedy multi-armed-bandit contextual-bandits bandit-algorithms linucb

Updated Dec 31, 2024
Python

gokceuludogan / interactive-music-recommendation

Personalized and Interactive Music Recommendation with Bandit approach

music-recommendation bandit-algorithms exploration-exploitation bayes-ucb

Updated Sep 15, 2019
Jupyter Notebook

MaxenceGiraud / MachineLearningAlgos

Personal reimplementation of some ML algorithms for learning purposes

Updated Jul 13, 2021
Python

ngutowski / algossim

This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.

recommendation-system artificial-intelligence-algorithms contextual-bandits bandit-algorithms

Updated Dec 7, 2021
Python

Improve this page

Add a description, image, and links to the bandit-algorithms topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bandit-algorithms topic, visit your repo's landing page and select "manage topics."

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Alternative Proxies:

Alternative Proxy