Skip to content

Authors' implementation of the paper - "A New Approach to Correlated Multi Armed Bandits"

License

Notifications You must be signed in to change notification settings

ishank-juneja/UCUCB

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A New Approach to Correlated Multi Armed Bandits

Authors' implementation of the paper - "A New Approach to Correlated Multi Armed Bandits"

Link to paper

The paper above introduces the UCUCB (Also called U-CUCB, Uni-CUCB or Uniform CUCB) algorithm which is a distribution learning approach to minimizing regret for the Correlated Multi-Armed Bandit problem. The code also implements the CUCB (Also called UCB-C or Correlated UCB) algorithm first described in an early version of reference [2].

For background on the Correlated Multi-Armed bandit framework see references [1], [2]

These scripts simulate the vanilla UCB, CUCB and UCUCB policies on a specified correlated bandit instance and produce an expected cumulative regret vs. horizon T plot with separate trend lines corresponding to each of the policies. The code is designed to work for a general discrete random variable X as opposed to a parameterized continuos random variable.


A regret plot example for an instance where UCUCB outperformed CUCB

Usage

  1. Create a bandit instance .txt file defining the correlated bandit instance on which the policies are to be simulated

  2. Set the simulation parameters in the script

  3. Run script and save results in a .txt file

Details Coming soon

Related Work on Github

  1. Our implementation of other sequential decision making algorithms for Multi-Armed-Bandits implemented in this repository. Implemented algorithms

  2. Work on correlated Age-of-Information bandits repository

References

[1] I. Juneja, D. S. Gaharwar, D. Varshney and S. Moharir, "A New Approach to Correlated Multi Armed Bandits," 2020 International Conference on COMmunication Systems & NETworkS (COMSNETS), Bengaluru, India, 2020, pp. 634-637, doi: 10.1109/COMSNETS48256.2020.9027344.

[2] S. Gupta, G. Joshi and O. Yağan, "Correlated Multi-Armed Bandits with A Latent Random Source," ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pp. 3572-3576, doi: 10.1109/ICASSP40776.2020.9054429.

About

Authors' implementation of the paper - "A New Approach to Correlated Multi Armed Bandits"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages