Discusses formulation of MAB problems and its application. There are several methods for solving multi arm bandits. Few of them are dicussed in this git. Contextual MAB problems are interesting and extension of the solutions are also discussed.
- Epsilon greedy
- Upper Confidence Bound
- Thompson sampling
- Exp3
- Exp4
- LinUCB