Use Beta Distribution and Thompson Sampling to Beat The Multi-armed Bandit at the Casino

Use Beta Distribution and Thompson Sampling to Beat The Multi-armed Bandit at the Casino

As a logical person at the casino. you want to put your money on the machine with the maximum expected return. This is the origin of the multi-armed bandit problem. We will cover the two most basic concept here: Beta distribution and Thompson sampling.

Beta Distribution