Research Article

Combining Multiple Strategies for Multiarmed Bandit Problems and Asymptotic Optimality

Figure 3: Percentage of selections of the best strategy of (tuned) -comb(2 PLAs) and of (tuned) Exp4/NEXP/EEEs for the distribution ().