International Journal of Stochastic Analysis

International Journal of Stochastic Analysis / 1999 / Article

Open Access

Volume 12 |Article ID 924375 | 10 pages | https://doi.org/10.1155/S1048953399000155

Exact solution of the Bellman equation for a β-discounted reward in a two-armed bandit with switching arms

Received01 May 1998
Revised01 Oct 1998

Abstract

We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a β-discounted reward and prove that a myopic policy is optimal.

Copyright © 1999 Hindawi Publishing Corporation. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


More related articles

58 Views | 216 Downloads | 1 Citation
 PDF  Download Citation  Citation
 Order printed copiesOrder

Related articles

We are committed to sharing findings related to COVID-19 as quickly and safely as possible. Any author submitting a COVID-19 paper should notify us at help@hindawi.com to ensure their research is fast-tracked and made available on a preprint server as soon as possible. We will be providing unlimited waivers of publication charges for accepted articles related to COVID-19. Sign up here as a reviewer to help fast-track new submissions.