O’Sullivan, Michael J.; Saunders, Michael A. Stabilizing policy improvement for large-scale infinite-horizon dynamic programming. (English) Zbl 1191.49028 SIAM J. Matrix Anal. Appl. 31, No. 2, 434-459 (2009). MSC: 49L20 15A06 65F50 93E03 90C70 PDF BibTeX XML Cite \textit{M. J. O'Sullivan} and \textit{M. A. Saunders}, SIAM J. Matrix Anal. Appl. 31, No. 2, 434--459 (2009; Zbl 1191.49028) Full Text: DOI OpenURL
Yushkevich, A. A. Sensitive criteria in the continuous-time two-armed bandit problem. (English. Russian original) Zbl 0757.62010 Theory Probab. Appl. 35, No. 4, 819-824 (1991). MSC: 62C10 93E35 90C40 60J05 PDF BibTeX XML Cite \textit{A. A. Yushkevich}, Theory Probab. Appl. 35, No. 4, 819--824 (1990; Zbl 0757.62010) Full Text: DOI OpenURL
Yushkevich, A. A. Sensitive criteria in the two-armed bandit problem with continuous time. (Russian) Zbl 0729.62007 Teor. Veroyatn. Primen. 35, No. 4, 793-797 (1990). Reviewer: A.A.Pervozvanskij (St.Petersburg) MSC: 62C10 93E35 90C40 60J05 PDF BibTeX XML Cite \textit{A. A. Yushkevich}, Teor. Veroyatn. Primen. 35, No. 4, 793--797 (1990; Zbl 0729.62007) OpenURL