O’Sullivan, Michael J.; Saunders, Michael A. Stabilizing policy improvement for large-scale infinite-horizon dynamic programming. (English) Zbl 1191.49028 SIAM J. Matrix Anal. Appl. 31, No. 2, 434-459 (2009). MSC: 49L20 15A06 65F50 93E03 90C70
Yushkevich, A. A. Sensitive criteria in the continuous-time two-armed bandit problem. (English. Russian original) Zbl 0757.62010 Theory Probab. Appl. 35, No. 4, 819-824 (1991). MSC: 62C10 93E35 90C40 60J05
Yushkevich, A. A. Sensitive criteria in the two-armed bandit problem with continuous time. (Russian) Zbl 0729.62007 Teor. Veroyatn. Primen. 35, No. 4, 793-797 (1990). Reviewer: A.A.Pervozvanskij (St.Petersburg) MSC: 62C10 93E35 90C40 60J05