×

Markov renewal decision processes with finite horizon. (English) Zbl 0447.90087


MSC:

90C40 Markov and semi-Markov decision processes
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Feller W (1971) An introduction to probability theory and its applications, vol. 2, 2nd ed. J Wiley, New York · Zbl 0219.60003
[2] Hinderer K (1971) Instationäre dynamische Optimierung bei schwachen Voraussetzungen über die Gewinnfunktionen. Abh Math Sem Univ Hamburg 36:208–223 · Zbl 0246.49026 · doi:10.1007/BF02995923
[3] Hinderer K (1970) Foundations of non-stationary dynamic programming with discrete time parameter. Springer, Berlin Heidelberg New York · Zbl 0202.18401
[4] Hinderer K (1978) On approximate solutions of finite-stage dynamic programs. In: Puterman ML (ed) Dynamic Programming and its Applications, Proc. Internat. Conference on Dynamic Programming, Vancouver 1977. Academic Press, New York, pp 289–317 · Zbl 0394.90100
[5] Hinderer K, Hübner G (1977) On exact and approximate solutions of unstructured finite-stage dynamic programs. Proc. Advanced Seminar on Markov Decision Theory, Amsterdam 1976. Math Centre Tracts 93:57–76
[6] Jewell WS (1963) Markov-renewal programming. I: Formulation, finite return models. II: Infinite return models, example. Oper Res 11:938–971 · Zbl 0126.15905 · doi:10.1287/opre.11.6.938
[7] Lembersky MR (1974) On maximal rewards and{\(\epsilon\)}-optimal policies in continuous time Markov decision chains. Ann Statist 2:159–169 · Zbl 0272.90083 · doi:10.1214/aos/1176342621
[8] Lembersky MR (1974) Preferred rules in continuous time Markov decision processes. Manage Sci 21:348–357 · Zbl 0321.93063 · doi:10.1287/mnsc.21.3.348
[9] Porteus E (1975) Bounds and transformations for discounted finite Markov decision chains. Oper Res 23:761–784 · Zbl 0322.90073 · doi:10.1287/opre.23.4.761
[10] Rieder U (1976) On dynamic programming with unbounded reward functions. Report, Inst. f. Math. Stochastik, University of Hamburg · Zbl 0353.90091
[11] Schäl M (1975) Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal. Z Wahrsch Verw. Gebiete 32:179–196 · Zbl 0316.90080 · doi:10.1007/BF00532612
[12] Schellhaas H (1974) Zur Extrapolation in Markoffschen Entscheidungsmodellen mit Diskontierung. Z Oper Res 18:91–104 · Zbl 0288.90085 · doi:10.1007/BF01949684
[13] Schellhaas H (1979) Über Semi-Markoffsche Entscheidungsprozesse mit endlichem Horizont. Proc. in Operat. Res. Vol. 8. Gaede KW et al. (eds) Physica-Verlag, Würzburg Wien, pp 122–129 · Zbl 0432.90079
[14] Stidham S On the convergence of successive approximations in dynamic programming with non-zero terminal reward. NCSU Technical report No. 78-9 · Zbl 0454.90087
[15] Waldman K-H (1978) A natural extension of the MacQueen extrapolation. Preprint Nr. 436, Fachbereich Mathematik, Technische Hochschule Darmstadt
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.