MSC:  68T05
MSC:  68Txx
### An SDN routing algorithm based on deep reinforcement learning. (Chinese. English summary) Zbl 1488.68001

MSC:  68M10 68T07
### Points gained in football: using Markov process-based value functions to assess team performance. (English) Zbl 1472.90151

MSC:  90C40 90C90
MSC:  60K05
### Reward of reinforcement learning of test optimization for continuous integration. (Chinese. English summary) Zbl 1438.68031

MSC:  68N99 68T05
### Resource allocation problems with concave reward functions. (English) Zbl 1410.91311

MSC:  91B32 91A12
### Reliability analysis of an aging unit with a controllable repair facility activation. (English) Zbl 1397.62591

Pilz, Jürgen (ed.) et al., Statistics and simulation. Contributions given at the 8th international workshop on simulation, IWS 8, Vienna, Austria, September 21–25, 2015. Cham: Springer (ISBN 978-3-319-76034-6/hbk; 978-3-319-76035-3/ebook). Springer Proceedings in Mathematics & Statistics 231, 403-417 (2018).
MSC:  62N05 60J28
### On the asymptotic behaviour of the covariance function of the rewards of a multivariate renewal-reward process. (English) Zbl 1377.60080

MSC:  60K05 60F05
### On the rationality of some crisp choice functions based on strongly complete fuzzy pre-orders. (English) Zbl 1376.91050

MSC:  91B06 91B08
### Reward processes and performance optimization in asymmetric supermarket models. (Chinese. English summary) Zbl 1349.60121

MSC:  60J20 60K25 60K30
### Another set of verifiable conditions for average Markov decision processes with Borel spaces. (English) Zbl 1340.90255

MSC:  90C40 93E20
MSC:  90C40
### Renewal processes. (English) Zbl 1300.60004

SpringerBriefs in Statistics. Cham: Springer (ISBN 978-3-319-05854-2/pbk; 978-3-319-05855-9/ebook). viii, 122 p. (2014).
### An output feedback reinforcement learning control method based on a reference model. (Chinese. English summary) Zbl 1289.93071

MSC:  93C40 93B52

### On optimal stopping problems for matrix-exponential jump-diffusion processes. (English) Zbl 1252.60039

MSC:  60G40 60J75 60G51
### Optimal stopping problem in a model with compensated refusal of reward. (English. Russian original) Zbl 1229.60052

Math. Notes 89, No. 2, 238-244 (2011); translation from Mat. Zametki 89, No. 2, 241-248 (2011).
MSC:  60G40 91G80
### A numerical method for the expected penalty-reward function in a Markov-modulated jump-diffusion process. (English) Zbl 1218.91075

MSC:  91B30 60J70 60K10
### Reactive self-rescue control for autonomous mobile robot based on reinforcement learning. (Chinese. English summary) Zbl 1212.93233

MSC:  93C85 68T40 68T05

### Reward distributions associated with some block tridiagonal transition matrices with applications to identity by descent. (English) Zbl 1168.60008

MSC:  60E10 92D10 60K15
### An optimal stopping problem for a random walk with polynomial reward functions. (Ukrainian. English summary) Zbl 1199.60145

MSC:  60G40 60G50

### Reward functions and cooperative games: characterization and economic application. (English) Zbl 1185.91038

MSC:  91A12 91A43 91D30
MSC:  68T05
### Optimal time to invest under tax exemptions. (English) Zbl 1103.60044

Kabanov, Yuri (ed.) et al., From stochastic calculus to mathematical finance. The Shiryaev Festschrift. Almost all papers based on the presentation at the second Bachelier colloquium on stochastic calculus and probability, Metabief, France, January 9–15, 2005. Berlin: Springer (ISBN 3-540-30782-6/hbk). 17-32 (2006).
MSC:  60G40 91B76

### Perceptive evaluation for the optimal discounted reward in Markov decision processes. (English) Zbl 1121.68425

Torra, Vicenç (ed.) et al., Modeling decisions for artificial intelligence. Second international conference, MDAI 2005, Tsukuba, Japan, July 25–27, 2005. Proceedings. Berlin: Springer (ISBN 3-540-27871-0/pbk). Lecture Notes in Computer Science 3558. Lecture Notes in Artificial Intelligence, 283-293 (2005).
MSC:  68T37

### Distributions of reward functions on continuous-time Markov chains. (English) Zbl 1015.60064

Latouche, Guy (ed.) et al., Matrix-analytic methods. Theory and applications. Proceedings of the 4th international conference, Adelaide, Australia, July 14-16, 2002. Singapore: World Scientific. 39-62 (2002).
MSC:  60J27

MSC:  91A15
### Nearly optimal policies in risk-sensitive positive dynamic programming on discrete spaces. (English) Zbl 1038.90087

MSC:  90C39 91B30 91B16
### Markov fuzzy criterion decision models. (English) Zbl 0970.90111

MSC:  90C40 03E72

### Optimal admission control for $M/D/1/K$ queueing systems. (English) Zbl 0972.90017

MSC:  90B22 90B15 60K25
MSC:  90C70
MSC:  60J20
MSC:  90C40
### Minimizing some cost functions related to both burn-in and field use. (English) Zbl 0864.90053

MSC:  90B25 62P30 90B30
### Fuzzy decision processes with an average reward criterion. (English) Zbl 0965.90500

MSC:  90C70 90C40 90B50

### Fuzzy decision processes with an average reward criterion. (English) Zbl 0965.97001

MSC:  90C70 90C40 90B50

### Discrete-time Markov-reward models of production systems. (English) Zbl 0837.90062

Kumar, P. R. (ed.) et al., Discrete event systems, manufacturing systems, and communication networks. Based on the proceedings of a workshop that was an integral part of the 1992-93 IMA program on control theory, held at the University of Minnesota, Minneapolis, MN, USA. New York, NY: Springer-Verlag. IMA Vol. Math. Appl. 73, 149-175 (1995).

MSC:  90C39
### Possibilities of solution in stochastic decision models with recursive reward functions. (English) Zbl 0717.93065

MSC:  93E20 90C39
### On the optimal reward function of the continuous time multiarmed bandit problem. (English) Zbl 0714.90096

Reviewer: J.L.Menaldi
MSC:  90C40 60J25 93E20 35B37 90C39
### The high contact principle in optimal stopping and stochastic waves. (English) Zbl 0687.60045

Stochastic processes, Semin., San Diego/CA (USA) 1989, Prog. Probab. 18, 177-192 (1990).
MSC:  60G40 60J45

### Control of a diffusion process in a region with fixed reflection on the boundary. (English) Zbl 0753.93080

Statistics and control of stochastic processes. Vol. 2, Pap. Steklov Semin., Moscow/USSR 1985-86, Transl. Ser. Math. Eng., 1-15 (1989).
MSC:  93E20

### The optimal value of Markov stopping problems with one-step look ahead policy. (English) Zbl 0658.60071

Reviewer: T.Bojdecki
MSC:  60G40
### Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains. (English) Zbl 0645.90099

Reviewer: J.Preater
MSC:  90C40
### Continuous dependence of stochastic control models on the noise distribution. (English) Zbl 0639.93068

Reviewer: Sv.Gaidov
### Markov decision programming with reward function depending on time. (Chinese. English summary) Zbl 0662.90087

J., Huazhong (Cent. China) Univ. Sci. Technol. 15, No. 1, 115-122 (1987).
MSC:  90C40

### On the existence of the optimal stopping moment in the optimal stopping problem of the Markov chain with discounting. (Russian. English summary) Zbl 0649.60053

Reviewer: T.Bojdecki
MSC:  60G40 60J10

### A two-armed bandit problem with one arm known including switching costs and terminal rewards. (English) Zbl 0617.62083

Reviewer: R.Theodorescu

MSC:  90C40
### Matrix inequality in distributional sense. (English) Zbl 0631.60044

Reviewer: R.A.Horn
MSC:  60G15 60E15 60H05

### Optimal control in the neighbourhood of an optimal equilibrium with examples from fisheries models. (English) Zbl 0612.92012

MSC:  92D25 90C39 49L20

### Parameter imprecision in finite state, finite action dynamic programs. (English) Zbl 0605.90129

Reviewer: K.-H.Waldmann
MSC:  90C40 90C39
### Optimality equations and sensitive optimality in bounded Markov decision processes. (English) Zbl 0587.90099

Reviewer: A.Nowak
MSC:  90C40 90C39
### General stochastic games. (English) Zbl 0598.90101

Probability theory, Proc. 7th Conf., Braşov/Rom. 1982, 643-647 (1984).
Reviewer: Y.Ohtsubo
MSC:  91A15 91A10 91A60

### Negative dynamic programming. (English) Zbl 0531.90094

Operations research, Proc. 12th Annu. Meet., Mannheim 1983, 475-478 (1984).
MSC:  90C39

### On the convergence of costs in the case of approximation of the continuous Kalman-Bucy scheme by discrete schemes. (Russian. English summary) Zbl 0574.60054

Tr. Tbilis. Univ. 239, Mat. Mekh. Astron. 15, 65-76 (1983).
MSC:  60G35 93E11

### Extreme-point solutions in Markov decision processes. (English) Zbl 0544.90098

Reviewer: G.Hübner
MSC:  90C40
### Markov decision problems with countable state spaces. Optimality criteria - algorithms - clustering. (English) Zbl 0543.90078

Mathematical Research, 15. Berlin: Akademie-Verlag. 174 p. DDR M 22.00 (1983).
Reviewer: M.Schäl
MSC:  90C40 90-02

### A method of maximizing probabilities in sequential problems. (Polish) Zbl 0524.60046

MSC:  60G40 62L15 60J10

### The average-optimal adaptive control of a Markov renewal model in presence of an unknown parameter. (English) Zbl 0518.90092

MSC:  90C40 60K20
### On semi-Markov controlled models with an average reward criterion. (English) Zbl 0499.60094

MSC:  60K15 90C40
### On the semi-Markov controlled models with the average reward criterion. (Russian) Zbl 0478.60091

MSC:  60K15 90C40

MSC:  90C39
### Stochastic dynamic programming. Successive approximations and nearly optimal strategies for Markov Decision Processes and Markov Games. (English) Zbl 0443.90055

Proefschrift, Technische Hogeschool Eindhoven. Amsterdam: Mathematisch Centrum. XI, 253 p. (1980).
MSC:  90C40 91A05 90C39

### Semi-Markov decision processes with countable state space and compact action space. (English) Zbl 0396.62068

MSC:  62M99 90C40

