### Zero and non-zero sum risk-sensitive semi-Markov games. (English)Zbl 07648503

MSC:  90C40 91A15
### An application of approximate dynamic programming in multi-period multi-product advertising budgeting. (English)Zbl 07599092

MSC:  90C39 90C40
### Optimizing the first response to sepsis: an electronic health record-based Markov decision process model. (English)Zbl 07646914

MSC:  90B50 90C40 60G40
### Low-rank representation of reinforcement learning policies. (English)Zbl 07639806

MSC:  68T05 90C40
### Mean-semivariance policy optimization via risk-averse reinforcement learning. (English)Zbl 07639805

MSC:  68T05 68T20 90C40
### Some multidimensional stochastic models of inventory control with a separable cost function. (English. Ukrainian original)Zbl 07630518

Cybern. Syst. Anal. 58, No. 4, 523-529 (2022); translation from Kibern. Sist. Anal. 58, No. 4, 38-45 (2022).
MSC:  90C40
### Optimal admission and routing with congestion-sensitive customer classes. (English)Zbl 07621944

MSC:  90B22 60K25 90C40
### Risk-sensitive Markov decision problems under model uncertainty: finite time horizon case. (English)Zbl 07616563

Yin, George (ed.) et al., Stochastic analysis, filtering, and stochastic optimization. A commemorative volume to honor Mark H. A. Davis’s contributions. Cham: Springer. 33-52 (2022).
MSC:  93E35 68T05 90C40
### Sensor scheduling design for complex networks under a distributed state estimation framework. (English)Zbl 07616421

MSC:  93E10 93B70 90C40
### Economic MPC of Markov decision processes: dissipativity in undiscounted infinite-horizon optimal control. (English)Zbl 07616406

MSC:  93B45 90C40 93D30
### $K$ competing queues with customer abandonment: optimality of a generalised $c \mu$-rule by the smoothed rate truncation method. (English)Zbl 1498.90244

MSC:  90C40 60K25
### The exponential cost optimality for finite horizon semi-Markov decision processes. (English)Zbl 07613047

MSC:  90C40 60Exx
### Optimal dynamic mining policy of blockchain selfish mining through sensitivity-based optimization. (English)Zbl 07610146

MSC:  90C40 90C90
### Semi-Markov decision processes with vector pay-offs. (English)Zbl 07597699

Giri, Debasis (ed.) et al., Proceedings of the seventh international conference on mathematics and computing, ICMC 2021, Shibpur, India, March 2–5, 2021. Singapore: Springer. Adv. Intell. Syst. Comput. 1412, 1011-1027 (2022).
MSC:  90C40 90C29
### Bellman’s principle of optimality and deep reinforcement learning for time-varying tasks. (English)Zbl 1500.93144

MSC:  93E20 90C40
### A Markovian decision model of adaptive cancer treatment and quality of life. (English)Zbl 1497.92107

MSC:  92C50 90C40
### Asymptotic optimality and rates of convergence of quantized stationary policies in continuous-time Markov decision processes. (English)Zbl 1497.90216

MSC:  90C40 93E20 60J27
### Distributionally robust Markov decision processes and their connection to risk measures. (English)Zbl 07592357

MSC:  90C40 90C17 91G70
### Optimal treatment of chronic kidney disease with uncertainty in obtaining a transplantable kidney: an MDP based approach. (English)Zbl 1500.90082

MSC:  90C40 90C90
### Fast global convergence of natural policy gradient methods with entropy regularization. (English)Zbl 1500.90086

MSC:  90C52 90C40
### Learning Markov models via low-rank optimization. (English)Zbl 1500.90083

MSC:  90C40 90C26
### Dynamic stochastic matching under limited time. (English)Zbl 1500.90053

MSC:  90C27 90C40
### Optimal pair-trade execution with generalized cross-impact. (English)Zbl 1497.91300

MSC:  91G15 90C40
### Dynamic air ticket pricing using reinforcement learning method. (English)Zbl 1497.90068

MSC:  90B22 90C40 91B24
### Markov decision processes on finite spaces with fuzzy total rewards. (English)Zbl 07584152

MSC:  90C40 93C40
### Hybrid offline/online optimization for energy management via reinforcement learning. (English)Zbl 07577873

Schaus, Pierre (ed.), Integration of constraint programming, artificial intelligence, and operations research. 19th international conference, CPAIOR 2022, Los Angeles, CA, USA, June 20–23, 2022. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 13292, 358-373 (2022).
MSC:  90C40 68T07
### Markov decision processes with incomplete information and semiuniform Feller transition probabilities. (English)Zbl 1498.90245

MSC:  90C40 90C39
### Machine learning and control theory. (English)Zbl 1493.68292

Trélat, Emmanuel (ed.) et al., Numerical control. Part A. Amsterdam: Elsevier/North Holland. Handb. Numer. Anal. 23, 531-558 (2022).
### Computing transience bounds of emergency call centers: a hierarchical timed Petri net approach. (English)Zbl 1499.68215

Bernardinello, Luca (ed.) et al., Application and theory of Petri nets and concurrency. 43rd international conference, PETRI NETS 2022, Bergen, Norway, June 19–24, 2022. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 13288, 90-112 (2022).
MSC:  68Q85 90C40
### Asymptotic optimality of quantized stationary policies in continuous-time Markov decision processes with Polish spaces. (English)Zbl 07572910

MSC:  60J10 90C40 93E20
### Quantile Markov decision processes. (English)Zbl 1496.90106

MSC:  90C40 90C39
### Search under accumulated pressure. (English)Zbl 1494.90038

MSC:  90B50 90C40
### Necessary conditions in generalized semi-infinite optimization with nondifferentiable convex data. (English)Zbl 07568097

MSC:  90C34 90C40 49J52
### Optimizing pig marketing decisions under price fluctuations. (English)Zbl 1496.90107

MSC:  90C40 90C90
### Wolfe type duality for nonsmooth optimization problems with vanishing constraints. (English)Zbl 1491.90171

MSC:  90C34 90C40 49J52
### Stability-constrained Markov decision processes using MPC. (English)Zbl 1497.93065

MSC:  93B45 93E15 90C40
### Optimal stopping time on semi-Markov processes with finite horizon. (English)Zbl 1491.60055

MSC:  60G40 60K15 90C40
### Stochastic control of a class of dynamical systems via path limits. (English)Zbl 1492.60062

MSC:  60F10 90C40 93E03
### First passage risk probability minimization for piecewise deterministic Markov decision processes. (English)Zbl 1489.90210

MSC:  90C40 60J27
### Decision making under uncertainty and reinforcement learning. Theory and algorithms. (English)Zbl 07556504

Intelligent Systems Reference Library 223. Cham: Springer (ISBN 978-3-031-07612-1/hbk; 978-3-031-10892-1/pbk; 978-3-031-07614-5/ebook). xiii, 243 p. (2022).
### A consumption and investment problem via a Markov decision processes approach with random horizon. (English)Zbl 1493.90081

MSC:  90B50 90C40 90C39
### Gradual-impulsive control for continuous-time Markov decision processes with total undiscounted costs and constraints: linear programming approach via a reduction method. (English)Zbl 1495.90230

MSC:  90C40 60J76
### Cooperative decision-making to minimize biased perceived value effect on business process decisions using partially observable Markov decision processes. (English)Zbl 1492.90070

MSC:  90B50 90C40
### Sufficiency of Markov policies for continuous-time jump Markov decision processes. (English)Zbl 1489.90208

MSC:  90C40 90C39 60J25
### Dynamic reinsurance in discrete time minimizing the insurer’s cost of capital. (English)Zbl 1494.91123

MSC:  91G05 90C40
### Nonzero-sum risk-sensitive continuous-time stochastic games with ergodic costs. (English)Zbl 1492.91039

MSC:  91A15 90C40
### Ergodic risk-sensitive control of Markov processes on countable state space revisited. (English)Zbl 1493.90218

MSC:  90C40 91B06 60J10
### On linear and super-linear convergence of natural policy gradient algorithm. (English)Zbl 1492.93060

MSC:  93B47 90C40
### Robust Markov decision processes with data-driven, distance-based ambiguity sets. (English)Zbl 1493.90215

MSC:  90C39 90C40 90C17
### Control systems and reinforcement learning. (English)Zbl 1492.93001

Cambridge: Cambridge University Press (ISBN 978-1-316-51196-1/hbk; 978-1-00-905187-3/ebook). xvi, 436 p. (2022).
### Mean-field Markov decision processes with common noise and open-loop controls. (English)Zbl 1491.90179

MSC:  90C40 49L20
### Constrained discounted stochastic games. (English)Zbl 1489.91016

MSC:  91A15 91A10 90C40
### Variable demand and multi-commodity flow in Markovian network equilibrium. (English)Zbl 1486.91019

MSC:  91A43 90C40 90B15
### LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems. (English)Zbl 1489.90207

MSC:  90C40 90C39
### Process-based risk measures and risk-averse control of discrete-time systems. (English)Zbl 1489.90077

MSC:  90C15 90C39 90C40
### A step-by-step tutorial on active inference and its application to empirical data. (English)Zbl 1484.91352

MSC:  91E10 91E30 90C40
### Multi-objective dynamic programming with limited precision. (English)Zbl 1486.90177

MSC:  90C29 90C40 90C39
### Multiply accelerated value iteration for nonsymmetric affine fixed point problems and application to Markov decision processes. (English)Zbl 1486.90203

MSC:  90C39 90C40 47H09
### Risk-sensitive semi-Markov decision problems with discounted cost and general utilities. (English)Zbl 1480.90248

MSC:  90C40 93E20
### Mathematics of reinforcement learning. (English)Zbl 1487.68198

Heng, Liao (ed.) et al., Mathematics for future computing and communications. Cambridge: Cambridge University Press. 329-374 (2022).
### A restless bandit model for resource allocation, competition, and reservation. (English)Zbl 1484.91225

MSC:  91B32 91B70 90C40
### Optimal sequential multiclass diagnosis. (English)Zbl 1482.90111

MSC:  90B50 90C40
### Sample complexity of asynchronous Q-learning: sharper analysis and variance reduction. (English)Zbl 1489.90209

MSC:  90C40 68T07
### On the equivalence of the integral and differential Bellman equations in impulse control problems. (English)Zbl 1482.49038

MSC:  49N25 49L20 90C40
### Learning to scan: a deep reinforcement learning approach for personalized scanning in CT imaging. (English)Zbl 1482.92044

MSC:  92C55 68T07 90C40
### Artificial neural networks and logic circuit synthesis. (English. Russian original)Zbl 1499.68328

Comput. Math. Model. 32, No. 4, 490-499 (2021); translation from Prikl. Mat. Inf. 68, 75-87 (2021).
MSC:  68T07 90C40 94C11
### Dynamic bus dispatch policies. (English)Zbl 1497.90048

Lasaulce, Samson (ed.) et al., Network games, control and optimization. 10th international conference, NetGCooP 2020, Cargèse, Corsica, France, September 22–24, 2021. Proceedings. Cham: Springer. Commun. Comput. Inf. Sci. 1354, 139-153 (2021).
MSC:  90B06 90C40
### Approximation and mean field control of systems of large populations. (English)Zbl 1496.49020

Hernández-Hernández, Daniel (ed.) et al., Advances in probability and mathematical statistics. CLAPEM 2019. Contributions of the 15th Latin American congress of probability and mathematical statistics, Mérida, Mexico, December 2–6, 2019. Cham: Birkhäuser. Prog. Probab. 79, 103-122 (2021).
