## Found 4,885 Documents (Results 1–100)

### Zero and non-zero sum risk-sensitive semi-Markov games. (English)Zbl 07648503

MSC:  90C40 91A15
Full Text:

Full Text:

### An application of approximate dynamic programming in multi-period multi-product advertising budgeting. (English)Zbl 07599092

MSC:  90C39 90C40
Full Text:

### Optimizing the first response to sepsis: an electronic health record-based Markov decision process model. (English)Zbl 07646914

MSC:  90B50 90C40 60G40
Full Text:

### Low-rank representation of reinforcement learning policies. (English)Zbl 07639806

MSC:  68T05 90C40
Full Text:

### Mean-semivariance policy optimization via risk-averse reinforcement learning. (English)Zbl 07639805

MSC:  68T05 68T20 90C40
Full Text:

Full Text:

Full Text:

Full Text:

### Some multidimensional stochastic models of inventory control with a separable cost function. (English. Ukrainian original)Zbl 07630518

Cybern. Syst. Anal. 58, No. 4, 523-529 (2022); translation from Kibern. Sist. Anal. 58, No. 4, 38-45 (2022).
Full Text:

MSC:  90C40
Full Text:

### Optimal admission and routing with congestion-sensitive customer classes. (English)Zbl 07621944

MSC:  90B22 60K25 90C40
Full Text:

### Risk-sensitive Markov decision problems under model uncertainty: finite time horizon case. (English)Zbl 07616563

Yin, George (ed.) et al., Stochastic analysis, filtering, and stochastic optimization. A commemorative volume to honor Mark H. A. Davis’s contributions. Cham: Springer. 33-52 (2022).
MSC:  93E35 68T05 90C40
Full Text:

### Sensor scheduling design for complex networks under a distributed state estimation framework. (English)Zbl 07616421

MSC:  93E10 93B70 90C40
Full Text:

Full Text:

### Economic MPC of Markov decision processes: dissipativity in undiscounted infinite-horizon optimal control. (English)Zbl 07616406

MSC:  93B45 90C40 93D30
Full Text:

Full Text:

Full Text:

### $$K$$ competing queues with customer abandonment: optimality of a generalised $$c \mu$$-rule by the smoothed rate truncation method. (English)Zbl 1498.90244

MSC:  90C40 60K25
Full Text:

Full Text:

### The exponential cost optimality for finite horizon semi-Markov decision processes. (English)Zbl 07613047

MSC:  90C40 60Exx
Full Text:

### Optimal dynamic mining policy of blockchain selfish mining through sensitivity-based optimization. (English)Zbl 07610146

MSC:  90C40 90C90
Full Text:

### Semi-Markov decision processes with vector pay-offs. (English)Zbl 07597699

Giri, Debasis (ed.) et al., Proceedings of the seventh international conference on mathematics and computing, ICMC 2021, Shibpur, India, March 2–5, 2021. Singapore: Springer. Adv. Intell. Syst. Comput. 1412, 1011-1027 (2022).
MSC:  90C40 90C29
Full Text:

### Bellman’s principle of optimality and deep reinforcement learning for time-varying tasks. (English)Zbl 1500.93144

MSC:  93E20 90C40
Full Text:

### A Markovian decision model of adaptive cancer treatment and quality of life. (English)Zbl 1497.92107

MSC:  92C50 90C40
Full Text:

### Asymptotic optimality and rates of convergence of quantized stationary policies in continuous-time Markov decision processes. (English)Zbl 1497.90216

MSC:  90C40 93E20 60J27
Full Text:

Full Text:

Full Text:

### Distributionally robust Markov decision processes and their connection to risk measures. (English)Zbl 07592357

MSC:  90C40 90C17 91G70
Full Text:

### Optimal treatment of chronic kidney disease with uncertainty in obtaining a transplantable kidney: an MDP based approach. (English)Zbl 1500.90082

MSC:  90C40 90C90
Full Text:

### Fast global convergence of natural policy gradient methods with entropy regularization. (English)Zbl 1500.90086

MSC:  90C52 90C40
Full Text:

### Learning Markov models via low-rank optimization. (English)Zbl 1500.90083

MSC:  90C40 90C26
Full Text:

### Dynamic stochastic matching under limited time. (English)Zbl 1500.90053

MSC:  90C27 90C40
Full Text:

MSC:  90C40
Full Text:

### Optimal pair-trade execution with generalized cross-impact. (English)Zbl 1497.91300

MSC:  91G15 90C40
Full Text:

### Dynamic air ticket pricing using reinforcement learning method. (English)Zbl 1497.90068

MSC:  90B22 90C40 91B24
Full Text:

### Markov decision processes on finite spaces with fuzzy total rewards. (English)Zbl 07584152

MSC:  90C40 93C40
Full Text:

### Hybrid offline/online optimization for energy management via reinforcement learning. (English)Zbl 07577873

Schaus, Pierre (ed.), Integration of constraint programming, artificial intelligence, and operations research. 19th international conference, CPAIOR 2022, Los Angeles, CA, USA, June 20–23, 2022. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 13292, 358-373 (2022).
MSC:  90C40 68T07
Full Text:

### Markov decision processes with incomplete information and semiuniform Feller transition probabilities. (English)Zbl 1498.90245

MSC:  90C40 90C39
Full Text:

### Machine learning and control theory. (English)Zbl 1493.68292

Trélat, Emmanuel (ed.) et al., Numerical control. Part A. Amsterdam: Elsevier/North Holland. Handb. Numer. Anal. 23, 531-558 (2022).
Full Text:

### Computing transience bounds of emergency call centers: a hierarchical timed Petri net approach. (English)Zbl 1499.68215

Bernardinello, Luca (ed.) et al., Application and theory of Petri nets and concurrency. 43rd international conference, PETRI NETS 2022, Bergen, Norway, June 19–24, 2022. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 13288, 90-112 (2022).
MSC:  68Q85 90C40
Full Text:

### Asymptotic optimality of quantized stationary policies in continuous-time Markov decision processes with Polish spaces. (English)Zbl 07572910

MSC:  60J10 90C40 93E20
Full Text:

Full Text:

### Quantile Markov decision processes. (English)Zbl 1496.90106

MSC:  90C40 90C39
Full Text:

### Search under accumulated pressure. (English)Zbl 1494.90038

MSC:  90B50 90C40
Full Text:

### Necessary conditions in generalized semi-infinite optimization with nondifferentiable convex data. (English)Zbl 07568097

MSC:  90C34 90C40 49J52
Full Text:

### Optimizing pig marketing decisions under price fluctuations. (English)Zbl 1496.90107

MSC:  90C40 90C90
Full Text:

### Wolfe type duality for nonsmooth optimization problems with vanishing constraints. (English)Zbl 1491.90171

MSC:  90C34 90C40 49J52
Full Text:

### Stability-constrained Markov decision processes using MPC. (English)Zbl 1497.93065

MSC:  93B45 93E15 90C40
Full Text:

Full Text:

### Optimal stopping time on semi-Markov processes with finite horizon. (English)Zbl 1491.60055

MSC:  60G40 60K15 90C40
Full Text:

### Stochastic control of a class of dynamical systems via path limits. (English)Zbl 1492.60062

MSC:  60F10 90C40 93E03
Full Text:

### First passage risk probability minimization for piecewise deterministic Markov decision processes. (English)Zbl 1489.90210

MSC:  90C40 60J27
Full Text:

Full Text:

### Decision making under uncertainty and reinforcement learning. Theory and algorithms. (English)Zbl 07556504

Intelligent Systems Reference Library 223. Cham: Springer (ISBN 978-3-031-07612-1/hbk; 978-3-031-10892-1/pbk; 978-3-031-07614-5/ebook). xiii, 243 p. (2022).
Full Text:

### A consumption and investment problem via a Markov decision processes approach with random horizon. (English)Zbl 1493.90081

MSC:  90B50 90C40 90C39
Full Text:

### Gradual-impulsive control for continuous-time Markov decision processes with total undiscounted costs and constraints: linear programming approach via a reduction method. (English)Zbl 1495.90230

MSC:  90C40 60J76
Full Text:

### Cooperative decision-making to minimize biased perceived value effect on business process decisions using partially observable Markov decision processes. (English)Zbl 1492.90070

MSC:  90B50 90C40
Full Text:

Full Text:

Full Text:

Full Text:

Full Text:

### Sufficiency of Markov policies for continuous-time jump Markov decision processes. (English)Zbl 1489.90208

MSC:  90C40 90C39 60J25
Full Text:

MSC:  90C40
Full Text:

### Dynamic reinsurance in discrete time minimizing the insurer’s cost of capital. (English)Zbl 1494.91123

MSC:  91G05 90C40
Full Text:

### Nonzero-sum risk-sensitive continuous-time stochastic games with ergodic costs. (English)Zbl 1492.91039

MSC:  91A15 90C40
Full Text:

### Ergodic risk-sensitive control of Markov processes on countable state space revisited. (English)Zbl 1493.90218

MSC:  90C40 91B06 60J10
Full Text:

### On linear and super-linear convergence of natural policy gradient algorithm. (English)Zbl 1492.93060

MSC:  93B47 90C40
Full Text:

### Robust Markov decision processes with data-driven, distance-based ambiguity sets. (English)Zbl 1493.90215

MSC:  90C39 90C40 90C17
Full Text:

### Control systems and reinforcement learning. (English)Zbl 1492.93001

Cambridge: Cambridge University Press (ISBN 978-1-316-51196-1/hbk; 978-1-00-905187-3/ebook). xvi, 436 p. (2022).
Full Text:

Full Text:

### Mean-field Markov decision processes with common noise and open-loop controls. (English)Zbl 1491.90179

MSC:  90C40 49L20
Full Text:

### Constrained discounted stochastic games. (English)Zbl 1489.91016

MSC:  91A15 91A10 90C40
Full Text:

Full Text:

### Variable demand and multi-commodity flow in Markovian network equilibrium. (English)Zbl 1486.91019

MSC:  91A43 90C40 90B15
Full Text:

### LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems. (English)Zbl 1489.90207

MSC:  90C40 90C39
Full Text:

Full Text:

### Process-based risk measures and risk-averse control of discrete-time systems. (English)Zbl 1489.90077

MSC:  90C15 90C39 90C40
Full Text:

Full Text:

### A step-by-step tutorial on active inference and its application to empirical data. (English)Zbl 1484.91352

MSC:  91E10 91E30 90C40
Full Text:

### Multi-objective dynamic programming with limited precision. (English)Zbl 1486.90177

MSC:  90C29 90C40 90C39
Full Text:

Full Text:

### Multiply accelerated value iteration for nonsymmetric affine fixed point problems and application to Markov decision processes. (English)Zbl 1486.90203

MSC:  90C39 90C40 47H09
Full Text:

### Risk-sensitive semi-Markov decision problems with discounted cost and general utilities. (English)Zbl 1480.90248

MSC:  90C40 93E20
Full Text:

### Mathematics of reinforcement learning. (English)Zbl 1487.68198

Heng, Liao (ed.) et al., Mathematics for future computing and communications. Cambridge: Cambridge University Press. 329-374 (2022).
Full Text:

### A restless bandit model for resource allocation, competition, and reservation. (English)Zbl 1484.91225

MSC:  91B32 91B70 90C40
Full Text:

### Optimal sequential multiclass diagnosis. (English)Zbl 1482.90111

MSC:  90B50 90C40
Full Text:

### Sample complexity of asynchronous Q-learning: sharper analysis and variance reduction. (English)Zbl 1489.90209

MSC:  90C40 68T07
Full Text:

### On the equivalence of the integral and differential Bellman equations in impulse control problems. (English)Zbl 1482.49038

MSC:  49N25 49L20 90C40
Full Text:

MSC:  90C40
Full Text:

### Learning to scan: a deep reinforcement learning approach for personalized scanning in CT imaging. (English)Zbl 1482.92044

MSC:  92C55 68T07 90C40
Full Text:

MSC:  90C40
Full Text:

Full Text:

Full Text:

Full Text:

Full Text:

MSC:  90C40
Full Text:

### Artificial neural networks and logic circuit synthesis. (English. Russian original)Zbl 1499.68328

Comput. Math. Model. 32, No. 4, 490-499 (2021); translation from Prikl. Mat. Inf. 68, 75-87 (2021).
MSC:  68T07 90C40 94C11
Full Text:

### Dynamic bus dispatch policies. (English)Zbl 1497.90048

Lasaulce, Samson (ed.) et al., Network games, control and optimization. 10th international conference, NetGCooP 2020, Cargèse, Corsica, France, September 22–24, 2021. Proceedings. Cham: Springer. Commun. Comput. Inf. Sci. 1354, 139-153 (2021).
MSC:  90B06 90C40
Full Text:

### Approximation and mean field control of systems of large populations. (English)Zbl 1496.49020

Hernández-Hernández, Daniel (ed.) et al., Advances in probability and mathematical statistics. CLAPEM 2019. Contributions of the 15th Latin American congress of probability and mathematical statistics, Mérida, Mexico, December 2–6, 2019. Cham: Birkhäuser. Prog. Probab. 79, 103-122 (2021).
Full Text:

all top 5

all top 5

all top 5

all top 3

all top 3