Jain, Adit; Krishnamurthy, Vikram Controlling stochastic gradient descent using stochastic approximation for robust distributed optimization. (English) Zbl 07985290 Numer. Algebra Control Optim. 15, No. 1, 173-195 (2025). MSC: 62L20 91A15 90C40 60J20 × Cite Format Result Cite Review PDF Full Text: DOI
Dufour, F.; Genadot, A.; Costa, O. L. V. Minimum contrast estimators for piecewise deterministic Markov processes. (English) Zbl 07985282 Numer. Algebra Control Optim. 15, No. 1, 1-14 (2025). MSC: 90C40 60J25 × Cite Format Result Cite Review PDF Full Text: DOI
Mifrani, Anas A counterexample and a corrective to the vector extension of the Bellman equations of a Markov decision process. (English) Zbl 07985171 Ann. Oper. Res. 345, No. 1, 351-369 (2025). MSC: 90Cxx 91Bxx 62Cxx × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Gracia, Ibon; Boskos, Dimitris; Lahijanian, Morteza; Laurenti, Luca; Mazo, Manuel jun. Efficient strategy synthesis for switched stochastic systems with distributional uncertainty. (English) Zbl 07983415 Nonlinear Anal., Hybrid Syst. 55, Article ID 101554, 22 p. (2025). MSC: 93-XX 90-XX × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Chen, Fang; Guo, Xin Zero-sum semi-Markov games with the risk-sensitive average reward criterion. (English) Zbl 07980155 J. Optim. Theory Appl. 204, No. 3, Paper No. 42, 30 p. (2025). MSC: 90C40 91A15 × Cite Format Result Cite Review PDF Full Text: DOI
Yüksel, Serdar Another look at partially observed optimal stochastic control: existence, ergodicity, and approximations without belief-reduction. (English) Zbl 07971569 Appl. Math. Optim. 91, No. 1, Paper No. 16, 42 p. (2025). MSC: 60J05 60J20 93E20 93E11 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Li, Yan; Lan, Guanghui Policy mirror descent inherently explores action space. (English) Zbl 07966996 SIAM J. Optim. 35, No. 1, 116-156 (2025). MSC: 90C40 90C15 90C26 68Q25 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Feinberg, Eugene A.; He, Gaojin Properties of Turnpike Functions for Discounted Finite MDPs. arXiv:2502.05375 Preprint, arXiv:2502.05375 [math.OC] (2025). MSC: 90C40 60J05 × Cite Format Result Cite Full Text: arXiv
Cruzado, Omar Briceno Standardized Measurement Approach (SMA) vs Advanced Measurement Approaches (AMA): A Critical Review of Approaches in Operational Risk. arXiv:2502.00962 Preprint, arXiv:2502.00962 [q-fin.RM] (2025). MSC: 91B06 62P99 62M05 × Cite Format Result Cite Full Text: arXiv
de Jongh, M. C.; Boucherie, Richard J.; van Lieshout, M. N. M. Controlling the low-temperature Ising model using spatiotemporal Markov decision theory. arXiv:2501.03668 Preprint, arXiv:2501.03668 [math.OC] (2025). MSC: 90C40 82B20 60G60 × Cite Format Result Cite Full Text: arXiv
Cannarsa, Piermarco; Gaubert, Stephane; Mendico, Cristian; Quincampoix, Marc Analysis of the vanishing discount limit for optimal control problems in continuous and discrete time. (English) Zbl 07984304 Math. Control Relat. Fields 14, No. 4, 1275-1305 (2024). MSC: 49L20 93C15 90C39 90C40 35F21 35B40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Blancas-Rivera, Rubén; Jasso-Fuentes, Héctor Discrete-time hybrid control with risk-sensitive discounted costs. (English) Zbl 07983884 Discrete Event Dyn. Syst. 34, No. 4, 659-687 (2024). MSC: 93-XX × Cite Format Result Cite Review PDF Full Text: DOI
Feng, Yiting; Zhou, Ye; Ho, Hann Woei; Dong, Hongyang; Zhao, Xiaowei Online adaptive critic designs with tensor product B-splines and incremental model techniques. (English) Zbl 07979168 J. Franklin Inst. 361, No. 18, Article ID 107357, 21 p. (2024). MSC: 93C40 65D07 68T07 90C39 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Chatterjee, Krishnendu; Doyen, Laurent Stochastic processes with expected stopping time. (English) Zbl 07977493 Log. Methods Comput. Sci. 20, No. 4, Paper No. 11, 34 p. (2024). MSC: 60G40 60J10 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Yang, Bo; Nadarajah, Selvaprabu; Secomandi, Nicola Least squares Monte Carlo and pathwise optimization for merchant energy production. (English) Zbl 07976721 Oper. Res. 72, No. 6, 2758-2775 (2024). MSC: 90Cxx × Cite Format Result Cite Review PDF Full Text: DOI
Gast, Nicolas; Gaujal, Bruno; Yan, Chen Linear program-based policies for restless bandits: necessary and sufficient conditions for (exponentially fast) asymptotic optimality. (English) Zbl 07975720 Math. Oper. Res. 49, No. 4, 2468-2491 (2024). MSC: 90C40 90C05 90B99 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Jasso-Fuentes, Héctor; Salgado-Suárez, Gladys D. A discrete-time benchmark tracking problem in two markets subject to random environments. (English) Zbl 07971819 OR Spectrum 46, No. 4, 1265-1294 (2024). MSC: 91G15 82B41 60K37 90C40 93C30 × Cite Format Result Cite Review PDF Full Text: DOI
Duan, Yaqi; Wang, Mengdi; Wainwright, Martin J. Optimal policy evaluation using kernel-based temporal difference methods. (English) Zbl 07961543 Ann. Stat. 52, No. 5, 1927-1952 (2024). MSC: 62G05 62M05 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Bäuerle, Nicole; Pitera, Marcin; Stettner, Łukasz Blackwell optimality and policy stability for long-run risk-sensitive stochastic control. (English) Zbl 07957556 SIAM J. Control Optim. 62, No. 6, 3172-3194 (2024). MSC: 60J05 60J35 90C39 90C40 93C55 93E20 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Main, James C. A.; Randour, Mickael Different strokes in randomised strategies: revisiting Kuhn’s theorem under finite-memory assumptions. (English) Zbl 07953368 Inf. Comput. 301, Article ID 105229, 31 p. (2024). MSC: 68Qxx × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Ma, Minghong; Yang, Fei Dynamic migratory beekeeping route recommendation based on spatio-temporal distribution of nectar sources. (English) Zbl 07950225 Ann. Oper. Res. 341, No. 2-3, 1075-1105 (2024). MSC: 90B06 90C39 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Jasso-Fuentes, Héctor; Salgado-Suárez, Gladys D. Discrete-time hybrid control processes with unbounded costs. (English) Zbl 07948509 Appl. Math. Optim. 90, No. 3, Paper No. 51, 40 p. (2024). MSC: 93C55 93C30 90C40 90C39 × Cite Format Result Cite Review PDF Full Text: DOI
Ansari, Sina; Enayati, Shakiba; Akhavan-Tabatabaei, Raha; Kapp, Julie M. Curbing the opioid crisis: optimal dynamic policies for preventive and mitigating interventions. (English) Zbl 07948411 Decis. Anal. 21, No. 3, 165-193 (2024). MSC: 92D30 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Demirci, Yunus Emre; Kara, Ali Devran; Yüksel, Serdar Average cost optimality of partially observed MDPs: contraction of nonlinear filters and existence of optimal solutions and approximations. (English) Zbl 07946596 SIAM J. Control Optim. 62, No. 6, 2859-2883 (2024). MSC: 90C40 93E11 93E20 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Tunçalp, Feray; Örmeci, Lerzan Appointment requests from multiple channels: characterizing optimal set of appointment days to offer with patient preferences. (English) Zbl 07946008 Stoch. Syst. 14, No. 3, 273-295 (2024). MSC: 60K99 × Cite Format Result Cite Review PDF Full Text: DOI
Dufour, François; Prieto-Rumeau, Tomás Maximizing the probability of visiting a set infinitely often for a Markov decision process with Borel state and action spaces. (English) Zbl 07945548 J. Appl. Probab. 61, No. 4, 1424-1447 (2024). MSC: 90C40 60J10 × Cite Format Result Cite Review PDF Full Text: DOI
Hobert, James P.; Khare, Kshitij Recurrence and transience of a Markov chain on \(\mathbb{Z}^+\) and evaluation of prior distributions for a Poisson mean. (English) Zbl 07945545 J. Appl. Probab. 61, No. 4, 1361-1379 (2024). MSC: 60J10 62C15 × Cite Format Result Cite Review PDF Full Text: DOI
Costa, O. L. V.; Dufour, F.; Genadot, A. Adaptive average control for piecewise deterministic Markov processes. (English) Zbl 07942357 Syst. Control Lett. 192, Article ID 105894, 8 p. (2024). MSC: 93C40 93E03 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Yu, Huizhen On strategic measures and optimality properties in discrete-time stochastic control with universally measurable policies. (English) Zbl 07940068 Math. Oper. Res. 49, No. 3, 1734-1760 (2024). MSC: 60J05 90C40 93E20 03E15 49J55 91B05 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Zheng, Rui Structured replacement policies for a system subject to random mission types. (English) Zbl 07939162 Nav. Res. Logist. 71, No. 7, 1055-1069 (2024). MSC: 90B25 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Cayci, Semih; He, Niao; Srikant, R. Finite-time analysis of natural actor-critic for POMDPs. (English) Zbl 07938968 SIAM J. Math. Data Sci. 6, No. 4, 869-896 (2024). MSC: 68T05 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Arık, Ayşe; Cairns, Andrew J. G.; Dodd, Erengul; Macdonald, Angus S.; Streftaris, George The effect of the COVID-19 health disruptions on breast cancer mortality for older women: a semi-Markov modelling approach. (English) Zbl 07938352 Scand. Actuar. J. 2024, No. 8, 848-879 (2024). MSC: 91G05 92C50 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Liu, Keqin; Weber, Richard; Zhang, Chengzhong Low-complexity algorithm for restless bandits with imperfect observations. (English) Zbl 07935771 Math. Methods Oper. Res. 100, No. 2, 467-508 (2024). MSC: 90B36 93E20 93E35 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Singh, Vartika; Kavitha, Veeraruna Partial information games and competitive advertising. (English) Zbl 07932125 Dyn. Games Appl. 14, No. 4, 888-920 (2024). MSC: 91D30 90B60 91A27 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Gao, Xuefeng; Zhou, Xun Yu Logarithmic regret bounds for continuous-time average-reward Markov decision processes. (English) Zbl 07916661 SIAM J. Control Optim. 62, No. 5, 2529-2556 (2024). MSC: 90C40 60J27 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Jasso-Fuentes, Héctor; Prieto-Rumeau, Tomás Constrained Markov decision processes with non-constant discount factor. (English) Zbl 07916642 J. Optim. Theory Appl. 202, No. 2, 897-931 (2024). MSC: 90C40 90C05 93E20 × Cite Format Result Cite Review PDF Full Text: DOI
Geevers, Kevin; van Hezewijk, Lotte; Mes, Martijn R. K. Multi-echelon inventory optimization using deep reinforcement learning. (English) Zbl 07916330 CEJOR, Cent. Eur. J. Oper. Res. 32, No. 3, 653-683 (2024). MSC: 90B05 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Xu, Jie; Zhang, Hui; Shi, Yihan; Xiangli, Ying Transfer entropy on collective motion with undeclared loose leader-follower (LLF) structure. (English) Zbl 07916124 Inf. Sci. 684, Article ID 121248, 20 p. (2024). MSC: 76A30 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Li, Yan; Lan, Guanghui; Zhao, Tuo Homotopic policy mirror descent: policy convergence, algorithmic regularization, and improved sample complexity. (English) Zbl 07915921 Math. Program. 207, No. 1-2 (A), 457-513 (2024). MSC: 90C40 90C15 90C26 68Q25 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Neufeld, Ariel; Sester, Julian Robust \(Q\)-learning algorithm for Markov decision processes under Wasserstein uncertainty. (English) Zbl 07913884 Automatica 168, Article ID 111825, 13 p. (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Gosavi, Abhijit; Le, Vy K. Maintenance optimization in a digital twin for industry 4.0. (English) Zbl 07910183 Ann. Oper. Res. 340, No. 1, 245-269 (2024). MSC: 90B25 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Pternea, Moschoula; Singh, Prerna; Chakraborty, Abir; Oruganti, Yagna; Milletari, Mirco; Bapat, Sayli; Jiang, Kebei The RL/LLM taxonomy tree: reviewing synergies between reinforcement learning and large language models. (English) Zbl 1546.68017 J. Artif. Intell. Res. (JAIR) 80, 1525-1573 (2024). MSC: 68T05 68T20 68T40 68T50 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Zhu, Yi; Dong, Jing; Lam, Henry Uncertainty quantification and exploration for reinforcement learning. (English) Zbl 07907221 Oper. Res. 72, No. 4, 1689-1709 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Désir, Antoine; Goyal, Vineet; Jiang, Bo; Xie, Tian; Zhang, Jiawei Robust assortment optimization under the Markov chain choice model. (English) Zbl 07907216 Oper. Res. 72, No. 4, 1595-1614 (2024). MSC: 90C40 90C17 × Cite Format Result Cite Review PDF Full Text: DOI
Yüksel, Serdar On Borkar and Young relaxed control topologies and continuous dependence of invariant measures on control policy. (English) Zbl 07902755 SIAM J. Control Optim. 62, No. 4, 2367-2386 (2024). MSC: 90C40 93E20 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Nguyen, Hoang Nam; Lisser, Abdel; Singh, Vikas Vikram Distributionally robust chance-constrained Markov decision processes with random payoff. (English) Zbl 1545.60092 Appl. Math. Optim. 90, No. 1, Paper No. 25, 39 p. (2024). MSC: 60J35 90C40 90C25 × Cite Format Result Cite Review PDF Full Text: DOI
Bäuerle, Nicole; Höfer, Sebastian Continuous-time mean field Markov decision models. (English) Zbl 07898801 Appl. Math. Optim. 90, No. 1, Paper No. 12, 32 p. (2024). MSC: 90C40 60G55 60F17 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Lee, Donghwan Final iteration convergence bound of Q-learning: switching system approach. (English) Zbl 1546.90273 IEEE Trans. Autom. Control 69, No. 7, 4765-4772 (2024). MSC: 90C40 68T07 93E35 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Gao, Yulong; Abate, Alessandro; Xie, Lihua; Johansson, Karl Henrik Distributional reachability for Markov decision processes: theory and applications. (English) Zbl 1546.93045 IEEE Trans. Autom. Control 69, No. 7, 4598-4613 (2024). MSC: 93B03 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Delimpaltadakis, Giannis; Laurenti, Luca; Mazo, Manuel Formal analysis of the sampling behavior of stochastic event-triggered control. (English) Zbl 1546.93659 IEEE Trans. Autom. Control 69, No. 7, 4491-4505 (2024). MSC: 93E03 90C40 93C57 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Calvo-Fullana, Miguel; Paternain, Santiago; Chamon, Luiz F. O.; Ribeiro, Alejandro State augmented constrained reinforcement learning: overcoming the limitations of learning with rewards. (English) Zbl 1546.90259 IEEE Trans. Autom. Control 69, No. 7, 4275-4290 (2024). MSC: 90C40 68T07 93E35 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Yekkehkhany, Ali; Feng, Han; Ying, Donghao; Lavaei, Javad A hitting time analysis for stochastic time-varying functions with applications to adversarial attacks on computation of Markov decision processes. (English) Zbl 1546.90285 IEEE Trans. Autom. Control 69, No. 6, 3615-3630 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Sadri, Shadi; Fatemi Ghomi, S. M. T.; Dehghanian, Amin Analysis of a time-cost trade-off in a resource-constrained GERT project scheduling problem using the Markov decision process. (English) Zbl 1545.90081 Ann. Oper. Res. 338, No. 1, 535-568 (2024). MSC: 90B35 90B36 90B50 90C40 90C15 90C59 90C05 × Cite Format Result Cite Review PDF Full Text: DOI
Das, Souvik; Dey, Priyanka; Chatterjee, Debasish Almost sure detection of the presence of malicious components in cyber-physical systems. (English) Zbl 1542.93148 Automatica 167, Article ID 111789, 11 p. (2024). MSC: 93B70 93C83 93E03 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Huo, Haifeng; Cui, Jinhua; Wen, Xian Minimizing risk probability for infinite discounted piecewise deterministic Markov decision processes. (English) Zbl 07893461 Kybernetika 60, No. 3, 357-378 (2024). MSC: 90C40 60Exx × Cite Format Result Cite Review PDF Full Text: DOI
Soeffker, Ninja; Ulmer, Marlin W.; Mattfeld, Dirk C. Balancing resources for dynamic vehicle routing with stochastic customer requests. (English) Zbl 1544.90039 OR Spectrum 46, No. 2, 331-373 (2024). MSC: 90B06 90C40 90C27 × Cite Format Result Cite Review PDF Full Text: DOI
Zhang, Amy B. Z.; Gurvich, Itai A low-rank approximation for MDPs via moment coupling. (English) Zbl 07888759 Oper. Res. 72, No. 3, 1255-1277 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Bennett, Andrew; Kallus, Nathan Proximal reinforcement learning: efficient off-policy evaluation in partially observed Markov decision processes. (English) Zbl 07888749 Oper. Res. 72, No. 3, 1071-1086 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Simchowitz, Max; Slivkins, Aleksandrs Exploration and incentives in reinforcement learning. (English) Zbl 07888744 Oper. Res. 72, No. 3, 983-998 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Braverman, Anton; Dai, J. G.; Fang, Xiao High-order steady-state diffusion approximations. (English) Zbl 07887715 Oper. Res. 72, No. 2, 604-616 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Chen, Xian; Wei, Qingda Risk-sensitive average Markov decision processes in general spaces. (English) Zbl 07885159 SIAM J. Control Optim. 62, No. 4, 2115-2147 (2024). MSC: 90C40 60J10 × Cite Format Result Cite Review PDF Full Text: DOI
Tsur, Dor; Aharoni, Ziv; Goldfeld, Ziv; Permuter, Haim Data-driven optimization of directed information over discrete alphabets. (English) Zbl 1547.94280 IEEE Trans. Inf. Theory 70, No. 3, 1652-1670 (2024). MSC: 94A40 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Gutierrez-Pachas, Daniel A.; Costa, Eduardo F.; Vargas, Alessandro N. Linear quadratic control problem of systems with Markov jumps in reverse time and observation with anticipation of the jumps. (English) Zbl 1546.90267 IEEE Trans. Autom. Control 69, No. 4, 2469-2475 (2024). MSC: 90C40 60J10 93E20 × Cite Format Result Cite Review PDF Full Text: DOI
Su, Yan; Li, Junping Admission control of double-sided queues with multiple customer types. (English) Zbl 1546.90053 IEEE Trans. Autom. Control 69, No. 3, 1960-1966 (2024). MSC: 90B22 60K25 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Kharade, Sonam; Sutavani, Sarang; Yerudkar, Amol; Wagh, Sushama; Liu, Yang; Del Vecchio, Carmen; Singh, N. M. On exact embedding framework for optimal control of Markov decision processes. (English) Zbl 1546.90272 IEEE Trans. Autom. Control 69, No. 2, 1316-1323 (2024). MSC: 90C40 93E20 × Cite Format Result Cite Review PDF Full Text: DOI
Ma, Chenglin; Zhao, Huaizhong Optimal control of probability on a target set for continuous-time Markov chains. (English) Zbl 1546.93812 IEEE Trans. Autom. Control 69, No. 2, 1202-1209 (2024). MSC: 93E20 49L20 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Bahari Kordabad, Arash; Zanon, Mario; Gros, Sebastien Equivalence of optimality criteria for Markov decision process and model predictive control. (English) Zbl 1546.90256 IEEE Trans. Autom. Control 69, No. 2, 1149-1156 (2024). MSC: 90C40 93B45 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Chang, Hyeong Soo On supervised online rolling-horizon control for infinite-horizon discounted Markov decision processes. (English) Zbl 1546.90261 IEEE Trans. Autom. Control 69, No. 2, 1060-1065 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Gargiani, Matilde; Martinelli, Andrea; Martinez, Max Ruts; Lygeros, John Parallel and flexible dynamic programming via the mini-batch Bellman operator. (English) Zbl 1546.90251 IEEE Trans. Autom. Control 69, No. 1, 455-462 (2024). MSC: 90C39 90C40 × Cite Format Result Cite Review PDF Full Text: DOI Link
Molloy, Timothy L.; Nair, Girish N. Entropy-regularized partially observed Markov decision processes. (English) Zbl 1546.90277 IEEE Trans. Autom. Control 69, No. 1, 379-386 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Ahmadi, Mohamadreza; Rosolia, Ugo; Ingham, Michel D.; Murray, Richard M.; Ames, Aaron D. Risk-averse decision making under uncertainty. (English) Zbl 1546.90255 IEEE Trans. Autom. Control 69, No. 1, 55-68 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv Link
Chen, Gongpu; Liew, Soung Chang An index policy for minimizing the uncertainty-of-information of Markov sources. (English) Zbl 1547.94198 IEEE Trans. Inf. Theory 70, No. 1, 698-721 (2024). MSC: 94A17 94A15 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Golui, Subrata; Pal, Chandan; Manikandan, R.; Sobhanan, Abhay Optimal control of a dynamic production-inventory system with various cost criteria. (English) Zbl 1545.90006 Ann. Oper. Res. 337, No. 1, 75-103 (2024). MSC: 90B05 90B30 90B22 90C40 93E20 60K25 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Hu, Jiaqiao; Yang, Xiangyu; Hu, Jian-Qiang; Peng, Yijie A Q-learning algorithm for Markov decision processes with continuous state spaces. (English) Zbl 1545.93697 Syst. Control Lett. 187, Article ID 105782, 8 p. (2024). MSC: 93E20 90C40 68T05 × Cite Format Result Cite Review PDF Full Text: DOI
Liu, Congying; Zhang, Yining; Zhang, Wenzhao Discrete-time nonstationary average stochastic games. (English) Zbl 1545.91027 J. Dyn. Games 11, No. 3, 265-279 (2024). MSC: 91A15 91A50 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Huang, Tanhao; Lu, Xiaoyang; Chen, Jinwen A discount vanishing approximation for Markov decision processes with risk sensitivity. (English) Zbl 07878108 J. Dyn. Control Syst. 30, No. 2, Paper No. 23, 21 p. (2024). MSC: 90C40 47J10 93E99 × Cite Format Result Cite Review PDF Full Text: DOI
Brafman, Ronen I.; De Giacomo, Giuseppe Regular decision processes. (English) Zbl 07875643 Artif. Intell. 331, Article ID 104113, 17 p. (2024). MSC: 68Qxx × Cite Format Result Cite Review PDF Full Text: DOI
Piribauer, Jakob; Baier, Christel Positivity-hardness results on Markov decision processes. (English) Zbl 07875513 TheoretiCS 3, Paper No. 9, 47 p. (2024). MSC: 68-XX × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Zhang, Haotian; Sun, Jianyong; Bäck, Thomas; Xu, Zongben Learning to select the recombination operator for derivative-free optimization. (English) Zbl 07873883 Sci. China, Math. 67, No. 6, 1457-1480 (2024). MSC: 68T05 68W01 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Xia, Tian; Liu, Jia; Chen, Zhiping A dynamical neural network approach for distributionally robust chance-constrained Markov decision process. (English) Zbl 07873880 Sci. China, Math. 67, No. 6, 1395-1418 (2024). MSC: 90C40 68T07 90C15 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Jourak, M.; Nezhadhosein, S.; Rahpeymaii, F. A new self-scaling memoryless quasi-Newton update for unconstrained optimization. (English) Zbl 07873851 4OR 22, No. 2, 235-252 (2024). MSC: 90C34 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Pitera, Marcin; Stettner, Łukasz Existence of bounded solutions to multiplicative Poisson equations under mixing property. (English) Zbl 1548.90482 ESAIM, Control Optim. Calc. Var. 30, Paper No. 49, 30 p. (2024). MSC: 90C40 93E20 60J35 93C55 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Moug, Kati; Shen, Siqian The costs of overcrowding (and release): strategic discharges for isolated facilities during epidemiological outbreaks. (English) Zbl 07870676 Comput. Oper. Res. 165, Article ID 106578, 13 p. (2024). MSC: 90Bxx × Cite Format Result Cite Review PDF Full Text: DOI
Tuncay, Gamze; Kaya, Kıymet; Yılmaz, Yaren; Yaslan, Yusuf; Gündüz Öğüdücü, Şule A reinforcement learning based dynamic room pricing model for hotel industry. (English) Zbl 1544.91145 INFOR: Inf. Syst. Oper. Res. 62, No. 2, 211-231 (2024). MSC: 91B24 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Kivanç, İpek; Fecarotti, Claudia; Raassens, Néomie; van Houtum, Geert-Jan A scalable multi-objective maintenance optimization model for systems with multiple heterogeneous components and a finite lifespan. (English) Zbl 07864424 Eur. J. Oper. Res. 315, No. 2, 567-579 (2024). MSC: 90Bxx × Cite Format Result Cite Review PDF Full Text: DOI
Satic, U.; Jacko, P.; Kirkbride, C. A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem. (English) Zbl 07864416 Eur. J. Oper. Res. 315, No. 2, 454-469 (2024). MSC: 90Bxx × Cite Format Result Cite Review PDF Full Text: DOI
López-Barrientos, José Daniel; Mendoza-Madrid, José Manuel; Gonzáles-Vega, Paola Friné An abelian theorem for a Markov decision process in a system of interacting objects with unknown random disturbance law. (English) Zbl 1541.93385 Pure Appl. Funct. Anal. 9, No. 3, 763-782 (2024). MSC: 93E20 93C55 90C40 46B09 × Cite Format Result Cite Review PDF Full Text: Link
Guo, Xin Risk-sensitive discounted Markov decision processes with unbounded reward functions and Borel spaces. (English) Zbl 1548.90480 Stochastics 96, No. 1, 649-666 (2024). MSC: 90C40 90B05 60J05 91B06 × Cite Format Result Cite Review PDF Full Text: DOI
Park, Mingyu; Shin, Jaeuk; Yang, Insoon Anderson acceleration for partially observable Markov decision processes: a maximum entropy approach. (English) Zbl 1547.90224 Automatica 163, Article ID 111557, 11 p. (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Jin, Jiangliang; Xu, Yunjian Optimal differentiated threshold characterization for multi-task stochastic deadline scheduling with queuing. (English) Zbl 1543.90123 Automatica 163, Article ID 111545, 12 p. (2024). MSC: 90B36 90B22 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Mou, Wenlong; Pananjady, Ashwin; Wainwright, Martin J.; Bartlett, Peter L. Optimal and instance-dependent guarantees for Markovian linear stochastic approximation. (English) Zbl 07854615 Math. Stat. Learn. 7, No. 1-2, 41-153 (2024). MSC: 62L20 60J22 62C20 62M05 93E35 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Andronov, Alexander; Mahareva, Kristina Continuous-time Markov-modulated chains in operations research. (English) Zbl 1537.90001 Singapore: World Scientific (ISBN 978-981-12-8615-5/hbk; 978-981-12-8617-9/ebook). xvi, 210 p. (2024). MSC: 90-01 90C40 90Bxx 60Gxx × Cite Format Result Cite Review PDF Full Text: DOI
Zheng, Yi; Julaiti, Juxihong; Pang, Guodong Adaptive service rate control of an \(M/M/1\) queue with server breakdowns. (English) Zbl 1537.60124 Queueing Syst. 106, No. 1-2, 159-191 (2024). MSC: 60K25 90C40 93E20 93E35 × Cite Format Result Cite Review PDF Full Text: DOI
Bäuerle, Nicole; Jaśkiewicz, Anna Markov decision processes with risk-sensitive criteria: an overview. (English) Zbl 1546.90257 Math. Methods Oper. Res. 99, No. 1-2, 141-178 (2024). MSC: 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Gupta, Piyush; Srivastava, Vaibhav Structural properties of optimal fidelity selection policies for human-in-the-loop queues. (English) Zbl 1540.90075 Automatica 159, Article ID 111388, 9 p. (2024). MSC: 90B22 90C40 60K25 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Anderson, Robert M.; Duanmu, Haosui; Ghosh, Aniruddha; Khan, M. Ali On existence of Berk-Nash equilibria in misspecified Markov decision processes with infinite spaces. (English) Zbl 1539.91089 J. Econ. Theory 217, Article ID 105813, 30 p. (2024). MSC: 91B99 90C40 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Piunovskiy, Alexey; Zhang, Yi On the continuity of the projection mapping from strategic measures to occupation measures in absorbing Markov decision processes. (English) Zbl 1539.90134 Appl. Math. Optim. 89, No. 3, Paper No. 58, 25 p. (2024). MSC: 90C40 93E20 × Cite Format Result Cite Review PDF Full Text: DOI arXiv
Ramani, Sivaramakrishnan; Ghate, Archis A family of \(s\)-rectangular robust MDPs: relative conservativeness, asymptotic analyses, and finite-sample properties. (English) Zbl 1545.90197 SIAM J. Optim. 34, No. 2, 1540-1568 (2024). MSC: 90C39 90C40 90C17 × Cite Format Result Cite Review PDF Full Text: DOI
Forootani, Ali; Iervolino, Raffaele; Tipaldi, Massimo; Baccari, Silvio A kernel-based approximate dynamic programming approach: theory and application. (English) Zbl 1545.90196 Automatica 162, Article ID 111517, 9 p. (2024). MSC: 90C39 90C40 × Cite Format Result Cite Review PDF Full Text: DOI
Heinold, Arne A tutorial on value function approximation for stochastic and dynamic transportation. (English) Zbl 1544.90202 4OR 22, No. 1, 145-173 (2024). MSC: 90C39 90C40 90C59 × Cite Format Result Cite Review PDF Full Text: DOI