Dressler, Mareike; Garrote-López, Marina; Montúfar, Guido; Müller, Johannes; Rose, Kemal Algebraic optimization of sequential decision problems. (English) Zbl 07740057 J. Symb. Comput. 121, Article ID 102241, 19 p. (2024). MSC: 62R01 90C23 90C40 PDF BibTeX XML Cite \textit{M. Dressler} et al., J. Symb. Comput. 121, Article ID 102241, 19 p. (2024; Zbl 07740057) Full Text: DOI arXiv
Thangeda, Pranay; Ornik, Melkior; Topcu, Ufuk Expedited online learning with spatial side information. (English) Zbl 07743764 IEEE Trans. Autom. Control 68, No. 3, 1479-1491 (2023). MSC: 93-XX PDF BibTeX XML Cite \textit{P. Thangeda} et al., IEEE Trans. Autom. Control 68, No. 3, 1479--1491 (2023; Zbl 07743764) Full Text: DOI
Hmedi, Hassan; Carroll, Johnson; Arapostathis, Ari Optimal sensor scheduling under intermittent observations subject to network dynamics. (English) Zbl 07743759 IEEE Trans. Autom. Control 68, No. 3, 1399-1414 (2023). MSC: 93-XX PDF BibTeX XML Cite \textit{H. Hmedi} et al., IEEE Trans. Autom. Control 68, No. 3, 1399--1414 (2023; Zbl 07743759) Full Text: DOI
Pan, Lipeng; Deng, Yong; Cheong, Kang Hao Dynamical Markov decision-making model based on mass function to quantitatively predict interference effects. (English) Zbl 07741607 Inf. Sci. 648, Article ID 119482, 17 p. (2023). MSC: 91B06 90C40 91-05 PDF BibTeX XML Cite \textit{L. Pan} et al., Inf. Sci. 648, Article ID 119482, 17 p. (2023; Zbl 07741607) Full Text: DOI
Bayraktar, Erhan; Chen, Tao Nonparametric adaptive robust control under model uncertainty. (English) Zbl 07738691 SIAM J. Control Optim. 61, No. 5, 2737-2760 (2023). MSC: 49J55 60J99 60J10 49L20 93E20 93E35 60G15 65K05 90C39 90C40 91G10 91G60 62G05 PDF BibTeX XML Cite \textit{E. Bayraktar} and \textit{T. Chen}, SIAM J. Control Optim. 61, No. 5, 2737--2760 (2023; Zbl 07738691) Full Text: DOI arXiv
Portillo-Ramírez, Gustavo; Cavazos-Cadena, Rolando; Cruz-Suárez, Hugo Contractive approximations in average Markov decision chains driven by a risk-seeking controller. (English) Zbl 07735004 Math. Methods Oper. Res. 98, No. 1, 75-91 (2023). MSC: 93E20 90C40 PDF BibTeX XML Cite \textit{G. Portillo-Ramírez} et al., Math. Methods Oper. Res. 98, No. 1, 75--91 (2023; Zbl 07735004) Full Text: DOI
Lan, Guanghui; Li, Yan; Zhao, Tuo Block policy mirror descent. (English) Zbl 07734884 SIAM J. Optim. 33, No. 3, 2341-2378 (2023). MSC: 90C40 90C15 90C26 68Q25 PDF BibTeX XML Cite \textit{G. Lan} et al., SIAM J. Optim. 33, No. 3, 2341--2378 (2023; Zbl 07734884) Full Text: DOI arXiv
Costa, O. L. V.; Dufour, F. Adaptive discounted control for piecewise deterministic Markov processes. (English) Zbl 07732429 J. Math. Anal. Appl. 528, No. 2, Article ID 127517, 23 p. (2023). MSC: 90Cxx 60Jxx 93Exx PDF BibTeX XML Cite \textit{O. L. V. Costa} and \textit{F. Dufour}, J. Math. Anal. Appl. 528, No. 2, Article ID 127517, 23 p. (2023; Zbl 07732429) Full Text: DOI
Bayraktar, Erhan; Kara, Ali Devran Approximate Q learning for controlled diffusion processes and its near optimality. (English) Zbl 07732335 SIAM J. Math. Data Sci. 5, No. 3, 615-638 (2023). MSC: 93E35 90C40 93E20 60J60 PDF BibTeX XML Cite \textit{E. Bayraktar} and \textit{A. D. Kara}, SIAM J. Math. Data Sci. 5, No. 3, 615--638 (2023; Zbl 07732335) Full Text: DOI arXiv
Da Costa, Lancelot; Sajid, Noor; Parr, Thomas; Friston, Karl; Smith, Ryan Reward maximization through discrete active inference. (English) Zbl 07732303 Neural Comput. 35, No. 5, 807-852 (2023). MSC: 91E10 91E40 90C40 PDF BibTeX XML Cite \textit{L. Da Costa} et al., Neural Comput. 35, No. 5, 807--852 (2023; Zbl 07732303) Full Text: DOI arXiv
Hasanbeig, Hosein; Kroening, Daniel; Abate, Alessandro Certified reinforcement learning with logic guidance. (English) Zbl 07732224 Artif. Intell. 322, Article ID 103949, 22 p. (2023). MSC: 68Txx PDF BibTeX XML Cite \textit{H. Hasanbeig} et al., Artif. Intell. 322, Article ID 103949, 22 p. (2023; Zbl 07732224) Full Text: DOI arXiv
Tanhao, Huang; Siqi, Jian; Jinwen, Chen; Yanan, Dai Modeling and control of data transmission. (English) Zbl 07727790 Methodol. Comput. Appl. Probab. 25, No. 3, Paper No. 74, 18 p. (2023). Reviewer: Pavel Stoynov (Sofia) MSC: 60F99 90C40 93E99 37A50 PDF BibTeX XML Cite \textit{H. Tanhao} et al., Methodol. Comput. Appl. Probab. 25, No. 3, Paper No. 74, 18 p. (2023; Zbl 07727790) Full Text: DOI
Rogers, L. C. G. The Bruss-Robertson-Steele inequality. (English) Zbl 07727568 J. Appl. Probab. 60, No. 3, 1112-1114 (2023). Reviewer: Yilun Shang (Newcastle upon Tyne) MSC: 60C05 60G40 90C05 90C40 90C27 PDF BibTeX XML Cite \textit{L. C. G. Rogers}, J. Appl. Probab. 60, No. 3, 1112--1114 (2023; Zbl 07727568) Full Text: DOI
Ghate, Archis Dual ascent and primal-dual algorithms for infinite-horizon nonstationary Markov decision processes. (English) Zbl 07725746 SIAM J. Optim. 33, No. 3, 1391-1415 (2023). MSC: 90C39 90C40 PDF BibTeX XML Cite \textit{A. Ghate}, SIAM J. Optim. 33, No. 3, 1391--1415 (2023; Zbl 07725746) Full Text: DOI
Li, Gen; Wei, Yuting; Chi, Yuejie; Chen, Yuxin Softmax policy gradient methods can take exponential time to converge. (English) Zbl 07720818 Math. Program. 201, No. 1-2 (A), 707-802 (2023). MSC: 60J10 90C30 90C40 68T05 PDF BibTeX XML Cite \textit{G. Li} et al., Math. Program. 201, No. 1--2 (A), 707--802 (2023; Zbl 07720818) Full Text: DOI arXiv
Hibbard, Michael; Tanaka, Takashi; Topcu, Ufuk Simultaneous perception-action design via invariant finite belief sets. (English) Zbl 07720563 Automatica 155, Article ID 111140, 15 p. (2023). MSC: 93C85 93E03 90C40 PDF BibTeX XML Cite \textit{M. Hibbard} et al., Automatica 155, Article ID 111140, 15 p. (2023; Zbl 07720563) Full Text: DOI arXiv
Gast, Nicolas; Gaujal, Bruno; Yan, Chen Exponential asymptotic optimality of Whittle index policy. (English) Zbl 1517.90155 Queueing Syst. 104, No. 1-2, 107-150 (2023). MSC: 90C40 90B18 90C05 91A60 PDF BibTeX XML Cite \textit{N. Gast} et al., Queueing Syst. 104, No. 1--2, 107--150 (2023; Zbl 1517.90155) Full Text: DOI
Wedel, Michel; Pieters, Rik; van der Lans, Ralf Modeling eye movements during decision making: a review. (English) Zbl 1516.62088 Psychometrika 88, No. 2, 697-729 (2023). MSC: 62P15 62M05 62P20 91E30 PDF BibTeX XML Cite \textit{M. Wedel} et al., Psychometrika 88, No. 2, 697--729 (2023; Zbl 1516.62088) Full Text: DOI
Palmborg, Lina; Lindskog, Filip Premium control with reinforcement learning. (English) Zbl 07712476 ASTIN Bull. 53, No. 2, 233-257 (2023). MSC: 91G05 90C40 68T05 PDF BibTeX XML Cite \textit{L. Palmborg} and \textit{F. Lindskog}, ASTIN Bull. 53, No. 2, 233--257 (2023; Zbl 07712476) Full Text: DOI
Lesage-Landry, Antoine; Callaway, Duncan S. Approximated multi-agent fitted Q iteration. (English) Zbl 07712469 Syst. Control Lett. 177, Article ID 105563, 10 p. (2023). MSC: 93A16 93E03 90C39 90C40 PDF BibTeX XML Cite \textit{A. Lesage-Landry} and \textit{D. S. Callaway}, Syst. Control Lett. 177, Article ID 105563, 10 p. (2023; Zbl 07712469) Full Text: DOI arXiv
Kumar, Uday M.; Bhat, Sanjay P.; Kavitha, Veeraruna; Hemachandra, Nandyala Approximate solutions to constrained risk-sensitive Markov decision processes. (English) Zbl 07709815 Eur. J. Oper. Res. 310, No. 1, 249-267 (2023). MSC: 90Bxx PDF BibTeX XML Cite \textit{U. M. Kumar} et al., Eur. J. Oper. Res. 310, No. 1, 249--267 (2023; Zbl 07709815) Full Text: DOI arXiv
Krishnamurthy, Vikram Interval dominance based structural results for Markov decision process. (English) Zbl 07707478 Automatica 153, Article ID 111024, 8 p. (2023). MSC: 90C40 PDF BibTeX XML Cite \textit{V. Krishnamurthy}, Automatica 153, Article ID 111024, 8 p. (2023; Zbl 07707478) Full Text: DOI arXiv
Soroush, Hamed On nonsmooth multiobjective semi-infinite programming problems with mixed constraints. (English) Zbl 07707335 J. Math. Ext. 17, No. 1, Paper No. 1, 16 p. (2023). MSC: 90C34 90C40 49J52 PDF BibTeX XML Cite \textit{H. Soroush}, J. Math. Ext. 17, No. 1, Paper No. 1, 16 p. (2023; Zbl 07707335) Full Text: DOI
Zhang, Jian; Luo, Kelin; Florio, Alexandre M.; Van Woensel, Tom Solving large-scale dynamic vehicle routing problems with stochastic requests. (English) Zbl 07705414 Eur. J. Oper. Res. 306, No. 2, 596-614 (2023). MSC: 90Bxx PDF BibTeX XML Cite \textit{J. Zhang} et al., Eur. J. Oper. Res. 306, No. 2, 596--614 (2023; Zbl 07705414) Full Text: DOI arXiv
Stanković, Miloš S.; Beko, Marko; Stanković, Srdjan S. Distributed consensus-based multi-agent temporal-difference learning. (English) Zbl 07705252 Automatica 151, Article ID 110922, 11 p. (2023). MSC: 93D50 93A16 93A14 90C40 PDF BibTeX XML Cite \textit{M. S. Stanković} et al., Automatica 151, Article ID 110922, 11 p. (2023; Zbl 07705252) Full Text: DOI
Palopoli, Luigi; Fontanelli, Daniele; Frego, Marco; Roveri, Marco A Markovian model for the spread of the SARS-CoV-2 virus. (English) Zbl 07705251 Automatica 151, Article ID 110921, 13 p. (2023). Reviewer: Jiaying Zhou (Shenzhen) MSC: 92D30 90C40 PDF BibTeX XML Cite \textit{L. Palopoli} et al., Automatica 151, Article ID 110921, 13 p. (2023; Zbl 07705251) Full Text: DOI arXiv
Malikopoulos, Andreas A. Separation of learning and control for cyber-physical systems. (English) Zbl 07705244 Automatica 151, Article ID 110912, 13 p. (2023). MSC: 93B70 93C83 93E20 93E35 90C40 PDF BibTeX XML Cite \textit{A. A. Malikopoulos}, Automatica 151, Article ID 110912, 13 p. (2023; Zbl 07705244) Full Text: DOI arXiv
Li, Sarah H. Q.; Yu, Yue; Miguel, Nicolas I.; Calderone, Dan; Ratliff, Lillian J.; Açıkmeşe, Behçet Adaptive constraint satisfaction for Markov decision process congestion games: application to transportation networks. (English) Zbl 07705221 Automatica 151, Article ID 110879, 8 p. (2023). MSC: 91A14 91A15 90C40 90B06 PDF BibTeX XML Cite \textit{S. H. Q. Li} et al., Automatica 151, Article ID 110879, 8 p. (2023; Zbl 07705221) Full Text: DOI arXiv
Xia, Li; Guo, Xianping; Cao, Xi-Ren A note on the existence of optimal stationary policies for average Markov decision processes with countable states. (English) Zbl 07705219 Automatica 151, Article ID 110877, 8 p. (2023). MSC: 90C40 PDF BibTeX XML Cite \textit{L. Xia} et al., Automatica 151, Article ID 110877, 8 p. (2023; Zbl 07705219) Full Text: DOI arXiv
Dai, Yanan; Chen, Jinwen Duality between large deviation control and risk-sensitive control for Markov decision processes. (English) Zbl 07702846 Syst. Control Lett. 174, Article ID 105490, 13 p. (2023). MSC: 90C40 PDF BibTeX XML Cite \textit{Y. Dai} and \textit{J. Chen}, Syst. Control Lett. 174, Article ID 105490, 13 p. (2023; Zbl 07702846) Full Text: DOI
Li, Yongfeng; Zhao, Mingming; Chen, Weijie; Wen, Zaiwen A stochastic composite augmented Lagrangian method for reinforcement learning. (English) Zbl 07702805 SIAM J. Optim. 33, No. 2, 921-949 (2023). MSC: 90C05 90C15 90C26 90C40 93E20 PDF BibTeX XML Cite \textit{Y. Li} et al., SIAM J. Optim. 33, No. 2, 921--949 (2023; Zbl 07702805) Full Text: DOI arXiv
Bossens, David M.; Bishop, Nicholas Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time. (English) Zbl 07702687 Mach. Learn. 112, No. 3, 817-858 (2023). MSC: 68T05 PDF BibTeX XML Cite \textit{D. M. Bossens} and \textit{N. Bishop}, Mach. Learn. 112, No. 3, 817--858 (2023; Zbl 07702687) Full Text: DOI arXiv
Varagapriya, V.; Singh, Vikas Vikram; Lisser, Abdel Joint chance-constrained Markov decision processes. (English) Zbl 1514.90243 Ann. Oper. Res. 322, No. 2, 1013-1035 (2023). MSC: 90C40 90C15 90C39 90C05 90C47 PDF BibTeX XML Cite \textit{V. Varagapriya} et al., Ann. Oper. Res. 322, No. 2, 1013--1035 (2023; Zbl 1514.90243) Full Text: DOI
Ren, Xingyu; Fu, Michael C.; Marcus, Steven I. Stochastic control for organ donations: a review. (English) Zbl 07701272 Syst. Control Lett. 173, Article ID 105476, 9 p. (2023). MSC: 93E20 90C40 92C50 PDF BibTeX XML Cite \textit{X. Ren} et al., Syst. Control Lett. 173, Article ID 105476, 9 p. (2023; Zbl 07701272) Full Text: DOI
Feinberg, Eugene A.; Kasyanov, Pavlo O. Equivalent conditions for weak continuity of nonlinear filters. (English) Zbl 1516.93265 Syst. Control Lett. 173, Article ID 105458, 7 p. (2023). MSC: 93E11 93C10 90C40 PDF BibTeX XML Cite \textit{E. A. Feinberg} and \textit{P. O. Kasyanov}, Syst. Control Lett. 173, Article ID 105458, 7 p. (2023; Zbl 1516.93265) Full Text: DOI arXiv
Wu, Chengwei; Pan, Wei; Staa, Rick; Liu, Jianxing; Sun, Guanghui; Wu, Ligang Deep reinforcement learning control approach to mitigating actuator attacks. (English) Zbl 07701246 Automatica 152, Article ID 110999, 12 p. (2023). MSC: 93B70 93C83 93D05 90C40 PDF BibTeX XML Cite \textit{C. Wu} et al., Automatica 152, Article ID 110999, 12 p. (2023; Zbl 07701246) Full Text: DOI
Anahtarci, Berkay; Kariksiz, Can Deha; Saldi, Naci Q-learning in regularized mean-field games. (English) Zbl 07699152 Dyn. Games Appl. 13, No. 1, 89-117 (2023). MSC: 91A16 91A26 90C40 PDF BibTeX XML Cite \textit{B. Anahtarci} et al., Dyn. Games Appl. 13, No. 1, 89--117 (2023; Zbl 07699152) Full Text: DOI arXiv
Subramanian, Jayakumar; Sinha, Amit; Mahajan, Aditya Robustness and sample complexity of model-based MARL for general-sum Markov games. (English) Zbl 07699151 Dyn. Games Appl. 13, No. 1, 56-88 (2023). MSC: 91A15 91A68 90C40 PDF BibTeX XML Cite \textit{J. Subramanian} et al., Dyn. Games Appl. 13, No. 1, 56--88 (2023; Zbl 07699151) Full Text: DOI arXiv
Saavedra Sueldo, Carolina; Perez Colo, Ivo; De Paula, Mariano; Villar, Sebastián A.; Acosta, Gerardo G. ROS-based architecture for fast digital twin development of smart manufacturing robotized systems. (English) Zbl 07698113 Ann. Oper. Res. 322, No. 1, 75-99 (2023). MSC: 68M99 68T05 90C40 60J05 60K20 PDF BibTeX XML Cite \textit{C. Saavedra Sueldo} et al., Ann. Oper. Res. 322, No. 1, 75--99 (2023; Zbl 07698113) Full Text: DOI
Malladi, Satya S.; Erera, Alan L.; White, Chelsea C. III Inventory control with modulated demand and a partially observed modulation process. (English) Zbl 1517.90005 Ann. Oper. Res. 321, No. 1-2, 343-369 (2023). MSC: 90B05 90C40 PDF BibTeX XML Cite \textit{S. S. Malladi} et al., Ann. Oper. Res. 321, No. 1--2, 343--369 (2023; Zbl 1517.90005) Full Text: DOI arXiv
Velasquez, Alvaro; Alkhouri, Ismail; Subramani, K.; Wojciechowski, Piotr; Atia, George Optimal deterministic controller synthesis from steady-state distributions. (English) Zbl 07695710 J. Autom. Reasoning 67, No. 1, Paper No. 7, 26 p. (2023). MSC: 68V15 PDF BibTeX XML Cite \textit{A. Velasquez} et al., J. Autom. Reasoning 67, No. 1, Paper No. 7, 26 p. (2023; Zbl 07695710) Full Text: DOI
Piunovskiy, A. B. Turnpikes in finite Markov decision processes and random walk. (English) Zbl 07683798 Theory Probab. Appl. 68, No. 1, 123-149 (2023) and Teor. Veroyatn. Primen. 68, No. 1, 147-176 (2023). MSC: 90C40 90C27 90C39 90C59 PDF BibTeX XML Cite \textit{A. B. Piunovskiy}, Theory Probab. Appl. 68, No. 1, 123--149 (2023; Zbl 07683798) Full Text: DOI
Zhu, Yuhua; Ying, Lexing Variational actor-critic algorithms. (English) Zbl 07683230 ESAIM, Control Optim. Calc. Var. 29, Paper No. 20, 26 p. (2023). MSC: 90C40 93E20 PDF BibTeX XML Cite \textit{Y. Zhu} and \textit{L. Ying}, ESAIM, Control Optim. Calc. Var. 29, Paper No. 20, 26 p. (2023; Zbl 07683230) Full Text: DOI arXiv
Disser, Yann; Friedmann, Oliver; Hopp, Alexander V. An exponential lower bound for Zadeh’s pivot rule. (English) Zbl 07681268 Math. Program. 199, No. 1-2 (A), 865-936 (2023). MSC: 68Q25 90C05 90C40 PDF BibTeX XML Cite \textit{Y. Disser} et al., Math. Program. 199, No. 1--2 (A), 865--936 (2023; Zbl 07681268) Full Text: DOI arXiv
Bandyopadhyay, Susmita Decision support system. Tools and techniques. (English) Zbl 07681229 Boca Raton, FL: CRC Press (ISBN 978-1-032-30992-7/hbk; 978-1-032-31022-0/pbk; 978-1-003-30765-5/ebook). (2023). MSC: 91-02 91B06 90B50 03B52 60J20 PDF BibTeX XML Full Text: DOI
Bäuerle, Nicole Mean field Markov decision processes. (English) Zbl 1517.90153 Appl. Math. Optim. 88, No. 1, Paper No. 12, 36 p. (2023). Reviewer: Wiesław Kotarski (Sosnowiec) MSC: 90C40 49L20 PDF BibTeX XML Cite \textit{N. Bäuerle}, Appl. Math. Optim. 88, No. 1, Paper No. 12, 36 p. (2023; Zbl 1517.90153) Full Text: DOI arXiv
Jonsson, Adam An axiomatic approach to Markov decision processes. (English) Zbl 07678824 Math. Methods Oper. Res. 97, No. 1, 117-133 (2023). Reviewer: Romeo Negrea (Timişoara) MSC: 60J20 62C99 90C39 PDF BibTeX XML Cite \textit{A. Jonsson}, Math. Methods Oper. Res. 97, No. 1, 117--133 (2023; Zbl 07678824) Full Text: DOI arXiv
Kosmala, Tomasz; Martyr, Randall; Moriarty, John Markov risk mappings and risk-sensitive optimal prediction. (English) Zbl 07678823 Math. Methods Oper. Res. 97, No. 1, 91-116 (2023). MSC: 60G40 91B08 91B06 90C40 PDF BibTeX XML Cite \textit{T. Kosmala} et al., Math. Methods Oper. Res. 97, No. 1, 91--116 (2023; Zbl 07678823) Full Text: DOI arXiv
Darendeliler, Alp; Claeys, Dieter; Aghezzaf, El-Houssaine Integrated condition-based maintenance and multi-item lot-sizing with stochastic demand. (English) Zbl 07677917 J. Ind. Manag. Optim. 19, No. 9, 6908-6947 (2023). MSC: 91B70 60J20 PDF BibTeX XML Cite \textit{A. Darendeliler} et al., J. Ind. Manag. Optim. 19, No. 9, 6908--6947 (2023; Zbl 07677917) Full Text: DOI
Cai, Xiaoqiang; Wu, Xianyi; Zhou, Xian Optimal schedule of elective surgery operations subject to disruptions by emergencies. (English) Zbl 07677916 J. Ind. Manag. Optim. 19, No. 9, 6886-6907 (2023). MSC: 90B36 90C40 PDF BibTeX XML Cite \textit{X. Cai} et al., J. Ind. Manag. Optim. 19, No. 9, 6886--6907 (2023; Zbl 07677916) Full Text: DOI
Cruz-Suárez, Hugo; Montes-de-Oca, Raúl; Ortega-Gutiérrez, R. Israel An extended version of average Markov decision processes on discrete spaces under fuzzy environment. (English) Zbl 07675647 Kybernetika 59, No. 1, 160-178 (2023). MSC: 90C40 93C40 PDF BibTeX XML Cite \textit{H. Cruz-Suárez} et al., Kybernetika 59, No. 1, 160--178 (2023; Zbl 07675647) Full Text: DOI
Wei, Qingda; Chen, Xian Continuous-time Markov decision processes under the risk-sensitive first passage discounted cost criterion. (English) Zbl 1517.90156 J. Optim. Theory Appl. 197, No. 1, 309-333 (2023). MSC: 90C40 60J27 PDF BibTeX XML Cite \textit{Q. Wei} and \textit{X. Chen}, J. Optim. Theory Appl. 197, No. 1, 309--333 (2023; Zbl 1517.90156) Full Text: DOI
Li, Quan-Lin; Li, Yi-Meng; Ma, Jing-Yu; Liu, Heng-Li A complete algebraic solution to the optimal dynamic rationing policy in the stock-rationing queue with two demand classes. (English) Zbl 1517.90025 J. Comb. Optim. 45, No. 3, Paper No. 83, 54 p. (2023). MSC: 90B22 90B05 90C40 PDF BibTeX XML Cite \textit{Q.-L. Li} et al., J. Comb. Optim. 45, No. 3, Paper No. 83, 54 p. (2023; Zbl 1517.90025) Full Text: DOI arXiv
Fujita, Toshiharu Converging Markov decision processes with multiplicative reward system. (English) Zbl 1516.90112 Bull. Kyushu Inst. Technol., Pure Appl. Math. 70, 33-42 (2023). MSC: 90C40 PDF BibTeX XML Cite \textit{T. Fujita}, Bull. Kyushu Inst. Technol., Pure Appl. Math. 70, 33--42 (2023; Zbl 1516.90112) Full Text: DOI
Chen, Xian; Wei, Qingda Risk-sensitive average optimality for discrete-time Markov decision processes. (English) Zbl 1517.90154 SIAM J. Control Optim. 61, No. 1, 72-104 (2023). Reviewer: Savin Treanţă (Bucureşti) MSC: 90C40 60J10 PDF BibTeX XML Cite \textit{X. Chen} and \textit{Q. Wei}, SIAM J. Control Optim. 61, No. 1, 72--104 (2023; Zbl 1517.90154) Full Text: DOI
Alaouchiche, Yasmine; Ouazene, Yassine; Yalaoui, Farouk A fast and efficient analytical method for throughput evaluation of unreliable series-parallel production lines. (English) Zbl 07669001 J. Ind. Manag. Optim. 19, No. 8, 6082-6103 (2023). MSC: 90-10 90B30 90B25 90C30 90C40 90C90 PDF BibTeX XML Cite \textit{Y. Alaouchiche} et al., J. Ind. Manag. Optim. 19, No. 8, 6082--6103 (2023; Zbl 07669001) Full Text: DOI
Mayr, Richard; Munday, Eric Strategy complexity of point payoff, mean payoff and total payoff objectives in countable MDPs. (English) Zbl 07667089 Log. Methods Comput. Sci. 19, No. 1, Paper No. 16, 43 p. (2023). MSC: 03B70 68-XX PDF BibTeX XML Cite \textit{R. Mayr} and \textit{E. Munday}, Log. Methods Comput. Sci. 19, No. 1, Paper No. 16, 43 p. (2023; Zbl 07667089) Full Text: DOI arXiv
Badings, Thom; Romao, Licio; Abate, Alessandro; Parker, David; Poonawala, Hasan A.; Stoelinga, Marielle; Jansen, Nils Robust control for dynamical systems with non-Gaussian noise via formal abstractions. (English) Zbl 1508.93279 J. Artif. Intell. Res. (JAIR) 76, 341-391 (2023). MSC: 93E03 93B35 90C40 PDF BibTeX XML Cite \textit{T. Badings} et al., J. Artif. Intell. Res. (JAIR) 76, 341--391 (2023; Zbl 1508.93279) Full Text: DOI arXiv
Hernández-Bustos, Diego; Hernández-Hernández, Daniel Portfolio management under drawdown constraint in discrete-time financial markets. (English) Zbl 1508.91503 J. Appl. Probab. 60, No. 1, 127-147 (2023). MSC: 91G10 90C40 PDF BibTeX XML Cite \textit{D. Hernández-Bustos} and \textit{D. Hernández-Hernández}, J. Appl. Probab. 60, No. 1, 127--147 (2023; Zbl 1508.91503) Full Text: DOI
Golui, Subrata; Pal, Chandan Continuous-time zero-sum games for Markov decision processes with discounted risk-sensitive cost criterion on a general state space. (English) Zbl 07661019 Stochastic Anal. Appl. 41, No. 2, 327-357 (2023). Reviewer: Catherine Rainer (Brest) MSC: 91A15 91A10 90C40 PDF BibTeX XML Cite \textit{S. Golui} and \textit{C. Pal}, Stochastic Anal. Appl. 41, No. 2, 327--357 (2023; Zbl 07661019) Full Text: DOI
Lan, Guanghui Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes. (English) Zbl 1512.90150 Math. Program. 198, No. 1 (A), 1059-1106 (2023). MSC: 90C15 90C30 90C40 PDF BibTeX XML Cite \textit{G. Lan}, Math. Program. 198, No. 1 (A), 1059--1106 (2023; Zbl 1512.90150) Full Text: DOI arXiv
Hernández-Lerma, Onésimo; Laura-Guarachi, Leonardo R.; Mendoza-Palacios, Saul A survey of average cost problems in deterministic discrete-time control systems. (English) Zbl 1514.90241 J. Math. Anal. Appl. 522, No. 1, Article ID 126906, 24 p. (2023). Reviewer: Tullio Zolezzi (Genova) MSC: 90C39 90C40 93C55 PDF BibTeX XML Cite \textit{O. Hernández-Lerma} et al., J. Math. Anal. Appl. 522, No. 1, Article ID 126906, 24 p. (2023; Zbl 1514.90241) Full Text: DOI
Bhabak, Arnab; Saha, Subhamay Zero and non-zero sum risk-sensitive semi-Markov games. (English) Zbl 1512.91011 Stochastic Anal. Appl. 41, No. 1, 134-151 (2023). MSC: 91A15 91A10 90C40 PDF BibTeX XML Cite \textit{A. Bhabak} and \textit{S. Saha}, Stochastic Anal. Appl. 41, No. 1, 134--151 (2023; Zbl 1512.91011) Full Text: DOI arXiv
Srivastava, Amber; Salapaka, Srinivasa M. Dynamic parameters in sequential decision making. (English) Zbl 1507.91061 Automatica 148, Article ID 110795, 8 p. (2023). MSC: 91B06 90C40 PDF BibTeX XML Cite \textit{A. Srivastava} and \textit{S. M. Salapaka}, Automatica 148, Article ID 110795, 8 p. (2023; Zbl 1507.91061) Full Text: DOI arXiv
Bi, Shujun; Yin, Zhen; Weng, Yihong A low-rank spectral method for learning Markov models. (English) Zbl 1511.90416 Optim. Lett. 17, No. 1, 143-162 (2023). MSC: 90C40 PDF BibTeX XML Cite \textit{S. Bi} et al., Optim. Lett. 17, No. 1, 143--162 (2023; Zbl 1511.90416) Full Text: DOI
Liu, Depeng; Wang, Bow-Yaw; Fu, Chen; Zhang, Lijun Model checking differentially private properties. (English) Zbl 1512.68157 Theor. Comput. Sci. 943, 153-170 (2023). MSC: 68Q60 03B44 60J20 68P27 90C40 PDF BibTeX XML Cite \textit{D. Liu} et al., Theor. Comput. Sci. 943, 153--170 (2023; Zbl 1512.68157) Full Text: DOI
Buchholz, Peter; Dohndorf, Iryna Optimal decisions in stochastic graphs with uncorrelated and correlated edge weights. (English) Zbl 07634113 Comput. Oper. Res. 150, Article ID 106085, 19 p. (2023). MSC: 90Bxx PDF BibTeX XML Cite \textit{P. Buchholz} and \textit{I. Dohndorf}, Comput. Oper. Res. 150, Article ID 106085, 19 p. (2023; Zbl 07634113) Full Text: DOI
Della Maestra, Laetitia; Hoffmann, Marc The LAN property for McKean-Vlasov models in a mean-field regime. (English) Zbl 1504.60192 Stochastic Processes Appl. 155, 109-146 (2023). MSC: 60K35 60H10 62M05 62F12 62C20 PDF BibTeX XML Cite \textit{L. Della Maestra} and \textit{M. Hoffmann}, Stochastic Processes Appl. 155, 109--146 (2023; Zbl 1504.60192) Full Text: DOI arXiv
Khalilzadeh, Majid; Neghabi, Hossein; Ahadi, Ramin An application of approximate dynamic programming in multi-period multi-product advertising budgeting. (English) Zbl 1513.90116 J. Ind. Manag. Optim. 19, No. 1, 695-722 (2023). MSC: 90B60 90C39 90C40 91B32 PDF BibTeX XML Cite \textit{M. Khalilzadeh} et al., J. Ind. Manag. Optim. 19, No. 1, 695--722 (2023; Zbl 1513.90116) Full Text: DOI
Chapman, Margaret P.; Bonalli, Riccardo; Smith, Kevin M.; Yang, Insoon; Pavone, Marco; Tomlin, Claire J. Risk-sensitive safety analysis using conditional value-at-risk. (English) Zbl 07742148 IEEE Trans. Autom. Control 67, No. 12, 6521-6536 (2022). MSC: 93-XX PDF BibTeX XML Cite \textit{M. P. Chapman} et al., IEEE Trans. Autom. Control 67, No. 12, 6521--6536 (2022; Zbl 07742148) Full Text: DOI arXiv
Lavaei, Abolfazl; Zamani, Majid From dissipativity theory to compositional synthesis of large-scale stochastic switched systems. (English) Zbl 07740950 IEEE Trans. Autom. Control 67, No. 9, 4422-4437 (2022). MSC: 93-XX PDF BibTeX XML Cite \textit{A. Lavaei} and \textit{M. Zamani}, IEEE Trans. Autom. Control 67, No. 9, 4422--4437 (2022; Zbl 07740950) Full Text: DOI
Duan, Chaoqun; Li, Yifan; Pu, Huayan; Luo, Jun Multi-attribute Bayesian fault prediction for hidden-state systems under condition monitoring. (English) Zbl 07731770 Appl. Math. Modelling 103, 388-408 (2022). MSC: 90Bxx 90Cxx 60-XX PDF BibTeX XML Cite \textit{C. Duan} et al., Appl. Math. Modelling 103, 388--408 (2022; Zbl 07731770) Full Text: DOI
Quatmann, Tim; Junges, Sebastian; Katoen, Joost-Pieter Markov automata with multiple objectives. (English) Zbl 07704610 Form. Methods Syst. Des. 60, No. 1, 33-86 (2022). MSC: 68-XX PDF BibTeX XML Cite \textit{T. Quatmann} et al., Form. Methods Syst. Des. 60, No. 1, 33--86 (2022; Zbl 07704610) Full Text: DOI
García, Javier; Visús, Álvaro; Fernández, Fernando A taxonomy for similarity metrics between Markov decision processes. (English) Zbl 07694463 Mach. Learn. 111, No. 11, 4217-4247 (2022). MSC: 68T05 PDF BibTeX XML Cite \textit{J. García} et al., Mach. Learn. 111, No. 11, 4217--4247 (2022; Zbl 07694463) Full Text: DOI arXiv
Singh, Vinai K. A new approach to infinite decision-making process. (English) Zbl 07673545 Proc. Jangjeon Math. Soc. 25, No. 4, 415-425 (2022). MSC: 90C40 40A15 90C34 PDF BibTeX XML Cite \textit{V. K. Singh}, Proc. Jangjeon Math. Soc. 25, No. 4, 415--425 (2022; Zbl 07673545) Full Text: DOI
Singh, Vinai K. A new approach to infinite decision-making process. (English) Zbl 1516.90113 Proc. Jangjeon Math. Soc. 25, No. 3, 257-267 (2022). MSC: 90C40 40A15 90C34 PDF BibTeX XML Cite \textit{V. K. Singh}, Proc. Jangjeon Math. Soc. 25, No. 3, 257--267 (2022; Zbl 1516.90113) Full Text: DOI
Liu, Qiuli; Ching, Wai-Ki; Zhang, Junyu; Wang, Hongchu An average-value-at-risk criterion for Markov decision processes with unbounded costs. (English) Zbl 1514.90242 Front. Math. China 17, No. 4, 673-687 (2022). MSC: 90C40 93E20 PDF BibTeX XML Cite \textit{Q. Liu} et al., Front. Math. China 17, No. 4, 673--687 (2022; Zbl 1514.90242) Full Text: DOI
Sinha, Kalyan B. Sufficient statistic and Rao-Blackwell theorem in quantum probability. (English) Zbl 07647800 Infin. Dimens. Anal. Quantum Probab. Relat. Top. 25, No. 4, Article ID 2240005, 16 p. (2022). MSC: 81S25 46L53 62M05 81P50 PDF BibTeX XML Cite \textit{K. B. Sinha}, Infin. Dimens. Anal. Quantum Probab. Relat. Top. 25, No. 4, Article ID 2240005, 16 p. (2022; Zbl 07647800) Full Text: DOI
Rosenstrom, Erik; Meshkinfam, Sareh; Ivy, Julie Simmons; Goodarzi, Shadi Hassani; Capan, Muge; Huddleston, Jeanne; Romero-Brufau, Santiago Optimizing the first response to sepsis: an electronic health record-based Markov decision process model. (English) Zbl 1506.90128 Decis. Anal. 19, No. 4, 265-296 (2022). MSC: 90B50 90C40 60G40 PDF BibTeX XML Cite \textit{E. Rosenstrom} et al., Decis. Anal. 19, No. 4, 265--296 (2022; Zbl 1506.90128) Full Text: DOI
Kallus, Nathan; Uehara, Masatoshi Efficiently breaking the curse of horizon in off-policy evaluation with double reinforcement learning. (English) Zbl 1510.90285 Oper. Res. 70, No. 6, 3282-3302 (2022). MSC: 90C40 90C90 PDF BibTeX XML Cite \textit{N. Kallus} and \textit{M. Uehara}, Oper. Res. 70, No. 6, 3282--3302 (2022; Zbl 1510.90285) Full Text: DOI arXiv
Shah, Devavrat; Xie, Qiaomin; Xu, Zhi Nonasymptotic analysis of Monte Carlo tree search. (English) Zbl 1510.90286 Oper. Res. 70, No. 6, 3234-3260 (2022). MSC: 90C40 PDF BibTeX XML Cite \textit{D. Shah} et al., Oper. Res. 70, No. 6, 3234--3260 (2022; Zbl 1510.90286) Full Text: DOI arXiv
Khetarpal, Khimya; Riemer, Matthew; Rish, Irina; Precup, Doina Towards continual reinforcement learning: a review and perspectives. (English) Zbl 07639824 J. Artif. Intell. Res. (JAIR) 75, 1401-1476 (2022). MSC: 68Txx PDF BibTeX XML Cite \textit{K. Khetarpal} et al., J. Artif. Intell. Res. (JAIR) 75, 1401--1476 (2022; Zbl 07639824) Full Text: DOI arXiv
Hansen, Eric A.; Shi, Jinchuan; Kastrantas, James Strategy graphs for influence diagrams. (English) Zbl 07639819 J. Artif. Intell. Res. (JAIR) 75, 1177-1221 (2022). MSC: 68Txx PDF BibTeX XML Cite \textit{E. A. Hansen} et al., J. Artif. Intell. Res. (JAIR) 75, 1177--1221 (2022; Zbl 07639819) Full Text: DOI
Mazoure, Bogdan; Doan, Thang; Li, Tianyu; Makarenkov, Vladimir; Pineau, Joelle; Precup, Doina; Rabusseau, Guillaume Low-rank representation of reinforcement learning policies. (English) Zbl 1502.68262 J. Artif. Intell. Res. (JAIR) 75, 597-636 (2022). MSC: 68T05 90C40 PDF BibTeX XML Cite \textit{B. Mazoure} et al., J. Artif. Intell. Res. (JAIR) 75, 597--636 (2022; Zbl 1502.68262) Full Text: DOI
Ma, Xiaoteng; Ma, Shuai; Xia, Li; Zhao, Qianchuan Mean-semivariance policy optimization via risk-averse reinforcement learning. (English) Zbl 1502.68261 J. Artif. Intell. Res. (JAIR) 75, 569-595 (2022). MSC: 68T05 68T20 90C40 PDF BibTeX XML Cite \textit{X. Ma} et al., J. Artif. Intell. Res. (JAIR) 75, 569--595 (2022; Zbl 1502.68261) Full Text: DOI arXiv
Perez-Salazar, Sebastian; Singh, Mohit; Toriello, Alejandro Adaptive bin packing with overflow. (English) Zbl 07639672 Math. Oper. Res. 47, No. 4, 3317-3356 (2022). MSC: 68W25 90C27 90C39 90C40 PDF BibTeX XML Cite \textit{S. Perez-Salazar} et al., Math. Oper. Res. 47, No. 4, 3317--3356 (2022; Zbl 07639672) Full Text: DOI arXiv
Wen, Jie; Shi, Yuanhao; Pang, Xiaoqiong; Jia, Jianfang Optimal soot blowing and repair plan for boiler based on HJB equation. (English) Zbl 1510.90287 Optimization 71, No. 16, 4603-4622 (2022). MSC: 90C40 90C90 PDF BibTeX XML Cite \textit{J. Wen} et al., Optimization 71, No. 16, 4603--4622 (2022; Zbl 1510.90287) Full Text: DOI
Alkaff, Abdullah State space and binary decision diagram models for discrete standby systems with multistate components. (English) Zbl 1505.62505 Appl. Math. Modelling 110, 298-319 (2022). MSC: 62M99 60K20 90B25 PDF BibTeX XML Cite \textit{A. Alkaff}, Appl. Math. Modelling 110, 298--319 (2022; Zbl 1505.62505) Full Text: DOI
Akbarzadeh, Nima; Mahajan, Aditya Conditions for indexability of restless bandits and an \(\mathcal{O}(K^3)\) algorithm to compute Whittle index. (English) Zbl 1508.90113 Adv. Appl. Probab. 54, No. 4, 1164-1192 (2022). MSC: 90C40 90C39 49M20 91B32 PDF BibTeX XML Cite \textit{N. Akbarzadeh} and \textit{A. Mahajan}, Adv. Appl. Probab. 54, No. 4, 1164--1192 (2022; Zbl 1508.90113) Full Text: DOI arXiv
Bayer, Christian; Belomestny, Denis; Hager, Paul; Pigato, Paolo; Schoenmakers, John; Spokoiny, Vladimir Reinforced optimal control. (English) Zbl 1503.93050 Commun. Math. Sci. 20, No. 7, 1951-1978 (2022). MSC: 93E20 93E24 49L20 90C40 PDF BibTeX XML Cite \textit{C. Bayer} et al., Commun. Math. Sci. 20, No. 7, 1951--1978 (2022; Zbl 1503.93050) Full Text: DOI arXiv
Ayhan, Hayriye Optimal admission control in queues with abandonments. (English) Zbl 07631940 Oper. Res. Lett. 50, No. 6, 712-718 (2022). MSC: 90-XX PDF BibTeX XML Cite \textit{H. Ayhan}, Oper. Res. Lett. 50, No. 6, 712--718 (2022; Zbl 07631940) Full Text: DOI
Knopov, P. S.; Pepelyaeva, T. V. Some multidimensional stochastic models of inventory control with a separable cost function. (English. Ukrainian original) Zbl 1503.90009 Cybern. Syst. Anal. 58, No. 4, 523-529 (2022); translation from Kibern. Sist. Anal. 58, No. 4, 38-45 (2022). MSC: 90B05 90C40 60K10 90B22 PDF BibTeX XML Cite \textit{P. S. Knopov} and \textit{T. V. Pepelyaeva}, Cybern. Syst. Anal. 58, No. 4, 523--529 (2022; Zbl 1503.90009); translation from Kibern. Sist. Anal. 58, No. 4, 38--45 (2022) Full Text: DOI
Agarwal, Chaitanya; Guha, Shibashis; Křetínský, Jan; Muruganandham, Pazhamalai PAC statistical model checking of mean payoff in discrete- and continuous-time MDP. (English) Zbl 1514.68113 Shoham, Sharon (ed.) et al., Computer aided verification. 34th international conference, CAV 2022, Haifa, Israel, August 7–10, 2022. Proceedings. Part II. Cham: Springer. Lect. Notes Comput. Sci. 13372, 3-25 (2022). MSC: 68Q60 90C40 PDF BibTeX XML Cite \textit{C. Agarwal} et al., Lect. Notes Comput. Sci. 13372, 3--25 (2022; Zbl 1514.68113) Full Text: DOI arXiv
Junges, Sebastian; Spaan, Matthijs T. J. Abstraction-refinement for hierarchical probabilistic models. (English) Zbl 1514.68029 Shoham, Sharon (ed.) et al., Computer aided verification. 34th international conference, CAV 2022, Haifa, Israel, August 7–10, 2022. Proceedings. Part I. Cham: Springer. Lect. Notes Comput. Sci. 13371, 102-123 (2022). MSC: 68N19 68Q60 90C40 PDF BibTeX XML Cite \textit{S. Junges} and \textit{M. T. J. Spaan}, Lect. Notes Comput. Sci. 13371, 102--123 (2022; Zbl 1514.68029) Full Text: DOI arXiv
Anantharam, Venkat Reversible Markov decision processes and the Gaussian free field. (English) Zbl 1502.90186 Syst. Control Lett. 169, Article ID 105382, 8 p. (2022). MSC: 90C40 PDF BibTeX XML Cite \textit{V. Anantharam}, Syst. Control Lett. 169, Article ID 105382, 8 p. (2022; Zbl 1502.90186) Full Text: DOI arXiv
Piri, Hossein; Huh, Woonghee Tim; Shechter, Steven M.; Hudson, Darren Individualized dynamic patient monitoring under alarm fatigue. (English) Zbl 1508.90114 Oper. Res. 70, No. 5, 2749-2766 (2022). MSC: 90C40 90C90 PDF BibTeX XML Cite \textit{H. Piri} et al., Oper. Res. 70, No. 5, 2749--2766 (2022; Zbl 1508.90114) Full Text: DOI
O’Connor, Kevin; McGoff, Kevin; Nobel, Andrew B. Optimal transport for stationary Markov chains via policy iteration. (English) Zbl 07625198 J. Mach. Learn. Res. 23, Paper No. 45, 52 p. (2022). MSC: 68T05 PDF BibTeX XML Cite \textit{K. O'Connor} et al., J. Mach. Learn. Res. 23, Paper No. 45, 52 p. (2022; Zbl 07625198) Full Text: arXiv Link
Subramanian, Jayakumar; Sinha, Amit; Seraj, Raihan; Mahajan, Aditya Approximate information state for approximate planning and reinforcement learning in partially observed systems. (English) Zbl 07625165 J. Mach. Learn. Res. 23, Paper No. 12, 83 p. (2022). MSC: 68T05 PDF BibTeX XML Cite \textit{J. Subramanian} et al., J. Mach. Learn. Res. 23, Paper No. 12, 83 p. (2022; Zbl 07625165) Full Text: arXiv Link
Aslan, Ayse Optimal admission and routing with congestion-sensitive customer classes. (English) Zbl 1502.90044 Probab. Eng. Inf. Sci. 36, No. 3, 774-798 (2022). MSC: 90B22 60K25 90C40 PDF BibTeX XML Cite \textit{A. Aslan}, Probab. Eng. Inf. Sci. 36, No. 3, 774--798 (2022; Zbl 1502.90044) Full Text: DOI
Song, Ruiyang; Xu, Kuang Temporal concatenation for Markov decision processes. (English) Zbl 1507.90185 Probab. Eng. Inf. Sci. 36, No. 4, 999-1026 (2022). MSC: 90C40 90C15 90C39 PDF BibTeX XML Cite \textit{R. Song} and \textit{K. Xu}, Probab. Eng. Inf. Sci. 36, No. 4, 999--1026 (2022; Zbl 1507.90185) Full Text: DOI arXiv