Hu, Yuchen; Wager, Stefan Off-policy evaluation in partially observed Markov decision processes under sequential ignorability. (English) Zbl 07783510 Ann. Stat. 51, No. 4, 1561-1585 (2023). MSC: 62M09 62D20 PDFBibTeX XMLCite \textit{Y. Hu} and \textit{S. Wager}, Ann. Stat. 51, No. 4, 1561--1585 (2023; Zbl 07783510) Full Text: DOI arXiv Link
Cohen, Victor; Parmentier, Axel Future memories are not needed for large classes of POMDPs. (English) Zbl 1525.90424 Oper. Res. Lett. 51, No. 3, 270-277 (2023). MSC: 90C40 PDFBibTeX XMLCite \textit{V. Cohen} and \textit{A. Parmentier}, Oper. Res. Lett. 51, No. 3, 270--277 (2023; Zbl 1525.90424) Full Text: DOI arXiv
Feinberg, Eugene A.; Kasyanov, Pavlo O.; Zgurovsky, Michael Z. Markov decision processes with incomplete information and semiuniform Feller transition probabilities. (English) Zbl 1498.90245 SIAM J. Control Optim. 60, No. 4, 2488-2513 (2022). MSC: 90C40 90C39 PDFBibTeX XMLCite \textit{E. A. Feinberg} et al., SIAM J. Control Optim. 60, No. 4, 2488--2513 (2022; Zbl 1498.90245) Full Text: DOI arXiv
Zhang, Hao Dynamic learning and decision making via basis weight vectors. (English) Zbl 1497.91101 Oper. Res. 70, No. 3, 1835-1853 (2022). MSC: 91B06 90C39 PDFBibTeX XMLCite \textit{H. Zhang}, Oper. Res. 70, No. 3, 1835--1853 (2022; Zbl 1497.91101) Full Text: DOI
Zhu, Xianchao; Zhang, Ruiyuan; Huang, Tianyi; Wang, Xiaoting Visual transfer for reinforcement learning via gradient penalty based Wasserstein domain confusion. (English) Zbl 07556354 J. Nonlinear Var. Anal. 6, No. 3, 227-238 (2022). MSC: 47-XX 46-XX PDFBibTeX XMLCite \textit{X. Zhu} et al., J. Nonlinear Var. Anal. 6, No. 3, 227--238 (2022; Zbl 07556354) Full Text: DOI
Neghab, Davood Pirayesh; Khayyati, Siamak; Karaesmen, Fikri An integrated data-driven method using deep learning for a newsvendor problem with unobservable features. (English) Zbl 1507.90012 Eur. J. Oper. Res. 302, No. 2, 482-496 (2022). MSC: 90B05 68T07 PDFBibTeX XMLCite \textit{D. P. Neghab} et al., Eur. J. Oper. Res. 302, No. 2, 482--496 (2022; Zbl 1507.90012) Full Text: DOI
Zhang, Kaiqing; Yang, Zhuoran; Başar, Tamer Multi-agent reinforcement learning: a selective overview of theories and algorithms. (English) Zbl 07608712 Vamvoudakis, Kyriakos G. (ed.) et al., Handbook of reinforcement learning and control. Cham: Springer. Stud. Syst. Decis. Control 325, 321-384 (2021). MSC: 68Txx PDFBibTeX XMLCite \textit{K. Zhang} et al., Stud. Syst. Decis. Control 325, 321--384 (2021; Zbl 07608712) Full Text: DOI arXiv
Keith, Andrew J.; Ahner, Darryl K. A survey of decision making and optimization under uncertainty. (English) Zbl 1480.90185 Ann. Oper. Res. 300, No. 2, 319-353 (2021). MSC: 90C17 PDFBibTeX XMLCite \textit{A. J. Keith} and \textit{D. K. Ahner}, Ann. Oper. Res. 300, No. 2, 319--353 (2021; Zbl 1480.90185) Full Text: DOI
Hansen, Eric A. An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes. (English) Zbl 1519.68233 Artif. Intell. 294, Article ID 103431, 47 p. (2021). MSC: 68T20 90C40 PDFBibTeX XMLCite \textit{E. A. Hansen}, Artif. Intell. 294, Article ID 103431, 47 p. (2021; Zbl 1519.68233) Full Text: DOI
Horiguchi, Masayuki On an approach to evaluation of health care programme by Markov decision model. (English) Zbl 1471.92156 Piunovskiy, Alexey (ed.) et al., Modern trends in controlled stochastic processes: theory and applications, V.III. Selected papers based on the presentations at the traditional Liverpool workshop on controlled stochastic processes, Liverpool, UK, July 2021. Cham: Springer. Emerg. Complex. Comput. 41, 341-354 (2021). MSC: 92C50 90C40 PDFBibTeX XMLCite \textit{M. Horiguchi}, Emerg. Complex. Comput. 41, 341--354 (2021; Zbl 1471.92156) Full Text: DOI
Xiao, Baichun; Yang, Wei A Bayesian learning model for estimating unknown demand parameter in revenue management. (English) Zbl 1487.91046 Eur. J. Oper. Res. 293, No. 1, 248-262 (2021). MSC: 91B24 90B05 90B50 90C39 PDFBibTeX XMLCite \textit{B. Xiao} and \textit{W. Yang}, Eur. J. Oper. Res. 293, No. 1, 248--262 (2021; Zbl 1487.91046) Full Text: DOI
Ahuja, Vishal; Birge, John R. An approximation approach for response-adaptive clinical trial design. (English) Zbl 07303812 INFORMS J. Comput. 32, No. 4, 877-894 (2020). MSC: 90Cxx PDFBibTeX XMLCite \textit{V. Ahuja} and \textit{J. R. Birge}, INFORMS J. Comput. 32, No. 4, 877--894 (2020; Zbl 07303812) Full Text: DOI
Mintz, Yonatan; Aswani, Anil; Kaminsky, Philip; Flowers, Elena; Fukuoka, Yoshimi Nonstationary bandits with habituation and recovery dynamics. (English) Zbl 1455.90095 Oper. Res. 68, No. 5, 1493-1516 (2020). MSC: 90B50 PDFBibTeX XMLCite \textit{Y. Mintz} et al., Oper. Res. 68, No. 5, 1493--1516 (2020; Zbl 1455.90095) Full Text: DOI arXiv
Otten, Maarten; Timmer, Judith; Witteveen, Annemieke Stratified breast cancer follow-up using a continuous state partially observable Markov decision process. (English) Zbl 1430.90584 Eur. J. Oper. Res. 281, No. 2, 464-474 (2020). MSC: 90C90 90C40 92C50 PDFBibTeX XMLCite \textit{M. Otten} et al., Eur. J. Oper. Res. 281, No. 2, 464--474 (2020; Zbl 1430.90584) Full Text: DOI Link
Abbou, Abderrahmane; Makis, Viliam Group maintenance: a restless bandits approach. (English) Zbl 1451.90050 INFORMS J. Comput. 31, No. 4, 719-731 (2019). MSC: 90B25 60J25 90B35 90C05 91B32 PDFBibTeX XMLCite \textit{A. Abbou} and \textit{V. Makis}, INFORMS J. Comput. 31, No. 4, 719--731 (2019; Zbl 1451.90050) Full Text: DOI
Schwöbel, Sarah; Kiebel, Stefan; Marković, Dimitrije Active inference, belief propagation, and the Bethe approximation. (English) Zbl 1471.91411 Neural Comput. 30, No. 9, 2530-2567 (2018). MSC: 91E10 PDFBibTeX XMLCite \textit{S. Schwöbel} et al., Neural Comput. 30, No. 9, 2530--2567 (2018; Zbl 1471.91411) Full Text: DOI
Chen, Pengzhan; He, Zhiqiang; Chen, Chuanxi; Xu, Jiahong Control strategy of speed servo systems based on deep reinforcement learning. (English) Zbl 1461.93334 Algorithms (Basel) 11, No. 5, Paper No. 65, 18 p. (2018). MSC: 93C80 68T05 93E35 PDFBibTeX XMLCite \textit{P. Chen} et al., Algorithms (Basel) 11, No. 5, Paper No. 65, 18 p. (2018; Zbl 1461.93334) Full Text: DOI
Büyüktahtakın, İ. Esra; Haight, Robert G. A review of operations research models in invasive species management: state of the art, challenges, and future directions. (English) Zbl 1408.92037 Ann. Oper. Res. 271, No. 2, 357-403 (2018). MSC: 92D40 90B50 90C90 PDFBibTeX XMLCite \textit{İ. E. Büyüktahtakın} and \textit{R. G. Haight}, Ann. Oper. Res. 271, No. 2, 357--403 (2018; Zbl 1408.92037) Full Text: DOI
Saghafian, Soroush Ambiguous partially observable Markov decision processes: structural results and applications. (English) Zbl 1417.91172 J. Econ. Theory 178, 1-35 (2018). MSC: 91B06 90C40 91A15 91B68 PDFBibTeX XMLCite \textit{S. Saghafian}, J. Econ. Theory 178, 1--35 (2018; Zbl 1417.91172) Full Text: DOI
Thorbergsson, Leifur; Hooker, Giles Experimental design for partially observed Markov decision processes. (English) Zbl 1391.90634 SIAM/ASA J. Uncertain. Quantif. 6, 549-567 (2018). MSC: 90C40 62K05 62F10 90C39 PDFBibTeX XMLCite \textit{L. Thorbergsson} and \textit{G. Hooker}, SIAM/ASA J. Uncertain. Quantif. 6, 549--567 (2018; Zbl 1391.90634) Full Text: DOI arXiv
Zhang, Ling; Zhang, Hao; Yao, Haixiang Optimal investment management for a defined contribution pension fund under imperfect information. (English) Zbl 1401.91214 Insur. Math. Econ. 79, 210-224 (2018). MSC: 91B30 60J20 91G10 PDFBibTeX XMLCite \textit{L. Zhang} et al., Insur. Math. Econ. 79, 210--224 (2018; Zbl 1401.91214) Full Text: DOI
Jiao, Peng; Xu, Kai; Yue, Shiguang; Wei, Xiangyu; Sun, Lin A decentralized partially observable Markov decision model with action duration for goal recognition in real time strategy games. (English) Zbl 1371.68279 Discrete Dyn. Nat. Soc. 2017, Article ID 4580206, 15 p. (2017). MSC: 68T42 68T05 90C40 PDFBibTeX XMLCite \textit{P. Jiao} et al., Discrete Dyn. Nat. Soc. 2017, Article ID 4580206, 15 p. (2017; Zbl 1371.68279) Full Text: DOI
Hinz, Juri; Yee, Jeremy Stochastic switching for partially observable dynamics and optimal asset allocation. (English) Zbl 1359.93535 Int. J. Control 90, No. 3, 553-565 (2017). MSC: 93E20 90C39 90C40 90C46 93E25 PDFBibTeX XMLCite \textit{J. Hinz} and \textit{J. Yee}, Int. J. Control 90, No. 3, 553--565 (2017; Zbl 1359.93535) Full Text: DOI Link
Ben-Zvi, Tal; Chernonog, Tatyana; Avinadav, Tal A two-state partially observable Markov decision process with three actions. (English) Zbl 1346.90798 Eur. J. Oper. Res. 254, No. 3, 957-967 (2016). MSC: 90C40 PDFBibTeX XMLCite \textit{T. Ben-Zvi} et al., Eur. J. Oper. Res. 254, No. 3, 957--967 (2016; Zbl 1346.90798) Full Text: DOI
Guiver, Chris; Mueller, Markus; Hodgson, Dave; Townley, Stuart Robust set-point regulation for ecological models with multiple management goals. (English) Zbl 1341.93021 J. Math. Biol. 72, No. 6, 1467-1529 (2016). MSC: 93B35 93C55 93D15 93B03 92D25 92D40 PDFBibTeX XMLCite \textit{C. Guiver} et al., J. Math. Biol. 72, No. 6, 1467--1529 (2016; Zbl 1341.93021) Full Text: DOI
Faddoul, R.; Raphael, W.; Soubra, A.-H.; Chateauneuf, A. Partially observable Markov decision processes incorporating. (English) Zbl 1339.90329 Eur. J. Oper. Res. 241, No. 2, 391-401 (2015). MSC: 90C40 90C39 90B25 PDFBibTeX XMLCite \textit{R. Faddoul} et al., Eur. J. Oper. Res. 241, No. 2, 391--401 (2015; Zbl 1339.90329) Full Text: DOI
Fan, Hongdong; Xu, Zhe; Chen, Shiwei Optimally maintaining a multi-state system with limited imperfect preventive repairs. (English) Zbl 1332.93373 Int. J. Syst. Sci., Princ. Appl. Syst. Integr. 46, No. 10, 1729-1740 (2015). MSC: 93E20 90C40 90B25 PDFBibTeX XMLCite \textit{H. Fan} et al., Int. J. Syst. Sci., Princ. Appl. Syst. Integr. 46, No. 10, 1729--1740 (2015; Zbl 1332.93373) Full Text: DOI
Chang, Yanling; Erera, Alan L.; White, Chelsea C. III Value of information for a leader-follower partially observed Markov game. (English) Zbl 1358.91017 Ann. Oper. Res. 235, 129-153 (2015). MSC: 91A15 90C40 PDFBibTeX XMLCite \textit{Y. Chang} et al., Ann. Oper. Res. 235, 129--153 (2015; Zbl 1358.91017) Full Text: DOI
Chang, Yanling; Erera, Alan L.; White, Chelsea C. III A leader-follower partially observed, multiobjective Markov game. (English) Zbl 1358.91016 Ann. Oper. Res. 235, 103-128 (2015). MSC: 91A15 90C40 68T20 PDFBibTeX XMLCite \textit{Y. Chang} et al., Ann. Oper. Res. 235, 103--128 (2015; Zbl 1358.91016) Full Text: DOI arXiv
den Boer, Arnoud V.; Zwart, Bert Dynamic pricing and learning with finite inventories. (English) Zbl 1329.91045 Oper. Res. 63, No. 4, 965-978 (2015). MSC: 91B24 90B05 62P20 PDFBibTeX XMLCite \textit{A. V. den Boer} and \textit{B. Zwart}, Oper. Res. 63, No. 4, 965--978 (2015; Zbl 1329.91045) Full Text: DOI Link
Guiver, Chris; Logemann, Hartmut; Rebarber, Richard; Bill, Adam; Tenhumberg, Brigitte; Hodgson, Dave; Townley, Stuart Integral control for population management. (English) Zbl 1307.93239 J. Math. Biol. 70, No. 5, 1015-1063 (2015). MSC: 93C55 93D15 93C40 92D25 92D40 PDFBibTeX XMLCite \textit{C. Guiver} et al., J. Math. Biol. 70, No. 5, 1015--1063 (2015; Zbl 1307.93239) Full Text: DOI Link
Rens, Gavin; Meyer, Thomas; Lakemeyer, Gerhard SLAP: specification logic of actions with probability. (English) Zbl 1327.68164 J. Appl. Log. 12, No. 2, 128-150 (2014). MSC: 68Q60 03B35 03B45 68T27 PDFBibTeX XMLCite \textit{G. Rens} et al., J. Appl. Log. 12, No. 2, 128--150 (2014; Zbl 1327.68164) Full Text: DOI
Ni, Yaodong; Liu, Zhi-Qiang Bounded-parameter partially observable Markov decision processes: framework and algorithm. (English) Zbl 1321.91027 Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 21, No. 6, 821-863 (2013). MSC: 91B06 90C40 PDFBibTeX XMLCite \textit{Y. Ni} and \textit{Z.-Q. Liu}, Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 21, No. 6, 821--863 (2013; Zbl 1321.91027) Full Text: DOI
Ortiz, Olga L.; Erera, Alan L.; White, Chelsea C. State observation accuracy and finite-memory policy performance. (English) Zbl 1286.90163 Oper. Res. Lett. 41, No. 5, 477-481 (2013). MSC: 90C40 93E35 PDFBibTeX XMLCite \textit{O. L. Ortiz} et al., Oper. Res. Lett. 41, No. 5, 477--481 (2013; Zbl 1286.90163) Full Text: DOI
Kehagias, Athanasios; Mitsche, Dieter; Prałat, Paweł Cops and invisible robbers: the cost of drunkenness. (English) Zbl 1291.91039 Theor. Comput. Sci. 481, 100-120 (2013). MSC: 91A43 05C57 05C81 05C85 PDFBibTeX XMLCite \textit{A. Kehagias} et al., Theor. Comput. Sci. 481, 100--120 (2013; Zbl 1291.91039) Full Text: DOI arXiv
Kim, Yeek-Hyun; Thomas, Lyn C. Training and repair policies for stand-by systems. (English) Zbl 1274.90119 Ann. Oper. Res. 208, 469-487 (2013). MSC: 90B25 90C40 PDFBibTeX XMLCite \textit{Y.-H. Kim} and \textit{L. C. Thomas}, Ann. Oper. Res. 208, 469--487 (2013; Zbl 1274.90119) Full Text: DOI Link
Lauri, Mikko; Ritala, Risto Planning for multiple measurement channels in a continuous-state POMDP. (English) Zbl 1271.90105 Ann. Math. Artif. Intell. 67, No. 3-4, 283-317 (2013). MSC: 90C40 68T37 90C39 PDFBibTeX XMLCite \textit{M. Lauri} and \textit{R. Ritala}, Ann. Math. Artif. Intell. 67, No. 3--4, 283--317 (2013; Zbl 1271.90105) Full Text: DOI
Ludkovski, Michael; Sezer, Semih O. Finite horizon decision timing with partially observable Poisson processes. (English) Zbl 1244.62008 Stoch. Models 28, No. 2, 207-247 (2012). MSC: 62C99 62L15 62M99 62L10 65C60 PDFBibTeX XMLCite \textit{M. Ludkovski} and \textit{S. O. Sezer}, Stoch. Models 28, No. 2, 207--247 (2012; Zbl 1244.62008) Full Text: DOI arXiv
Çanakoğlu, Ethem; Özekici, Süleyman Portfolio selection with imperfect information: a hidden Markov model. (English) Zbl 1276.91091 Appl. Stoch. Models Bus. Ind. 27, No. 2, 95-114 (2011). MSC: 91G10 60J20 90C39 PDFBibTeX XMLCite \textit{E. Çanakoğlu} and \textit{S. Özekici}, Appl. Stoch. Models Bus. Ind. 27, No. 2, 95--114 (2011; Zbl 1276.91091) Full Text: DOI
Goulionis, John; Stengos, D. Partially observable Markov decision processes and periodic policies with applications. (English) Zbl 1250.90107 Int. J. Inf. Technol. Decis. Mak. 10, No. 6, 1175-1197 (2011). MSC: 90C40 90B50 PDFBibTeX XMLCite \textit{J. Goulionis} and \textit{D. Stengos}, Int. J. Inf. Technol. Decis. Mak. 10, No. 6, 1175--1197 (2011; Zbl 1250.90107) Full Text: DOI
Bensoussan, Alain; Cakanyildirim, Metin; Sethi, Suresh P.; Shi, Ruixia Computation of approximate optimal policies in a partially observed inventory model with rain checks. (English) Zbl 1234.90003 Automatica 47, No. 8, 1589-1604 (2011). Reviewer: Efstratios Rappos (Aubonne) MSC: 90B05 90B25 PDFBibTeX XMLCite \textit{A. Bensoussan} et al., Automatica 47, No. 8, 1589--1604 (2011; Zbl 1234.90003) Full Text: DOI
Nezhad, Mohammad Saber Fallah; Niaki, Seyed Taghi Akhavan A multi-stage two-machines replacement strategy using mixture models, Bayesian inference, and stochastic dynamic programming. (English) Zbl 1217.62156 Commun. Stat., Theory Methods 40, No. 4, 702-725 (2011). MSC: 62N05 62F15 90C15 62P30 90C39 65C60 PDFBibTeX XMLCite \textit{M. S. F. Nezhad} and \textit{S. T. A. Niaki}, Commun. Stat., Theory Methods 40, No. 4, 702--725 (2011; Zbl 1217.62156) Full Text: DOI
Arifoğlu, Kenan; Özekici, Süleyman Optimal policies for inventory systems with finite capacity and partially observed Markov-modulated demand and supply processes. (English) Zbl 1181.90278 Eur. J. Oper. Res. 204, No. 3, 421-438 (2010). MSC: 90C39 90B05 PDFBibTeX XMLCite \textit{K. Arifoğlu} and \textit{S. Özekici}, Eur. J. Oper. Res. 204, No. 3, 421--438 (2010; Zbl 1181.90278) Full Text: DOI
Baier, Christel; Größer, Marcus; Ciesinski, Frank Model checking linear-time properties of probabilistic systems. (English) Zbl 1484.68095 Droste, Manfred (ed.) et al., Handbook of weighted automata. Berlin: Springer. Monogr. Theoret. Comput. Sci., EATCS Ser., 519-570 (2009). MSC: 68Q60 68Q10 68Q45 68Q87 90C40 PDFBibTeX XMLCite \textit{C. Baier} et al., in: Handbook of weighted automata. Berlin: Springer. 519--570 (2009; Zbl 1484.68095) Full Text: DOI
Biele, Guido; Erev, Ido; Ert, Eyal Learning, risk attitude and hot stoves in restless bandit problems. (English) Zbl 1176.91134 J. Math. Psychol. 53, No. 3, 155-167 (2009). MSC: 91E40 PDFBibTeX XMLCite \textit{G. Biele} et al., J. Math. Psychol. 53, No. 3, 155--167 (2009; Zbl 1176.91134) Full Text: DOI
Littman, Michael L. A tutorial on partially observable Markov decision processes. (English) Zbl 1176.90298 J. Math. Psychol. 53, No. 3, 119-125 (2009). MSC: 90B50 PDFBibTeX XMLCite \textit{M. L. Littman}, J. Math. Psychol. 53, No. 3, 119--125 (2009; Zbl 1176.90298) Full Text: DOI
Baier, Christel; Bertrand, Nathalie; Größer, Marcus Probabilistic acceptors for languages over infinite words. (English) Zbl 1206.68167 Nielsen, Mogens (ed.) et al., SOFSEM 2009: Theory and practice of computer science. 35th conference on current trends in theory and practice of computer science, Špindlerův Mlýn, Czech Republic, January 24–30, 2009. Proceedings. Berlin: Springer (ISBN 978-3-540-95890-1/pbk). Lecture Notes in Computer Science 5404, 19-33 (2009). MSC: 68Q45 PDFBibTeX XMLCite \textit{C. Baier} et al., Lect. Notes Comput. Sci. 5404, 19--33 (2009; Zbl 1206.68167) Full Text: DOI
Makis, Viliam Multivariate Bayesian process control for a finite production run. (English) Zbl 1168.90401 Eur. J. Oper. Res. 194, No. 3, 795-806 (2009). MSC: 90B25 62C12 62P30 90C40 PDFBibTeX XMLCite \textit{V. Makis}, Eur. J. Oper. Res. 194, No. 3, 795--806 (2009; Zbl 1168.90401) Full Text: DOI
Ghasemi, A.; Yacout, S.; Ouali, M. S. Optimal condition based maintenance with imperfect information and the proportional hazards model. (English) Zbl 1128.90326 Int. J. Prod. Res. 45, No. 4, 989-1012 (2007). MSC: 90B25 PDFBibTeX XMLCite \textit{A. Ghasemi} et al., Int. J. Prod. Res. 45, No. 4, 989--1012 (2007; Zbl 1128.90326) Full Text: DOI
Grosfeld-Nir, Abraham Control limits for two-state partially observable Markov decision processes. (English) Zbl 1128.90057 Eur. J. Oper. Res. 182, No. 1, 300-304 (2007). MSC: 90C40 PDFBibTeX XMLCite \textit{A. Grosfeld-Nir}, Eur. J. Oper. Res. 182, No. 1, 300--304 (2007; Zbl 1128.90057) Full Text: DOI
Fernández, Joaquín L.; Sanz, Rafael; Simmons, Reid G.; Diéguez, Amador R. Heuristic anytime approaches to stochastic decision processes. (English) Zbl 1163.90812 J. Heuristics 12, No. 3, 181-209 (2006). MSC: 90C59 90C40 PDFBibTeX XMLCite \textit{J. L. Fernández} et al., J. Heuristics 12, No. 3, 181--209 (2006; Zbl 1163.90812) Full Text: DOI
Lian, Zhaotong; Deshmukh, Abhijit Performance prediction of an unmanned airborne vehicle multi-agent system. (English) Zbl 1168.90669 Eur. J. Oper. Res. 172, No. 2, 680-695 (2006). MSC: 90C90 90C40 PDFBibTeX XMLCite \textit{Z. Lian} and \textit{A. Deshmukh}, Eur. J. Oper. Res. 172, No. 2, 680--695 (2006; Zbl 1168.90669) Full Text: DOI
Cavazos-Cadena, Rolando; Hernández-Hernández, Daniel Successive approximations in partially observable controlled Markov chains with risk-sensitive average criterion. (English) Zbl 1101.93083 Stochastics 77, No. 6, 537-568 (2005). MSC: 93E20 93B36 PDFBibTeX XMLCite \textit{R. Cavazos-Cadena} and \textit{D. Hernández-Hernández}, Stochastics 77, No. 6, 537--568 (2005; Zbl 1101.93083) Full Text: DOI
Jang, Wooseung; Shanthikumar, J. George Sequential process control under capacity constraints. (English) Zbl 1047.90074 Eur. J. Oper. Res. 155, No. 3, 695-714 (2004). MSC: 90C40 90C39 PDFBibTeX XMLCite \textit{W. Jang} and \textit{J. G. Shanthikumar}, Eur. J. Oper. Res. 155, No. 3, 695--714 (2004; Zbl 1047.90074) Full Text: DOI
Givan, Robert; Dean, Thomas; Greig, Matthew Equivalence notions and model minimization in Markov decision processes. (English) Zbl 1082.68801 Artif. Intell. 147, No. 1-2, 163-223 (2003). MSC: 68T20 68T30 68T37 PDFBibTeX XMLCite \textit{R. Givan} et al., Artif. Intell. 147, No. 1--2, 163--223 (2003; Zbl 1082.68801) Full Text: DOI
Madani, Omid; Hanks, Steve; Condon, Anne On the undecidability of probabilistic planning and related stochastic optimization problems. (English) Zbl 1082.68806 Artif. Intell. 147, No. 1-2, 5-34 (2003). MSC: 68T20 68Q25 03D35 68T37 90C15 PDFBibTeX XMLCite \textit{O. Madani} et al., Artif. Intell. 147, No. 1--2, 5--34 (2003; Zbl 1082.68806) Full Text: DOI
Rosenberg, Dinah; Solan, Eilon; Vieille, Nicolas Blackwell optimality in Markov decision processes with partial observation. (English) Zbl 1103.90402 Ann. Stat. 30, No. 4, 1178-1193 (2002). MSC: 90C40 PDFBibTeX XMLCite \textit{D. Rosenberg} et al., Ann. Stat. 30, No. 4, 1178--1193 (2002; Zbl 1103.90402) Full Text: DOI Euclid
Jang, Wooseung; Shanthikumar, J. George Stochastic allocation of inspection capacity to competitive processes. (English) Zbl 0994.90093 Nav. Res. Logist. 49, No. 1, 78-94 (2002). MSC: 90B85 90C40 PDFBibTeX XMLCite \textit{W. Jang} and \textit{J. G. Shanthikumar}, Nav. Res. Logist. 49, No. 1, 78--94 (2002; Zbl 0994.90093) Full Text: DOI
Evans, Jamie; Krishnamurthy, Vikram Optimal sensor scheduling for hidden Markov model state estimation. (English) Zbl 1024.93059 Int. J. Control 74, No. 18, 1737-1742 (2001). Reviewer: Giovanni Di Masi (Padova) MSC: 93E10 93E11 49L20 PDFBibTeX XMLCite \textit{J. Evans} and \textit{V. Krishnamurthy}, Int. J. Control 74, No. 18, 1737--1742 (2001; Zbl 1024.93059) Full Text: DOI
Sahin, Izzet; Zahedi, Fatemeh Mariam Optimal policies under risk for changing software systems based on customer satisfaction. (English) Zbl 0961.90027 Eur. J. Oper. Res. 123, No. 1, 175-194 (2000). MSC: 90B25 90C40 PDFBibTeX XMLCite \textit{I. Sahin} and \textit{F. M. Zahedi}, Eur. J. Oper. Res. 123, No. 1, 175--194 (2000; Zbl 0961.90027) Full Text: DOI
Gilbert, Stephen M.; Bar, Hena M. The value of observing the condition of a deteriorating machine. (English) Zbl 0971.90022 Nav. Res. Logist. 46, No. 7, 790-808 (1999). MSC: 90B30 90B25 PDFBibTeX XMLCite \textit{S. M. Gilbert} and \textit{H. M. Bar}, Nav. Res. Logist. 46, No. 7, 790--808 (1999; Zbl 0971.90022) Full Text: DOI
Mallor, Fermín; Azcárate, Cristina On replacement policies for additive systems with several working levels. (English) Zbl 0958.90018 Ann. Oper. Res. 91, 63-82 (1999). MSC: 90B25 PDFBibTeX XMLCite \textit{F. Mallor} and \textit{C. Azcárate}, Ann. Oper. Res. 91, 63--82 (1999; Zbl 0958.90018) Full Text: DOI
David, Israel; Friedman, Lea; Sinuany-Stern, Zilla A simple suboptimal algorithm for system maintance under partial observability. (English) Zbl 0970.90023 Ann. Oper. Res. 91, 25-40 (1999). MSC: 90B25 90C40 90C59 60K10 PDFBibTeX XMLCite \textit{I. David} et al., Ann. Oper. Res. 91, 25--40 (1999; Zbl 0970.90023) Full Text: DOI
Kaelbling, Leslie Pack; Littman, Michael L.; Cassandra, Anthony R. Planning and acting in partially observable stochastic domains. (English) Zbl 0908.68165 Artif. Intell. 101, No. 1-2, 99-134 (1998). MSC: 68T20 PDFBibTeX XMLCite \textit{L. P. Kaelbling} et al., Artif. Intell. 101, No. 1--2, 99--134 (1998; Zbl 0908.68165) Full Text: DOI
Gong, Linguo; Tang, Kwei Monitoring machine operations using on-line sensors. (English) Zbl 0917.90139 Eur. J. Oper. Res. 96, No. 3, 479-492 (1997). MSC: 90B25 PDFBibTeX XMLCite \textit{L. Gong} and \textit{K. Tang}, Eur. J. Oper. Res. 96, No. 3, 479--492 (1997; Zbl 0917.90139) Full Text: DOI
White, D. J. A superharmonic approach to solving infinite horizon partially observable Markov decision problems. (English) Zbl 0834.90136 Z. Oper. Res. 41, No. 1, 71-88 (1995). MSC: 90C40 PDFBibTeX XMLCite \textit{D. J. White}, Z. Oper. Res. 41, No. 1, 71--88 (1995; Zbl 0834.90136) Full Text: DOI
Serin, Yasemin A nonlinear programming model for partially observable Markov decision processes: Finite horizon case. (English) Zbl 0914.90262 Eur. J. Oper. Res. 86, No. 3, 549-564 (1995). MSC: 90C40 PDFBibTeX XMLCite \textit{Y. Serin}, Eur. J. Oper. Res. 86, No. 3, 549--564 (1995; Zbl 0914.90262) Full Text: DOI
Hordijk, A.; Loeve, J. A. Undiscounted Markov decision chains with partial information; An algorithm for computing a locally optimal periodic policy. (English) Zbl 0826.90120 Z. Oper. Res. 40, No. 2, 163-181 (1994). MSC: 90C40 90B22 60K20 90B18 PDFBibTeX XMLCite \textit{A. Hordijk} and \textit{J. A. Loeve}, Z. Oper. Res. 40, No. 2, 163--181 (1994; Zbl 0826.90120) Full Text: DOI
Monahan, George E. Optimal sequential file search. (English) Zbl 0809.90130 Eur. J. Oper. Res. 77, No. 2, 224-240 (1994). MSC: 90C40 PDFBibTeX XMLCite \textit{G. E. Monahan}, Eur. J. Oper. Res. 77, No. 2, 224--240 (1994; Zbl 0809.90130) Full Text: DOI Link
White, D. J. Extension of the Frank-Wolfe algorithm to concave nondifferentiable objective functions. (English) Zbl 0792.90077 J. Optimization Theory Appl. 78, No. 2, 283-301 (1993). MSC: 90C30 49J52 PDFBibTeX XMLCite \textit{D. J. White}, J. Optim. Theory Appl. 78, No. 2, 283--301 (1993; Zbl 0792.90077) Full Text: DOI
Schneeberger, Stefan Markov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information). (German) Zbl 0757.90027 OR Spektrum 14, No. 2, 71-78 (1992). MSC: 90B25 90-08 90C40 PDFBibTeX XMLCite \textit{S. Schneeberger}, OR Spektrum 14, No. 2, 71--78 (1992; Zbl 0757.90027) Full Text: DOI
White, Douglas J. Piecewise linear approximations for partially observable Markov decision processes with finite horizons. (English) Zbl 0767.90090 J. Inf. Optim. Sci. 13, No. 2, 311-324 (1992). Reviewer: E.A.Feinberg (Stony Brook) MSC: 90C40 PDFBibTeX XMLCite \textit{D. J. White}, J. Inf. Optim. Sci. 13, No. 2, 311--324 (1992; Zbl 0767.90090) Full Text: DOI
Sernik, E. L.; Marcus, S. I. Optimal cost and policy for a Markovian replacement problem. (English) Zbl 0793.90026 J. Optimization Theory Appl. 71, No. 1, 105-126 (1991). MSC: 90B25 90C39 60K20 60J10 PDFBibTeX XMLCite \textit{E. L. Sernik} and \textit{S. I. Marcus}, J. Optim. Theory Appl. 71, No. 1, 105--126 (1991; Zbl 0793.90026) Full Text: DOI
White, Chelsea C. III. A survey of solution techniques for the partially observed Markov decision process. (English) Zbl 0727.90089 Ann. Oper. Res. 32, 215-230 (1991). MSC: 90C40 90-02 90-08 PDFBibTeX XMLCite \textit{C. C. III. White}, Ann. Oper. Res. 32, 215--230 (1991; Zbl 0727.90089) Full Text: DOI
Fernández-Gaucherand, Emmanuel; Arapostathis, Aristotle; Marcus, Steven I. On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes. (English) Zbl 0717.90094 Ann. Oper. Res. 29, No. 1-4, 439-469 (1991). MSC: 90C40 93E20 60J10 90B25 PDFBibTeX XMLCite \textit{E. Fernández-Gaucherand} et al., Ann. Oper. Res. 29, No. 1--4, 439--469 (1991; Zbl 0717.90094) Full Text: DOI
Lovejoy, William S. A survey of algorithmic methods for partially observed Markov decision processes. (English) Zbl 0717.90086 Ann. Oper. Res. 28, No. 1-4, 47-65 (1991). MSC: 90C40 90-08 90-02 PDFBibTeX XMLCite \textit{W. S. Lovejoy}, Ann. Oper. Res. 28, No. 1--4, 47--65 (1991; Zbl 0717.90086) Full Text: DOI
Runggaldier, Wolfgang J. On the construction of \(\epsilon\)-optimal strategies in partially observed MDPs. (English) Zbl 0717.90085 Ann. Oper. Res. 28, No. 1-4, 81-95 (1991). MSC: 90C40 90-02 90-08 PDFBibTeX XMLCite \textit{W. J. Runggaldier}, Ann. Oper. Res. 28, No. 1--4, 81--95 (1991; Zbl 0717.90085) Full Text: DOI
Rieder, U. Structural results for partially observed control models. (English) Zbl 0755.93083 Z. Oper. Res. 35, No. 6, 473-490 (1991). Reviewer: K.M.Ramachandran (Tampa) MSC: 93E20 60J20 90C40 PDFBibTeX XMLCite \textit{U. Rieder}, Z. Oper. Res. 35, No. 6, 473--490 (1991; Zbl 0755.93083) Full Text: DOI
White, Chelsea C. III; White, Douglas J. Markov decision processes. (English) Zbl 0677.90086 Eur. J. Oper. Res. 39, No. 1, 1-16 (1989). Reviewer: G.Hübner MSC: 90C40 90C90 PDFBibTeX XMLCite \textit{C. C. White III} and \textit{D. J. White}, Eur. J. Oper. Res. 39, No. 1, 1--16 (1989; Zbl 0677.90086) Full Text: DOI
Hernández-Lerma, Onésimo; Marcus, Steven I. Nonparametric adaptive control of discrete-time partially observable stochastic systems. (English) Zbl 0675.93055 J. Math. Anal. Appl. 137, No. 2, 312-334 (1989). MSC: 93C40 93E03 93C55 PDFBibTeX XMLCite \textit{O. Hernández-Lerma} and \textit{S. I. Marcus}, J. Math. Anal. Appl. 137, No. 2, 312--334 (1989; Zbl 0675.93055) Full Text: DOI
Whiting, R. G.; Pickett, E. E. On model order estimation for partially observed Markov chains. (English) Zbl 0648.93056 Automatica 24, No. 4, 569-572 (1988). MSC: 93E10 60J10 93E12 62F12 PDFBibTeX XMLCite \textit{R. G. Whiting} and \textit{E. E. Pickett}, Automatica 24, No. 4, 569--572 (1988; Zbl 0648.93056) Full Text: DOI
Hernandez-Lerma, O.; Marcus, S. I. Adaptive control of Markov processes with incomplete state information and unknown parameters. (English) Zbl 0585.90090 J. Optimization Theory Appl. 52, 227-241 (1987). MSC: 90C40 PDFBibTeX XMLCite \textit{O. Hernandez-Lerma} and \textit{S. I. Marcus}, J. Optim. Theory Appl. 52, 227--241 (1987; Zbl 0585.90090) Full Text: DOI
Ohnishi, Masamitsu; Kawai, Hajime; Mine, Hisashi An optimal inspection and replacement policy under incomplete state information. (English) Zbl 0623.90025 Eur. J. Oper. Res. 27, 117-128 (1986). Reviewer: R.Subramanian MSC: 90B25 60K20 90C40 PDFBibTeX XMLCite \textit{M. Ohnishi} et al., Eur. J. Oper. Res. 27, 117--128 (1986; Zbl 0623.90025) Full Text: DOI
Karandikar, Rajeeva L.; Kulkarni, Vidyadhar G. Limiting distributions of functionals of Markov chains. (English) Zbl 0562.60025 Stochastic Processes Appl. 19, 225-235 (1985). Reviewer: A.Pakes MSC: 60F05 60J10 PDFBibTeX XMLCite \textit{R. L. Karandikar} and \textit{V. G. Kulkarni}, Stochastic Processes Appl. 19, 225--235 (1985; Zbl 0562.60025) Full Text: DOI
Neck, Reinhard Stochastic control theory and operational research. (English) Zbl 0564.90030 Eur. J. Oper. Res. 17, 283-301 (1984). Reviewer: F.Colonius MSC: 90B99 93E20 90-02 90C39 90C90 PDFBibTeX XMLCite \textit{R. Neck}, Eur. J. Oper. Res. 17, 283--301 (1984; Zbl 0564.90030) Full Text: DOI