Lu, Jingyi; Quevedo, Daniel E. A jointly optimal design of control and scheduling in networked systems under denial-of-service attacks. (English) Zbl 07647647 Automatica 148, Article ID 110774, 8 p. (2023). MSC: 93B70 93E11 93A15 PDF BibTeX XML Cite \textit{J. Lu} and \textit{D. E. Quevedo}, Automatica 148, Article ID 110774, 8 p. (2023; Zbl 07647647) Full Text: DOI arXiv OpenURL
Kovařík, Vojtěch; Seitz, Dominik; Lisý, Viliam; Rudolf, Jan; Sun, Shuo; Ha, Karel Value functions for depth-limited solving in zero-sum imperfect-information games. (English) Zbl 07638283 Artif. Intell. 314, Article ID 103805, 51 p. (2023). MSC: 68Txx PDF BibTeX XML Cite \textit{V. Kovařík} et al., Artif. Intell. 314, Article ID 103805, 51 p. (2023; Zbl 07638283) Full Text: DOI arXiv OpenURL
Wang, Jun; Cao, Lei; Chen, Xiliang; Lai, Jun General proof of convergence of the Nash-Q-learning algorithm. (English) Zbl 07490713 Fractals 30, No. 1, Article ID 2250027, 9 p. (2022). MSC: 68Txx 28Axx 91Axx PDF BibTeX XML Cite \textit{J. Wang} et al., Fractals 30, No. 1, Article ID 2250027, 9 p. (2022; Zbl 07490713) Full Text: DOI OpenURL
Jaimungal, Sebastian Reinforcement learning and stochastic optimisation. (English) Zbl 1482.91225 Finance Stoch. 26, No. 1, 103-129 (2022). MSC: 91G80 93E20 68T07 91A15 PDF BibTeX XML Cite \textit{S. Jaimungal}, Finance Stoch. 26, No. 1, 103--129 (2022; Zbl 1482.91225) Full Text: DOI OpenURL
Zhang, Kaiqing; Yang, Zhuoran; Başar, Tamer Multi-agent reinforcement learning: a selective overview of theories and algorithms. (English) Zbl 07608712 Vamvoudakis, Kyriakos G. (ed.) et al., Handbook of reinforcement learning and control. Cham: Springer. Stud. Syst. Decis. Control 325, 321-384 (2021). MSC: 68Txx PDF BibTeX XML Cite \textit{K. Zhang} et al., Stud. Syst. Decis. Control 325, 321--384 (2021; Zbl 07608712) Full Text: DOI arXiv OpenURL
Ren, Chunying; Wu, Zijun; Xu, Dachuan; Xu, Wenqing A game-theoretic analysis of deep neural networks. (English) Zbl 07551699 Wu, Weili (ed.) et al., Algorithmic aspects in information and management. 15th international conference, AAIM 2021, virtual event, December 20–22, 2021. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 13153, 369-379 (2021). MSC: 68T07 91A80 PDF BibTeX XML Cite \textit{C. Ren} et al., Lect. Notes Comput. Sci. 13153, 369--379 (2021; Zbl 07551699) Full Text: DOI OpenURL
Li, Sarah H. Q.; Adjé, Assalé; Garoche, Pierre-Loïc; Açıkmeşe, Behçet Bounding fixed points of set-based Bellman operator and Nash equilibria of stochastic games. (English) Zbl 1478.91011 Automatica 130, Article ID 109685, 12 p. (2021). MSC: 91A15 90C40 91A26 93A16 93E03 PDF BibTeX XML Cite \textit{S. H. Q. Li} et al., Automatica 130, Article ID 109685, 12 p. (2021; Zbl 1478.91011) Full Text: DOI arXiv OpenURL
Hackney, Michael; James, Alex; Plank, Michael J. Cooperative and non-cooperative behaviour in the exploitation of a common renewable resource with environmental stochasticity. (English) Zbl 1481.91139 Appl. Math. Modelling 89, Part 2, 1041-1054 (2021). MSC: 91B76 90C40 PDF BibTeX XML Cite \textit{M. Hackney} et al., Appl. Math. Modelling 89, Part 2, 1041--1054 (2021; Zbl 1481.91139) Full Text: DOI OpenURL
Usui, Yuki; Ueda, Masahiko Symmetric equilibrium of multi-agent reinforcement learning in repeated prisoner’s dilemma. (English) Zbl 07425002 Appl. Math. Comput. 409, Article ID 126370, 18 p. (2021). MSC: 91Axx 92Dxx 37Nxx 68Txx 68-XX PDF BibTeX XML Cite \textit{Y. Usui} and \textit{M. Ueda}, Appl. Math. Comput. 409, Article ID 126370, 18 p. (2021; Zbl 07425002) Full Text: DOI arXiv OpenURL
Yuan, Huanhuan; Xia, Yuanqing; Yuan, Yuan; Yang, Hongjiu Resilient strategy design for cyber-physical system under active eavesdropping attack. (English) Zbl 1467.93143 J. Franklin Inst. 358, No. 10, 5281-5304 (2021). MSC: 93B70 93C83 91A05 91A15 91A80 PDF BibTeX XML Cite \textit{H. Yuan} et al., J. Franklin Inst. 358, No. 10, 5281--5304 (2021; Zbl 1467.93143) Full Text: DOI OpenURL
Vainer, Jan; Kukacka, Jiri Nash Q-learning agents in Hotelling’s model: reestablishing equilibrium. (English) Zbl 1464.91062 Commun. Nonlinear Sci. Numer. Simul. 99, Article ID 105805, 19 p. (2021). MSC: 91B72 91B70 91A80 PDF BibTeX XML Cite \textit{J. Vainer} and \textit{J. Kukacka}, Commun. Nonlinear Sci. Numer. Simul. 99, Article ID 105805, 19 p. (2021; Zbl 1464.91062) Full Text: DOI OpenURL
Lazaridis, Aristotelis; Fachantidis, Anestis; Vlahavas, Ioannis Deep reinforcement learning: a state-of-the-art walkthrough. (English) Zbl 1497.68447 J. Artif. Intell. Res. (JAIR) 69, 1421-1471 (2020). MSC: 68T07 68-02 PDF BibTeX XML Cite \textit{A. Lazaridis} et al., J. Artif. Intell. Res. (JAIR) 69, 1421--1471 (2020; Zbl 1497.68447) Full Text: DOI OpenURL
Liu, Shan; Li, Shanbin; Xu, Bugong Event-triggered resilient control for cyber-physical system under denial-of-service attacks. (English) Zbl 1453.93144 Int. J. Control 93, No. 8, 1907-1919 (2020). MSC: 93C65 93B70 93C83 PDF BibTeX XML Cite \textit{S. Liu} et al., Int. J. Control 93, No. 8, 1907--1919 (2020; Zbl 1453.93144) Full Text: DOI OpenURL
Wen, Yinlei; Zhang, Huaguang; Ren, He; Zhang, Kun Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system. (English) Zbl 1447.93205 J. Franklin Inst. 357, No. 12, 8059-8081 (2020). MSC: 93C55 93C05 91A05 91A80 PDF BibTeX XML Cite \textit{Y. Wen} et al., J. Franklin Inst. 357, No. 12, 8059--8081 (2020; Zbl 1447.93205) Full Text: DOI OpenURL
Ding, Kemi; Ren, Xiaoqiang; Quevedo, Daniel E.; Dey, Subhrakanti; Shi, Ling Defensive deception against reactive jamming attacks in remote state estimation. (English) Zbl 1440.93249 Automatica 113, Article ID 108680, 11 p. (2020). MSC: 93E11 93C83 68M25 91A15 91A80 PDF BibTeX XML Cite \textit{K. Ding} et al., Automatica 113, Article ID 108680, 11 p. (2020; Zbl 1440.93249) Full Text: DOI OpenURL
Sahabandu, Dinuka; Moothedath, Shana; Allen, Joey; Bushnell, Linda; Lee, Wenke; Poovendran, Radha Stochastic dynamic information flow tracking game with reinforcement learning. (English) Zbl 1440.68040 Alpcan, Tansu (ed.) et al., Decision and game theory for security. 10th international conference, GameSec 2019, Stockholm, Sweden, October 30 – November 1, 2019. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 11836, 417-438 (2019). MSC: 68M25 68T05 91A15 91A80 PDF BibTeX XML Cite \textit{D. Sahabandu} et al., Lect. Notes Comput. Sci. 11836, 417--438 (2019; Zbl 1440.68040) Full Text: DOI OpenURL
Li, Yuzhe; Mehr, Aryan Saadat; Chen, Tongwen Multi-sensor transmission power control for remote estimation through a SINR-based communication channel. (English) Zbl 1415.93252 Automatica 101, 78-86 (2019). MSC: 93E10 93A15 90C40 91A15 90B18 PDF BibTeX XML Cite \textit{Y. Li} et al., Automatica 101, 78--86 (2019; Zbl 1415.93252) Full Text: DOI OpenURL
Picheny, Victor; Binois, Mickael; Habbal, Abderrahmane A Bayesian optimization approach to find Nash equilibria. (English) Zbl 1410.91030 J. Glob. Optim. 73, No. 1, 171-192 (2019). MSC: 91A10 91A23 91-04 PDF BibTeX XML Cite \textit{V. Picheny} et al., J. Glob. Optim. 73, No. 1, 171--192 (2019; Zbl 1410.91030) Full Text: DOI arXiv OpenURL
Zhang, Zhen; Wang, Dongqing EAQR: a multiagent Q-learning algorithm for coordination of multiple agents. (English) Zbl 1407.68421 Complexity 2018, Article ID 7172614, 14 p. (2018). MSC: 68T05 68T42 PDF BibTeX XML Cite \textit{Z. Zhang} and \textit{D. Wang}, Complexity 2018, Article ID 7172614, 14 p. (2018; Zbl 1407.68421) Full Text: DOI OpenURL
Bian, Tao; Jiang, Zhong-Ping Stochastic and adaptive optimal control of uncertain interconnected systems: a data-driven approach. (English) Zbl 1390.93720 Syst. Control Lett. 115, 48-54 (2018). MSC: 93E03 93B36 93C05 91A25 90C39 93C40 93B35 PDF BibTeX XML Cite \textit{T. Bian} and \textit{Z.-P. Jiang}, Syst. Control Lett. 115, 48--54 (2018; Zbl 1390.93720) Full Text: DOI OpenURL
Greenwald, Amy; Li, Jiacui; Sodomka, Eric Solving for best responses and equilibria in extensive-form games with reinforcement learning methods. (English) Zbl 1437.91066 Başkent, Can (ed.) et al., Rohit Parikh on logic, language and society. Cham: Springer. Outst. Contrib. Log. 11, 185-226 (2017). MSC: 91A18 91A26 68T05 PDF BibTeX XML Cite \textit{A. Greenwald} et al., Outst. Contrib. Log. 11, 185--226 (2017; Zbl 1437.91066) Full Text: DOI OpenURL
Ding, Kemi; Li, Yuzhe; Quevedo, Daniel E.; Dey, Subhrakanti; Shi, Ling A multi-channel transmission schedule for remote state estimation under DoS attacks. (English) Zbl 1357.93097 Automatica 78, 194-201 (2017). MSC: 93E11 93E10 90B18 91A15 91A05 PDF BibTeX XML Cite \textit{K. Ding} et al., Automatica 78, 194--201 (2017; Zbl 1357.93097) Full Text: DOI Link OpenURL
Dimirovski, Georgi M. Learning intelligent controls in high speed networks: synergies of computational intelligence with control and Q-learning theories. (English) Zbl 1402.68026 Sgurev, Vassil (ed.) et al., Innovative issues in intelligent systems. Cham: Springer (ISBN 978-3-319-27266-5/hbk; 978-3-319-27267-2/ebook). Studies in Computational Intelligence 623, 111-139 (2016). MSC: 68M10 68M20 68T05 PDF BibTeX XML Cite \textit{G. M. Dimirovski}, Stud. Comput. Intell. 623, 111--139 (2016; Zbl 1402.68026) Full Text: DOI OpenURL
Zhang, Qi; Jiao, Peng; Yin, Quanjun; Sun, Lin Coordinated learning by model difference identification in multiagent systems with sparse interactions. (English) Zbl 1410.68358 Discrete Dyn. Nat. Soc. 2016, Article ID 3207460, 17 p. (2016). MSC: 68T42 68T05 PDF BibTeX XML Cite \textit{Q. Zhang} et al., Discrete Dyn. Nat. Soc. 2016, Article ID 3207460, 17 p. (2016; Zbl 1410.68358) Full Text: DOI OpenURL
Lim, Shiau Hong; Xu, Huan; Mannor, Shie Reinforcement learning in robust Markov decision processes. (English) Zbl 1348.68197 Math. Oper. Res. 41, No. 4, 1325-1353 (2016). MSC: 68T05 90C40 PDF BibTeX XML Cite \textit{S. H. Lim} et al., Math. Oper. Res. 41, No. 4, 1325--1353 (2016; Zbl 1348.68197) Full Text: DOI OpenURL
Albrecht, Stefano V.; Crandall, Jacob W.; Ramamoorthy, Subramanian Belief and truth in hypothesised behaviours. (English) Zbl 1352.68259 Artif. Intell. 235, 63-94 (2016). MSC: 68T42 91A26 91A80 PDF BibTeX XML Cite \textit{S. V. Albrecht} et al., Artif. Intell. 235, 63--94 (2016; Zbl 1352.68259) Full Text: DOI arXiv OpenURL
Vamvoudakis, Kyriakos G. Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems. (English) Zbl 1336.91022 Automatica 61, 274-281 (2015). MSC: 91A23 91A06 91A10 68T05 91A26 93C40 PDF BibTeX XML Cite \textit{K. G. Vamvoudakis}, Automatica 61, 274--281 (2015; Zbl 1336.91022) Full Text: DOI OpenURL
Chen, Wenlin; Chen, Yixin; Levine, David K. A unifying learning framework for building artificial game-playing agents. (English) Zbl 1329.68209 Ann. Math. Artif. Intell. 73, No. 3-4, 335-358 (2015). MSC: 68T05 68T42 91A80 PDF BibTeX XML Cite \textit{W. Chen} et al., Ann. Math. Artif. Intell. 73, No. 3--4, 335--358 (2015; Zbl 1329.68209) Full Text: DOI OpenURL
Tharakunnel, Kurian; Bhattacharyya, Siddhartha Single-leader-multiple-follower games with boundedly rational agents. (English) Zbl 1170.91306 J. Econ. Dyn. Control 33, No. 8, 1593-1603 (2009). MSC: 91A10 PDF BibTeX XML Cite \textit{K. Tharakunnel} and \textit{S. Bhattacharyya}, J. Econ. Dyn. Control 33, No. 8, 1593--1603 (2009; Zbl 1170.91306) Full Text: DOI OpenURL
van Eck, Nees Jan; van Wezel, Michiel Application of reinforcement learning to the game of Othello. (English) Zbl 1139.90030 Comput. Oper. Res. 35, No. 6, 1999-2017 (2008). MSC: 90C39 90C40 92B20 91A80 PDF BibTeX XML Cite \textit{N. J. van Eck} and \textit{M. van Wezel}, Comput. Oper. Res. 35, No. 6, 1999--2017 (2008; Zbl 1139.90030) Full Text: DOI OpenURL
Mannor, Shie; Shamma, Jeff S. Multi-agent learning for engineers. (English) Zbl 1168.68477 Artif. Intell. 171, No. 7, 417-422 (2007). MSC: 68T05 91A26 PDF BibTeX XML Cite \textit{S. Mannor} and \textit{J. S. Shamma}, Artif. Intell. 171, No. 7, 417--422 (2007; Zbl 1168.68477) Full Text: DOI OpenURL
Sandholm, Tuomas Perspectives on multiagent learning. (English) Zbl 1168.68492 Artif. Intell. 171, No. 7, 382-391 (2007). MSC: 68T05 91A26 PDF BibTeX XML Cite \textit{T. Sandholm}, Artif. Intell. 171, No. 7, 382--391 (2007; Zbl 1168.68492) Full Text: DOI OpenURL
Shoham, Yoav; Powers, Rob; Grenager, Trond If multi-agent learning is the answer, what is the question? (English) Zbl 1168.68493 Artif. Intell. 171, No. 7, 365-377 (2007). MSC: 68T05 91A26 PDF BibTeX XML Cite \textit{Y. Shoham} et al., Artif. Intell. 171, No. 7, 365--377 (2007; Zbl 1168.68493) Full Text: DOI Link OpenURL