Russo, Daniel; Van Roy, Benjamin. Satisficing in time-sensitive bandit learning. (English) Zbl 07639653. Math. Oper. Res. 47, No. 4, 2815-2839 (2022). MSC: 68T05 62C10
Fudenberg, Drew; He, Kevin. Player-compatible learning and player-compatible equilibrium. (English) Zbl 1461.91055. J. Econ. Theory 194, Article ID 105238, 39 p. (2021). MSC: 91A26