zbMATH — the first resource for mathematics

Shape constraints in economics and operations research. (English) Zbl 1407.62424
Summary: Shape constraints, motivated by either application-specific assumptions or existing theory, can be imposed during model estimation to restrict the feasible region of the parameters. Although such restrictions may not provide any benefits in an asymptotic analysis, they often improve finite sample performance of statistical estimators and the computational efficiency of finding near-optimal control policies. This paper briefly reviews an illustrative set of research utilizing shape constraints in the economics and operations research literature. We highlight the methodological innovations and applications, with a particular emphasis on utility functions, production economics and sequential decision making applications.
62P20 Applications of statistics to economics
62G08 Nonparametric regression and quantile regression
62H12 Estimation in multivariate analysis
62L12 Sequential estimation
simest; scar
Full Text: DOI Euclid
[1] Ackerberg, D. A., Caves, K. and Frazer, G. (2015). Identification properties of recent production function estimators. Econometrica83 2411–2451.
[2] Afriat, S. N. (1967). The construction of utility functions from expenditure data. Internat. Econom. Rev.8 67–77. · Zbl 0248.90001
[3] Afriat, S. N. (1972). Efficiency estimation of production functions. Internat. Econom. Rev.13 568–598. · Zbl 0264.90022
[4] Aït-Sahalia, Y. and Duarte, J. (2003). Nonparametric option pricing under shape restrictions. J. Econometrics116 9–47. · Zbl 1016.62121
[5] Alizadeh, F. (2006). Semidefinite and second-order cone programming and their application to shape-constrained regression and density estimation. In Models, Methods, and Applications for Innovative Decision Making 37–65.
[6] Allon, G., Beenstock, M., Hackman, S., Passy, U. and Shapiro, A. (2007). Nonparametric estimation of concave production technologies by entropic methods. J. Appl. Econometrics22 795–816.
[7] Asamov, T. and Powell, W. B. (2018). Regularized decomposition of high-dimensional multistage stochastic programs with Markov uncertainty. SIAM J. Optim.28 575–595. · Zbl 1395.90190
[8] Asamov, T., Salas, D. F. and Powell, W. B. (2016). SDDP vs. ADP: The effect of dimensionality in multistage stochastic optimization for grid level energy storage. Preprint. Available at arXiv:1605.01521.
[9] Aviv, Y. and Federgruen, A. (2001). Capacitated multi-item inventory systems with random and seasonally fluctuating demands: Implications for postponement strategies. Manage. Sci.47 512–531. · Zbl 1232.90010
[10] Balabdaoui, F., Durot, C. and Jankowski, H. (2016). Least squares estimation in the monotone single index model. Preprint. Available at arXiv:1610.06026.
[11] Balabdaoui, F., Groeneboom, P. and Hendrickx, K. (2017). Score estimation in the monotone single index model. Preprint. Available at arXiv:1712.05593.
[12] Banker, R. D. and Maindiratta, A. (1992). Maximum likelihood estimation of monotone and concave production frontiers. J. Product. Anal.3 401–415.
[13] Barlow, R. E., Bartholomew, D. J., Bremner, J. M. and Brunk, H. D. (1972). Statistical Inference Under Order Restrictions. The Theory and Application of Isotonic Regression. Wiley, London. · Zbl 0246.62038
[14] Beresteanu, A. (2005). Nonparametric analysis of cost complementarities in the telecommunications industry. Rand J. Econ.36 870–889.
[15] Beresteanu, A. et al. (2007). Nonparametric estimation of regression functions under restrictions on partial derivatives. Preprint. Available at http://www.econ.duke.edu/arie/shape.pdf.
[16] Bertsekas, D. P. (1999a). Nonlinear Programming, 2nd ed. Athena Scientific Optimization and Computation Series. Athena Scientific, Belmont, MA. · Zbl 1015.90077
[17] Bertsekas, D. P. (2012). Dynamic Programming and Optimal Control. Vol. II. Approximate Dynamic Programming, 4th ed. Athena Scientific, Belmont, MA. · Zbl 1298.90001
[18] Bertsekas, D. P. and Tsitsiklis, J. N. (1996). Neuro-Dynamic Programming, Athena Scientific, Belmont, MA. · Zbl 0924.68163
[19] Bhat, N., Farias, V. and Moallemi, C. C. (2012). Non-parametric approximate dynamic programming via the kernel method. Adv. Neural Inf. Process. Syst.25 386–394.
[20] Birge, J. R. and Louveaux, F. (2011). Introduction to Stochastic Programming, 2nd ed. Springer Series in Operations Research and Financial Engineering. Springer, New York. · Zbl 1223.90001
[21] Blundell, R. W., Browning, M. and Crawford, I. A. (2003). Nonparametric engel curves and revealed preference. Econometrica71 205–240. · Zbl 1130.62329
[22] Blundell, R., Browning, M. and Crawford, I. (2008). Best nonparametric bounds on demand responses. Econometrica76 1227–1262. · Zbl 1155.91412
[23] Blundell, R., Horowitz, L. and Parey, M. (2012). Measuring the price responsiveness of gasoline demand: Economic shape restrictions and nonparametric demand estimation. Quantitative Economics3 29–51. · Zbl 1419.91430
[24] Blundell, R., Kristensen, D. and Matzkin, R. (2014). Bounding quantile demand functions using revealed preference inequalities. J. Econometrics179 112–127. · Zbl 1298.91101
[25] Breiman, L. (1996). Bagging predictors. Mach. Learn.24 123–140. · Zbl 0858.68080
[26] Breiman, L. (2000). Randomizing outputs to increase prediction accuracy. Mach. Learn.40 229–242. · Zbl 0962.68143
[27] Brown, D. J. and Matzkin, R. L. (1996). Testable restrictions on the equilibrium manifold. Econometrica64 1249–1262. · Zbl 0862.90028
[28] Brunk, H. D. (1970). Estimation of isotonic regression. In Nonparametric Techniques in Statistical Inference 177–197. Cambridge Univ. Press, London.
[29] Chambers, R. G. (1988). Applied Production Analysis: A Dual Approach. Cambridge Univ. Press, Cambridge.
[30] Chambers, P. and Echenique, C. (2017). Revealed Preference Theory, 2nd ed. Econometric Society Monographs. Cambridge University Press, Cambridge. · Zbl 1396.91105
[31] Chatterjee, S., Guntuboyina, A. and Sen, B. (2015). On risk bounds in isotonic and other shape restricted regression problems. Ann. Statist.43 1774–1800. · Zbl 1317.62032
[32] Chen, Z. L. and Powell, W. B. (1999). Convergent cutting-plane and partial-sampling algorithm for multistage stochastic linear programs with recourse. J. Optim. Theory Appl.102 497–524. · Zbl 0955.90096
[33] Chen, Y. and Samworth, R. J. (2016). Generalized additive and index models with shape constraints. J. R. Stat. Soc. Ser. B. Stat. Methodol.78 729–754.
[34] Cherchye, L., De Rock, B. and Vermeulen, F. (2007). The collective model of household consumption: A nonparametric characterization. Econometrica75 553–574. · Zbl 1132.91551
[35] Chernozhukov, V., Newey, W. K. and Santos, A. (2015). Constrained conditional moment restriction models. Preprint. Available at arXiv:1509.06311.
[36] Chetverikov, D., Santos, A. and Shaikh, A. M. (2018). The econometrics of shape restrictions. Ann. Rev. Econ.10. (In press.)
[37] Cheung, R. K.-M. and Powell, W. B. (2000). SHAPE—a stochastic hybrid approximation procedure for two-stage stochastic programs. Oper. Res.48 73–79. · Zbl 1106.90361
[38] Clark, A. J. and Scarf, H. (1960). Optimal policies for a multi-echelon inventory problem. Manage. Sci.6 475–490.
[39] Cosaert, S. and Demuynck, T. (2018). Nonparametric welfare and demand analysis with unobserved individual heterogeneity. Rev. Econ. Stat.100 349–361.
[40] de Farias, D. P. and Van Roy, B. (2000). On the existence of fixed points for approximate value iteration and temporal-difference learning. J. Optim. Theory Appl.105 589–608. · Zbl 1028.90077
[41] de Farias, D. P. and Van Roy, B. (2003). The linear programming approach to approximate dynamic programming. Oper. Res.51 850–865. · Zbl 1165.90666
[42] De Farias, D. P. and Van Roy, B. (2004). On constraint sampling in the linear programming approach to approximate dynamic programming. Math. Oper. Res.29 462–478. · Zbl 1082.90124
[43] Desai, V. V., Farias, V. F. and Moallemi, C. C. (2012a). Approximate dynamic programming via a smoothed linear program. Oper. Res.60 655–674. · Zbl 1251.90379
[44] Desai, V. V., Farias, V. F. and Moallemi, C. C. (2012b). Pathwise optimization for optimal stopping problems. Manage. Sci.1996 1–17.
[45] Dette, H., Hoderlein, S. and Neumeyer, N. (2016). Testing multivariate economic restrictions using quantiles: The example of Slutsky negative semidefiniteness. J. Econometrics191 129–144. · Zbl 1390.62071
[46] Diewert, W. E. (1973). Afriat and revealed preference theory. Rev. Econ. Stud.40 419–425. · Zbl 0268.90007
[47] Diewert, W. E. (1974). Applications of duality theory. In Frontiers of Quantitative Economics, Vol. II (M. Intriligator and D. Kendrick, eds.) 106–171. North-Holland, Amsterdam. · Zbl 0331.90051
[48] Diewert, W. E. and Wales, T. J. (1987). Flexible functional forms and global curvature conditions. Econometrica55 43–68. · Zbl 0619.90002
[49] Donohue, C. J. and Birge, J. R. (2006). The abridged nested decomposition method for multistage stochastic linear programs with relatively complete recourse. Algorithmic Oper. Res.1 20–30. · Zbl 1148.90336
[50] Du, P., Parmeter, C. F. and Racine, J. S. (2013). Nonparametric kernel regression with multiple predictors and multiple shape constraints. Statist. Sinica23 1347–1371. · Zbl 06202710
[51] Dykstra, R. L. (1983). An algorithm for restricted least squares regression. J. Amer. Statist. Assoc.78 837–842. · Zbl 0535.62063
[52] Epstein, G. and Yatchew, J. (1985). Nonparametric hypothesis testing procedures and applications to demand analysis. J. Econometrics30 149–69. · Zbl 0626.62045
[53] Evans, R. V. (1967). Inventory control of a multiproduct system with a limited production resource. Naval Res. Logist.14 173–184. · Zbl 0152.38201
[54] Florens, J. P., Racine, J. S. and Centorrino, S. (2018). Nonparametric instrumental variable derivative estimation. J. Nonparametr. Stat.30 368–391. · Zbl 1401.62058
[55] Førsund, F. R. and Hjalmarsson, L. (2004). Are all scales optimal in dea? Theory and empirical evidence. J. Product. Anal.21 25–48.
[56] Frisch, R. (1964). Theory of Production. Springer, New York.
[57] Gallant, A. R. (1981). On the bias in flexible functional forms and an essentially unbiased form: The Fourier flexible form. J. Econometrics15 211–245. · Zbl 0454.62096
[58] Gallant, A. R. and Golub, G. H. (1984). Imposing curvature restrictions on flexible functional forms. J. Econometrics26 295–321. · Zbl 0573.62098
[59] Geramifard, A., Walsh, T. J., Tellex, S., Chowdhary, G., Roy, N. and How, J. P. (2013). A tutorial on linear function approximators for dynamic programming and reinforcement learning. Found. Trends Mach. Learn.6 375–454. · Zbl 1311.90169
[60] Godfrey, G. A. and Powell, W. B. (2001). An adaptive, distribution-free algorithm for the newsvendor problem with censored demands, with applications to inventory and distribution. Manage. Sci.47 1101–1112. · Zbl 1232.90053
[61] Green, P. J. (1995). Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika82 711–732. · Zbl 0861.62023
[62] Griliches, Z. and Mairesse, J. (1995). Production functions: The search for identification. Technical report, National Bureau of Economic Research, Cambridge, MA. · Zbl 0972.91063
[63] Groeneboom, P., Jongbloed, G. and Wellner, J. A. (2001). Estimation of a convex function: Characterizations and asymptotic theory. Ann. Statist.29 1653–1698. · Zbl 1043.62027
[64] Guntuboyina, A. and Sen, B. (2013). Covering numbers for convex functions. IEEE Trans. Inform. Theory59 1957–1965. · Zbl 1364.52007
[65] Guntuboyina, A. and Sen, B. (2015). Global risk bounds and adaptation in univariate convex regression. Probab. Theory Related Fields163 379–411. · Zbl 1327.62255
[66] Hall, P. and Huang, L.-S. (2001). Nonparametric kernel regression subject to monotonicity constraints. Ann. Statist.29 624–647. · Zbl 1012.62030
[67] Hannah, L. A. and Dunson, D. B. (2011). Bayesian nonparametric multivariate convex regression. Preprint. Available at arXiv:1109.0322.
[68] Hannah, L. and Dunson, D. (2012). Ensemble methods for convex regression with applications to geometric programming based circuit design. Preprint. Available at arXiv:1206.4645.
[69] Hannah, L. A. and Dunson, D. B. (2013). Multivariate convex regression with adaptive partitioning. J. Mach. Learn. Res.14 3261–3294. · Zbl 1318.62225
[70] Hausman, J. A. and Newey, W. K. (2016). Individual heterogeneity and average welfare. Econometrica84 1225–1248.
[71] Higle, J. L. and Sen, S. (1991). Stochastic decomposition: An algorithm for two-stage linear programs with recourse. Math. Oper. Res.16 650–669. · Zbl 0746.90045
[72] Higle, J. L. and Sen, S. (1996). Stopping rules for stochastic decomposition. In Stochastic Decomposition 131–164. Springer, Berlin. · Zbl 0874.90145
[73] Hildreth, C. (1954). Point estimates of ordinates of concave functions. J. Amer. Statist. Assoc.49 598–619. · Zbl 0056.38301
[74] Hoderlein, S. and Stoye, J. (2014). Revealed preferences in a heterogeneous population. Rev. Econ. Stat.96 197–213.
[75] Huh, W. T. and Rusmevichientong, P. (2009). A nonparametric asymptotic analysis of inventory planning with censored demand. Math. Oper. Res.34 103–123. · Zbl 1231.90038
[76] Hwangbo, H., Johnson, A. L. and Ding, Y. (2015). Power curve estimation: Functional estimation imposing the regular ultra passum law. SSRN Working paper. Available at: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2621033.
[77] Jiang, D. R. and Powell, W. B. (2015). An approximate dynamic programming algorithm for monotone value functions. Oper. Res.63 1489–1511. · Zbl 1334.90194
[78] Juditsky, A. and Nemirovski, A. (2002). On nonparametric tests of positivity/monotonicity/convexity. Ann. Statist.30 498–527. · Zbl 1012.62048
[79] Kamien, M. I. and Schwartz, N. L. (1981). Dynamic Optimization. North-Holland, New York–Amsterdam. · Zbl 0455.49002
[80] Kuchibhotla, A. K., Patra, R. K. and Sen, B. (2017). Efficient estimation in convex single index models. Preprint. Available at arXiv:1708.00145.
[81] Kunnumkal, S. and Topaloglu, H. (2008a). Exploiting the structural properties of the underlying Markov decision problem in the Q-learning algorithm. INFORMS J. Comput.20 288–301. · Zbl 1243.90235
[82] Kunnumkal, S. and Topaloglu, H. (2008b). Using stochastic approximation methods to compute optimal base-stock levels in inventory control problems. Oper. Res.56 646–664. · Zbl 1167.90337
[83] Kuosmanen, T. (2008). Representation theorem for convex nonparametric least squares. Econom. J.11 308–325. · Zbl 1141.91640
[84] Kuosmanen, T. and Johnson, A. (2017). Modeling joint production of multiple outputs in stoned: Directional distance function approach. European J. Oper. Res.262 792–801.
[85] Kurt, M. and Kharoufeh, J. P. (2010). Monotone optimal replacement policies for a Markovian deteriorating system in a controllable environment. Oper. Res. Lett.38 273–279. · Zbl 1195.90093
[86] Kurt, M. and Maillart, L. M. (2009). Structured replacement policies for a Markov-modulated shock model. Oper. Res. Lett.37 280–284. · Zbl 1167.90470
[87] Kushner, H. J. and Yin, G. G. (2003). Stochastic Approximation and Recursive Algorithms and Applications, 35. Springer, New York. · Zbl 1026.62084
[88] Lee, C.-Y., Johnson, A. L., Moreno-Centeno, E. and Kuosmanen, T. (2013). A more efficient algorithm for convex nonparametric least squares. European J. Oper. Res.227 391–400. · Zbl 1292.90234
[89] Lewbel, A. (2001). Demand systems with and without errors. Am. Econ. Rev.91 611–618.
[90] Li, Q. and Racine, J. S. (2007). Nonparametric Econometrics: Theory and Practice. Princeton Univ. Press, Princeton, NJ. · Zbl 1183.62200
[91] Lim, E. (2014). On convergence rates of convex regression in multiple dimensions. INFORMS J. Comput.26 616–628. · Zbl 1304.62057
[92] Lim, E. and Glynn, P. W. (2012). Consistency of multidimensional convex regression. Oper. Res.60 196–208. · Zbl 1342.62064
[93] Linowsky, K. and Philpott, A. B. (2005). On the convergence of sampling-based decomposition algorithms for multistage stochastic programs. J. Optim. Theory Appl.125 349–366. · Zbl 1071.90029
[94] Löhndorf, N., Wozabal, D. and Minner, S. (2013). Optimizing trading decisions for hydro storage systems using approximate dual dynamic programming. Oper. Res.61 810–823. · Zbl 1291.90125
[95] Luo, Y. and Lim, E. (2016). On consistency of absolute deviations estimators of convex functions. Int. J. Stat. Probab.5 1.
[96] Maceira, M. E. P., Marzano, L. G. B., Penna, D. D. J., Diniz, A. L. and Justino, T. C. (2015). Application of CVaR risk aversion approach in the expansion and operation planning and for setting the spot price in the Brazilian hydrothermal interconnected system. Int. J. Electr. Power Energy Syst.72 126–135.
[97] Magnani, A. and Boyd, S. P. (2009). Convex piecewise-linear fitting. Optim. Eng.10 1–17. · Zbl 1273.65086
[98] Mak, W.-K., Morton, D. P. and Wood, R. K. (1999). Monte Carlo bounding techniques for determining solution quality in stochastic programs. Oper. Res. Lett.24 47–56. · Zbl 0956.90022
[99] Mammen, E. (1991). Nonparametric regression under qualitative smoothness assumptions. Ann. Statist.19 741–759. · Zbl 0737.62039
[100] Matzkin, R. L. (1991). Semiparametric estimation of monotone and concave utility functions for polychotomous choice models. Econometrica59 1315–1327. · Zbl 0781.90009
[101] Matzkin, R. L. (1994). Restrictions of economic theory in nonparametric methods. In Handbook of Econometrics, Vol. IV. Handbooks in Econom.2 2523–2558. North-Holland, Amsterdam.
[102] Mazumder, R., Choudhury, A., Iyengar, G. and Sen, B. (2015). A computational framework for multivariate convex regression and its variants. Preprint. Available at arXiv:1509.08165.
[103] Michaelides, P. G., Tsionas, E. G., Vouldis, A. T. and Konstantakis, K. N. (2015). Global approximation to arbitrary cost functions: A Bayesian approach with application to US banking. European J. Oper. Res.241 148–160. · Zbl 1338.62192
[104] Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A., Veness, J., Bellemare, M. G., Graves, A., Riedmiller, M., Fidjeland, A. K., Ostrovski, G. and Petersen, S. (2015). Human-level control through deep reinforcement learning. Nature518 529–533.
[105] Mundlak, Y. (1963). Specification and estimation of multiproduct production functions. J. Farm Econ.45 433–443.
[106] Nascimento, J. M. and Powell, W. B. (2009). An optimal approximate dynamic programming algorithm for the lagged asset acquisition problem. Math. Oper. Res.34 210–237. · Zbl 1217.93189
[107] Nascimento, J. M. and Powell, W. B. (2010). Dynamic programming models and algorithms for the mutual fund cash balance problem. Manage. Sci.56 801–815. · Zbl 1232.90341
[108] Nascimento, J. and Powell, W. B. (2013). An optimal approximate dynamic programming algorithm for concave, scalar storage problems with vector-valued controls. IEEE Trans. Automat. Control58 2995–3010. · Zbl 1369.49035
[109] Neave, E. H. (1970). The stochastic cash balance problem with fixed costs for increases and decreases. Manage. Sci.16 472–490. · Zbl 0193.19604
[110] Ormoneit, D. and Sen, Å. (2002). Kernel-based reinforcement learning. Mach. Learn.49 161–178. · Zbl 1014.68069
[111] Papadaki, K. P. and Powell, W. B. (2002). Exploiting structure in adaptive dynamic programming algorithms for a stochastic batch service problem. European J. Oper. Res.142 108–127. · Zbl 1081.90605
[112] Pereira, M. V. F. and Pinto, L. M. V. G. (1991). Multi-stage stochastic optimization applied to energy planning. Math. Program.52 359–375. · Zbl 0749.90057
[113] Perloff, J. M. (2018). Microeconomics, 8th ed. Pearson, Boston.
[114] Philpott, A. B. and de Matos, V. L. (2012). Dynamic sampling algorithms for multi-stage stochastic programs with risk aversion. European J. Oper. Res.218 470–483. · Zbl 1244.90175
[115] Philpott, A., de Matos, V. and Finardi, E. (2013). On solving multistage stochastic programs with coherent risk measures. Oper. Res.61 957–970. · Zbl 1291.90152
[116] Philpott, A. B. and Guan, Z. (2008). On the convergence of stochastic dual dynamic programming and related methods. Oper. Res. Lett.36 450–455. · Zbl 1155.90437
[117] Pierskalla, W. P. and Voelker, J. A. (1976). A survey of maintenance models: The control and surveillance of deteriorating systems. Nav. Res. Logist. Q.23 353–388. · Zbl 0358.90028
[118] Porteus, E. L. (2002). Foundations of Stochastic Inventory Theory. Stanford Univ. Press, Stanford, CA.
[119] Powell, W. B. (2011). Approximate Dynamic Programming: Solving the Curses of Dimensionality, 2nd ed. Wiley, Hoboken, NJ. · Zbl 1242.90002
[120] Powell, W., Ruszczyński, A. and Topaloglu, H. (2004). Learning algorithms for separable approximations of discrete stochastic optimization problems. Math. Oper. Res.29 814–836. · Zbl 1082.90079
[121] Pritchard, G., Philpott, A. B. and Neame, P. J. (2005). Hydroelectric reservoir optimization in a pool market. Math. Program.103 445–461. · Zbl 1099.90070
[122] Puterman, M. L. (1994). Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York. · Zbl 0829.90134
[123] Pya, N. and Wood, S. N. (2015). Shape constrained additive models. Stat. Comput.25 543–559. · Zbl 1331.62367
[124] Robertson, T., Wright, F. T. and Dykstra, R. L. (1988). Order Restricted Statistical Inference. Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. Wiley, Chichester. · Zbl 0645.62028
[125] Rust, J. (1987). Optimal replacement of GMC bus engines: An empirical model of Harold Zurcher. Econometrica55 999–1033. · Zbl 0624.90034
[126] Ryan, D. L. and Wales, T. J. (2000). Imposing local concavity in the translog and generalized leontief cost functions. Econom. Lett.67 253–260. · Zbl 0953.91021
[127] Samuelson, P. A. (1938). A note on the pure theory of consumer’s behaviour. Economica5 61–71.
[128] Samuelson, P. A. (1950). The problem of integrability in utility theory. Economica17 355–385.
[129] Sarath, B. and Maindiratta, A. (1997). On the consistency of maximum likelihood estimation of monotone and concave production frontiers. J. Product. Anal.8 239–246.
[130] Scarf, H. (1960). The optimality of \((S,s)\) policies in the dynamic inventory problem. In Mathematical Methods in the Social Sciences, 1959 196–202. Stanford Univ. Press, Stanford, CA.
[131] Seijo, E. and Sen, B. (2011). Nonparametric least squares estimation of a multivariate convex regression function. Ann. Statist.39 1633–1657. · Zbl 1220.62044
[132] Sen, S. and Zhou, Z. (2014). Multistage stochastic decomposition: A bridge between stochastic programming and approximate dynamic programming. SIAM J. Optim.24 127–153. · Zbl 1291.90153
[133] Shapiro, A. (2011). Analysis of stochastic dual dynamic programming method. European J. Oper. Res.209 63–72. · Zbl 1208.90126
[134] Shapiro, A., Tekaya, W., da Costa, J. P. and Soares, M. P. (2013). Risk neutral and risk averse stochastic dual dynamic programming method. European J. Oper. Res.224 375–391. · Zbl 1292.90219
[135] Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T., Leach, M., Kavukcuoglu, K., Graepel, T. and Hassabis, D. (2016). Mastering the game of go with deep neural networks and tree search. Nature529 484–489.
[136] Stokey, N. L., Lucas, R. E. Jr. and Prescott, E. C. Jr. (1989). Recursive Methods in Economic Dynamics. Harvard Univ. Press, Cambridge, MA.
[137] Sutton, R. S. and Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA.
[138] Sutton, R. S., McAllester, D. A., Singh, S. P. and Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems 1057–1063.
[139] Tripathi, G. (2000). Local semiparametric efficiency bounds under shape restrictions. Econometric Theory16 729–739. · Zbl 0967.62034
[140] Tsitsiklis, J. N. and Roy, B. V. (1996). Feature-based methods for large scale dynamic programming. Mach. Learn.22 59–94. · Zbl 0843.68092
[141] Tsitsiklis, J. N. and Van Roy, B. (1999). Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives. IEEE Trans. Automat. Control44 1840–1851. · Zbl 0958.60042
[142] Varian, H. R. (1982). The nonparametric approach to demand analysis. Econometrica50 945–973. · Zbl 0483.90006
[143] Varian, H. R. (1984). The nonparametric approach to production analysis. Econometrica52 579–597. · Zbl 0558.90024
[144] Varian, R. (1985). Nonparametric analysis of optimizing behavior with measurement error. J. Econometrics30 445–58.
[145] Varian, H. R. (1992). Microeconomic Analysis. Norton, New York.
[146] Villalobos, M. and Wahba, G. (1987). Inequality-constrained multivariate smoothing splines with application to the estimation of posterior probabilities. J. Amer. Statist. Assoc.82 239–248. · Zbl 0614.62047
[147] Watkins, C. J. and Dayan, P. (1992). Q-learning. Mach. Learn.8 279–292. · Zbl 0773.68062
[148] Wu, C.-F. (1982). Some algorithms for concave and isotonic regression. In Optimization in Statistics. Stud. Management Sci.19 105–116. North-Holland, Amsterdam.
[149] Wu, X. and Sickles, R. (2018). Semiparametric estimation under shape constraints. Econ. Stat.6 74–89.
[150] Yagi, D., Chen, Y., Johnson, A. L. and Kuosmanen, T. (2018a). Shape constrained kernel-weighted least squares: Application to production function estimation for Chilean manufacturing industries. J. Bus. Econom. Statist. Working Paper. Available at: https://arxiv.org/abs/1604.06003.
[151] Yatchew, A. (2003). Semiparametric Regression for the Applied Econometrician. Cambridge Univ. Press, Cambridge. · Zbl 1067.62041
[152] Zhang, H.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.