Identifying latent grouped patterns in panel data models with interactive fixed effects. (English) Zbl 1452.62960

Summary: We consider the estimation of latent grouped patterns in dynamic panel data models with interactive fixed effects. We assume that the individual slope coefficients are homogeneous within a group and heterogeneous across groups but each individual’s group membership is unknown to the researcher. We consider penalized principal component (PPC) estimation by extending the penalized-profile-likelihood-based C-Lasso of the first author et al. [Econometrica 84, No. 6, 2215–2264 (2016; Zbl 1410.62110)] to panel data models with cross section dependence. Given the correct number of groups, we show that the C-Lasso can achieve simultaneous classification and estimation in a single step and exhibit the desirable property of uniform classification consistency. The C-Lasso-based PPC estimators of the group-specific parameters also have the oracle property. BIC-type information criteria are proposed to choose the numbers of factors and groups consistently and to select the data-driven tuning parameter. Simulations are conducted to demonstrate the finite-sample performance of the proposed method. We apply our C-Lasso to study the persistence of housing prices in China’s large and medium-sized cities in the last decade and identify three groups.


62P20 Applications of statistics to economics
62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)
62H30 Classification and discrimination; cluster analysis (statistical aspects)
62J07 Ridge regression; shrinkage estimators (Lasso)


Zbl 1410.62110
Full Text: DOI Link


[1] Amemiya, T., Advanced econometrics, (1985), Harvard University Press Cambridge
[2] Bai, J., Inferential theory for factor models of large dimensions, Econometrica, 71, 135-171, (2003) · Zbl 1136.62354
[3] Bai, J., Panel data models with interactive fixed effects, Econometrica, 77, 1229-1279, (2009) · Zbl 1183.62196
[4] Bai, J.; Li, K., Theory and methods of panel data models with interactive effects, Ann. Statist., 42, 142-170, (2014) · Zbl 1331.62112
[5] Bai, J.; Ng, S., Determining the number of factors in approximate factor models, Econometrica, 70, 191-221, (2002) · Zbl 1103.91399
[6] Bai, C.; Li, Q.; Ouyang, M., Property taxes and home prices: A tale of two cities, J. Econometrics, 180, 1-15, (2014) · Zbl 1298.91122
[7] Baltagi, B. H.; Bresson, G.; Pirotte, A., To pool or not to pool?, (Mátyás, L.; Sevestre, P., The Econometrics of Panel Data: Fundamentals and Recent Developments in Theory and Practice, (2008), Springer-Verlag Berlin), 517-546
[8] Bertsekas, D. P., Nonlinear programming, (1995), Athena Scientific Belmont, MA · Zbl 0935.90037
[9] Bester, C. A.; Hansen, C. B., Grouped effects estimators in fixed effects models, J. Econometrics, 190, 197-208, (2016) · Zbl 1419.62501
[10] Bonhomme, S.; Manresa, E., Grouped patterns of heterogeneity in panel data, Econometrica, 83, 1147-1187, (2015)
[11] Chudik, A.; Pesaran, M. H., Common correlated effects estimation of heterogeneous dynamic panel data models with weakly exogenous regressors, J. Econometrics, 188, 393-420, (2015) · Zbl 1337.62354
[12] Chudik, A.; Pesaran, M. H.; Tosetti, E., Weak and strong cross section dependence and estimation of large panels, Econom. J., 14, C45-C90, (2011) · Zbl 1218.62097
[13] Fan, J.; Peng, H., Nonconcave penalized likelihood with a diverging number of parameters, Ann. Statist., 32, 928-961, (2004) · Zbl 1092.62031
[14] Greenaway-McGrevy, R.; Han, C.; Sul, D., Asymptotic distribution of factor augmented estimators for panel regression, J. Econometrics, 169, 48-53, (2012) · Zbl 1443.62452
[15] Hahn, J.; Kuersteiner, G., Bias reduction for dynamic nonlinear panel models with fixed effects, Econometric Theory, 27, 1152-1191, (2011) · Zbl 1442.62739
[16] Hsiao, C., Analysis of panel data, (2014), Cambridge University Press Cambridge · Zbl 1320.62003
[17] Hsiao, C.; Pesaran, M. H., Random coefficient panel data models, (Matyas, L.; Sevestre, P., The Econometrics of Panel Data: Fundamentals and Recent Developments in Theory and Practice, (2008), Springer-Verlag Berlin), 187-216
[18] Jin, S.; Su, L., A nonparametric poolability test for panel data models with cross section dependence, Econometric Rev., 32, 469-512, (2013)
[19] Kapetanios, G.; Pesaran, M. H., Alternative approaches to estimation and inference in large multifactor panels: small sample results with an application to modeling of asset returns, (Phillips, G.; Tzavalis, E., The Refinement of Econometric Estimation and Test Procedures: Finite Sample and Asymptotic Analysis, (2007), Cambridge University Press New York), 239-281
[20] Kapetanios, G.; Pesaran, M. H.; Yagamata, T., Panels with non-stationary multifactor error structures, J. Econometrics, 160, 326-348, (2011) · Zbl 1441.62767
[21] Lam, C.; Fan, J., Profile-kernel likelihood inference with diverging number of parameters, Ann. Statist., 36, 2232-2260, (2008) · Zbl 1274.62289
[22] Li, K.; Lu, L., Efficient estimation of heterogeneous coefficients in panel data models with common shock. working paper, (2014), Capital University of Economics and Business
[23] Li, D.; Qian, J.; Su, L., Panel data models with interactive fixed effects and multiple structural breaks, J. Amer. Statist. Assoc., 111, 1804-1819, (2016)
[24] Liao, Z., Adaptive GMM shrinkage estimation with consistent moment selection, Econometric Theory, 29, 857-904, (2013) · Zbl 1290.62052
[25] Lin, C.-C.; Ng, S., Estimation of panel data models with parameter heterogeneity when group membership is unknown, J. Econ. Methods, 1, 42-55, (2012) · Zbl 1279.62224
[26] Lu, X.; Su, L., Shrinkage estimation of dynamic panel data models with interactive fixed effects, J. Econometrics, 190, 148-175, (2016) · Zbl 1419.62516
[27] Moon, H. R.; Weidner, M., Linear regression for panel with unknown number of factors as interactive fixed effects, Econometrica, 83, 1543-1579, (2015)
[28] Moon, H. R.; Weidner, M., Dynamic linear panel data regression models with interactive fixed effects, Econometric Theory, 33, 158-195, (2017) · Zbl 1441.62816
[29] Newey, W.; McFadden, D., Large sample estimation and hypothesis testing, (Engle, R. F.; McFadden, D., Handbook of Econometrics, (1994), North Holland Amsterdam), 2111-2245
[30] Pesaran, M. H., Estimation and inference in large heterogeneous panels with a multifactor error structure, Econometrica, 74, 967-1012, (2006) · Zbl 1152.91718
[31] Pesaran, M. H.; Tosetti, E., Large panels with common factors and spatial correlation, J. Econometrics, 161, 182-202, (2011) · Zbl 1441.62838
[32] Pesaran, M. H.; Yamagata, T., Testing slope homogeneity in large panels, J. Econometrics, 142, 50-93, (2008) · Zbl 1418.62530
[33] Prakasa Rao, B. L.S., Conditional independence, conditional mixing and conditional association, Ann. Inst. Statist. Math., 61, 441-460, (2009) · Zbl 1314.60054
[34] Sarafidis, V.; Weber, N., A partially heterogeneous framework for analyzing panel data, Oxf. Bull. Econ. Stat., 77, 274-296, (2015)
[35] Song, M., Asymptotic theory for dynamic heterogeneous panels with cross-sectional dependence and its applications. working paper, (2013), Columbia University
[36] Su, L.; Chen, Q., Testing homogeneity in panel data models with interactive fixed effects, Econometric Theory, 29, 1079-1135, (2013) · Zbl 1290.62088
[37] Su, L.; Jin, S., Sieve estimation of panel data models with cross section dependence, J. Econometrics, 169, 34-47, (2012) · Zbl 1443.62508
[38] Su, L.; Wang, X., On time-varying factor models: estimation and testing, J. Econometrics, 198, 84-101, (2017)
[39] Su, L., Zhang, Y., 2016. Nonparametric Dynamic Panel Data Models with Interactive Fixed Effects: Sieve Estimation and Specification Testing. Working Paper, Singapore Management University.
[40] Su, L.; Jin, S.; Zhang, Y., Specification test for panel data models with interactive fixed effects, J. Econometrics, 186, 222-244, (2015) · Zbl 1331.62485
[41] Su, L.; Shi, Z.; Phillips, P. C.B., Identifying latent structures in panel data, Econometrica, 84, 2215-2264, (2016)
[42] Sun, Y., 2005. Estimation and Inference in Panel Structure Models. Working Paper, UCSD.
[43] Tibshirani, R. J., Regression shrinkage and selection via the LASSO, J. R. Stat. Soc. Ser. B Stat. Methodol., 58, 267-288, (1996) · Zbl 0850.62538
[44] Wang, Z.; Zhang, Q., Fundamental factors in the housing markets of China, J. Hous. Econ., 25, 53-61, (2014)
[45] Wang, W.; Phillips, P. C.B.; Su, L., Homogeneity pursuit in panel data models: theory and applications, J. Appl. Econometrics, (2018), (forthcoming)
[46] Yuan, M.; Lin, Y., Model selection and estimation in regression with grouped variables, J. R. Stat. Soc. Ser. B Stat. Methodol., 68, 49-67, (2006) · Zbl 1141.62030
[47] Zhang, Y.; Hua, X.; Zhao, L., Exploring determinants of housing prices: A case study of Chinese experience in 1999-2010, Econ. Model., 29, 2349-2361, (2012)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.