Automatic grouping using smooth-threshold estimating equations. (English) Zbl 1274.62470

Summary: Redundant statistical models are often used in practical data analysis. The most widely investigated form of redundancy is the inclusion of irrelevant predictors, which is resolved by setting their coefficients to zero. On the other hand, it is also useful to consider overlapping parameters whose values are similar. Grouping, that is, treating a set of parameters as a single parameter, contributes to building a compact parameterization and to increasing estimation accuracy through dimension reduction.
The paper proposes a data-adaptive automatic grouping of parameters that, by applying smooth thresholding, simultaneously enables variable selection and can therefore yield sparse solutions. The new procedure is applicable to several estimating-equation-based methods and is shown to possess the oracle property. No convex optimization is needed for its implementation. Numerical studies, including the large \(p\), small \(n\) situation, are performed. The proposed automatic grouping is applied to interaction modeling for the Ohio wheeze data and for credit scoring data.


62J07 Ridge regression; shrinkage estimators (Lasso)
62J10 Analysis of variance and covariance (ANOVA)
62P20 Applications of statistics to economics
62P05 Applications of statistics to actuarial sciences and financial mathematics


