Temporal clustering of time series via threshold autoregressive models: application to commodity prices.

*(English)* Zbl 1404.62083

Summary: The primary aim of this study is to group time series according to the similarity of their data generating mechanisms (DGMs) rather than by comparing pattern similarities in the time series trajectories. The DGM of each series is approximated by fitting linear autoregressive and nonlinear threshold autoregressive models, and the resulting estimates are used for feature extraction. Threshold autoregressive models are recognized for their ability to represent nonlinear features in time series, such as abrupt changes, time-irreversibility and regime-shifting behavior. The proposed clustering approach is based mainly on feature vectors derived from the estimates of these models. With the proposed approach, one can determine and monitor the set of co-moving time series variables over time. The efficiency of the proposed approach is demonstrated through a simulation study, and the results are compared with those of other time series clustering methods. The approach is illustrated by an application to several commodity prices. Determining time-dependent commodity groups is expected to advance current knowledge about the temporal behavior and dynamics of co-moving and coherent prices, and can serve as a basis for multivariate time series analyses. Furthermore, it may make it possible to generate a time-varying commodity price index and sub-indexes. The findings suggest that the clusters of the price series were affected by the global financial crisis of 2008, and that the data generating mechanisms of the prices, and hence the price clusters, might not be the same across the entire period of analysis.
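The idea of clustering series by their fitted model estimates rather than by trajectory shape can be illustrated with a minimal sketch. It is not the authors' implementation: the two-regime, first-order, intercept-free threshold model, the grid search over the threshold, the three-element feature vector, the plain 2-means step, and all function names are assumptions chosen for brevity.

```python
import numpy as np

def simulate_setar(n, phi_low, phi_high, threshold=0.0, seed=0):
    """Simulate a two-regime threshold AR(1) series (illustrative DGM)."""
    rng = np.random.default_rng(seed)
    y = np.zeros(n)
    for t in range(1, n):
        phi = phi_low if y[t - 1] <= threshold else phi_high
        y[t] = phi * y[t - 1] + rng.standard_normal()
    return y

def fit_ar1(y):
    """Least-squares slope of a linear AR(1) model without intercept."""
    x, z = y[:-1], y[1:]
    return float(x @ z / (x @ x))

def fit_setar1(y, trim=0.15):
    """Fit a two-regime threshold AR(1) by least squares.

    The threshold is chosen on a grid of quantiles of the lagged series;
    trimming keeps enough observations in each regime.
    """
    x, z = y[:-1], y[1:]
    best = None
    for r in np.quantile(x, np.linspace(trim, 1 - trim, 50)):
        low = x <= r
        high = ~low
        phi_l = x[low] @ z[low] / (x[low] @ x[low])
        phi_h = x[high] @ z[high] / (x[high] @ x[high])
        resid = z - np.where(low, phi_l * x, phi_h * x)
        sse = float(resid @ resid)
        if best is None or sse < best[0]:
            best = (sse, float(phi_l), float(phi_h), float(r))
    return best[1], best[2], best[3]

def features(y):
    """Hypothetical feature vector: linear slope plus both regime slopes."""
    phi_l, phi_h, _ = fit_setar1(y)
    return np.array([fit_ar1(y), phi_l, phi_h])

def two_means(X, iters=20):
    """Plain 2-means with a deterministic farthest-point initialisation."""
    c0 = X[0]
    c1 = X[np.argmax(np.linalg.norm(X - c0, axis=1))]
    centers = np.stack([c0, c1])
    for _ in range(iters):
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        centers = np.stack([X[labels == k].mean(axis=0) for k in range(2)])
    return labels

# Two groups of series that differ only in their regime dynamics.
group_a = [simulate_setar(500, 0.7, -0.5, seed=s) for s in range(4)]
group_b = [simulate_setar(500, -0.5, 0.7, seed=s) for s in range(4, 8)]
X = np.array([features(y) for y in group_a + group_b])
labels = two_means(X)
print(labels)
```

Because the features are model estimates, two series can land in the same cluster even when their trajectories look different, as long as their estimated regime dynamics are similar. Refitting on successive time windows would be one way to track how cluster membership changes over time.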

##### MSC:

| Code | Description |
| --- | --- |
| 62M10 | Time series, auto-correlation, regression, etc. in statistics (GARCH) |
| 62H30 | Classification and discrimination; cluster analysis (statistical aspects) |
| 62P05 | Applications of statistics to actuarial sciences and financial mathematics |
| 91G20 | Derivative securities (option pricing, hedging, etc.) |


*S. Aslan* et al., Ann. Oper. Res. 260, No. 1-2, 51-77 (2018; Zbl 1404.62083)
