×

zbMATH — the first resource for mathematics

Learning wind fields with multiple kernels. (English) Zbl 1421.86019
Summary: This paper presents multiple kernel learning (MKL) regression as an exploratory spatial data analysis and modelling tool. The MKL approach is introduced as an extension of support vector regression, where MKL uses dedicated kernels to divide a given task into sub-problems and to treat them separately in an effective way. It provides better interpretability to non-linear robust kernel regression at the cost of a more complex numerical optimization. In particular, we investigate the use of MKL as a tool that allows us to avoid using ad-hoc topographic indices as covariables in statistical models in complex terrains. Instead, MKL learns these relationships from the data in a non-parametric fashion. A study on data simulated from real terrain features confirms the ability of MKL to enhance the interpretability of data-driven models and to aid feature selection without degrading predictive performances. Here we examine the stability of the MKL algorithm with respect to the number of training data samples and to the presence of noise. The results of a real case study are also presented, where MKL is able to exploit a large set of terrain features computed at multiple spatial scales, when predicting mean wind speed in an Alpine region.
MSC:
86A32 Geostatistics
86-04 Software, source code, etc. for problems pertaining to geophysics
68T05 Learning and adaptive systems in artificial intelligence
Software:
EnKF; EDA; SHOGUN; SVM
PDF BibTeX XML Cite
Full Text: DOI
References:
[1] Andrienko N, Andrienko G (2006) Exploratory data analysis of spatial and temporal data. Springer, NY · Zbl 1093.62005
[2] Ayotte, KW, Computational modelling for wind energy assessment, Journal of Wind Engineering and Industrial Aerodynamics, 96, 1571-1590, (2008)
[3] Ayotte, KW; Davy, RJ; Coppin, PA, A simple temporal and spatial analysis of flow in complex terrain in the context of wind energy modeling, Boundary-Layer Meteorology, 98, 275-295, (2001)
[4] Bach FR, Lanckriet GRG, Jordan MI (2004) Multiple kernel learning, conic duality and the SMO algorithm. In: Proceedings of the 21th international conference on machine learning 69
[5] Baines PG (1997) Topographic effects in stratified flows. Cambridge University Press, Cambridge
[6] Beccali M, Cirrincione G, Marvuglia A, Serporta C (In press) Estimation of wind velocity over a complex terrain using the generalized mapping regressor. Applied Energy
[7] Bishop C (2006) Pattern recognition and machine learning. Springer, NY · Zbl 1107.68072
[8] Canu S, Grandvalet Y, Guigue V, and Rakotomamonjy A (2005) SVM and kernel methods matlab toolbox. Perception Systèmes et Information, INSA de Rouen, Rouen, France
[9] Cellura, M.; Cirrincione, G.; Marvuglia, A.; Miraoui, A., Wind speed spatial estimation for energy planning in Sicily: A neural kriging application, Renewable Energy, 33, 1251-1266, (2008)
[10] Cressie N (1993) Statistics for spatial data, revised edn. Wiley, NY
[11] Eidsvik, KJ, A system for wind power estimation in mountainous terrain, Prediction of Askervein hill data. Wind Energy, 8, 237-249, (2005)
[12] Eidsvik, KJ; Holstad, A.; Lie, I.; Utnes, T., A prediction system for local wind variations in mountainous terrain, Boundary-Layer Meteorology, 112, 557-586, (2004)
[13] Evensen G (2006) Data assimilation: The ensemble Kalman filter. Springer, NY · Zbl 1157.86001
[14] Faure P, Huard P (1965) Résolution de programmes mathématiques à fonction non linéaire par la méthode du gradient réduit, Revue Française de Recherche Opérationnelle 36 · Zbl 0135.20001
[15] Foresti L, Pozdnoukhov A, Tuia D and Kanevski M (In press) Extreme precipitation modelling using geostatistics and machine learning algorithms. Proceedings of the 7th international conference on geostatistics for environmental applications
[16] Foresti L, Tuia D, Pozdnoukhov A, Kanevski M (2009) Multiple kernel learning of environmental data. Case study: analysis and mapping of wind fields. Proceedings of the 19th international conference on artificial neural networks, Part II, pp 933-943
[17] Franck HP, Rathmann O, Mortensen NG, Landberg L (2001) The numerical wind atlas—the KAMM/WAsP method. Risoe National Laboratory publications, Danemark Risoe-R-1252(EN)
[18] Freeman, WT; Adelson, EH, The design and use of steerable filters, IEEE transactions on Pattern Analysis and Machine Intelligence, 13, 891-906, (1991)
[19] Freund RM (2004) Solution methods for quadratic optimization. Technical report, Massachusetts Institute of Technology, MA
[20] Gönen, M.; Alpaydin, E., Localized multiple kernel learning, Proceedings of the 25th International Conference on Machine Learning, 307, 352-359, (2008)
[21] Gravdahl AR (1998) Meso scale modeling with a reynolds averaged navier-stokes solver: assessment of wind resources along the Norwegian coast. 31th IEA experts meeting. State of the Art on Wind Resource Estimation
[22] Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V., Gene selection for cancer classification using support vector machines, Machine Learning, 46, 389-422, (2002) · Zbl 0998.68111
[23] Guyon I, Gunn S, Nikravesh M, Zadeh LA (eds) (2006) Feature extraction: foundations and applications. Springer, NY · Zbl 1114.68059
[24] Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning, 2nd edn. Springer, NY · Zbl 1273.62005
[25] Haykin S (1999) Neural Networks. Prentice Hall, India
[26] Huber, PJ, Robust estimation of a location parameter, Annals of Mathematical Statistics, 35, 73-101, (1964) · Zbl 0136.39805
[27] Hughes, GF, On the mean accuracy of statistical pattern recognition, IEEE Transactions on Information Theory, 14, 55-63, (1968)
[28] Kanevski M (ed) (2008) Advanced mapping of environmental data. ISTE Wiley, NY
[29] Kanevski M, Pozdnoukhov A, Timonin V (2009) Machine learning algorithms for spatial data analysis and modelling. EPFL Press, Lausanne
[30] Lanckriet, GRG; De Bie, T.; Cristianini, N.; Jordan, MI; Noble, WS, A statistical framework for genomic data fusion, Bioinformatics, 20, 2626-2635, (2004)
[31] Landberg, L.; Myllerup, L.; Rathmann, O.; Petersen, EL; Jorgensen, BH; Badger, J.; Mortensen, NG, Wind resource estimation-an overview, Wind Energy, 6, 261-271, (2003)
[32] Lewis, DP; Jebara, T.; Noble, WS, Support vector machine learning from heterogeneous data: an empirical analysis using protein sequence and structure, Bioinformatics, 22, 2753-2760, (2006)
[33] Lindsay JB, Rothwell J (2008) Modelling channeling and deflection of wind by topography. In: Zhou Q, Lees B (eds) Advances in digital terrain analysis. Springer, NY, pp 383-406
[34] Liston, GE; Elder, KA, Meteorological distribution system for high-resolution terrestrial modeling (microMet), Journal of Hydrometeorology, 7, 217-234, (2006)
[35] Longworth C, Gales MJF (2008) multiple kernel learning for speaker verification. IEEE conference on acoustic, speech and signal processing ICASSP, pp 1581-1584
[36] Martinez WL (2004) Exploratory data analysis with matlab. Chapman & Hall/CRC, London
[37] Mercer J (1905) Functions of positive and negative type and their connection with the theory of integral equations. Phil Trans R Soc CCIX:215-228
[38] Palma, JMLM; Castro, FA; Ribeiro, LF; Rodrigues, AH; Pinto, AP, Linear and nonlinear models in wind resource assessment and wind turbine micro-siting in complex terrain, Journal of Wind Engineering and Industrial Aerodynamics, 96, 2308-2326, (2008)
[39] Petersen, EL; Mortensen, NG; Landberg, L.; Hojstrup, J.; Frank, HP, Wind power meteorology, Wind Energy, 1, 2-22, (1998)
[40] Pozdnoukhov, A.; Kanevski, M., Multi-scale support vector algorithms for hot spot detection and modelling, Stochastic Environmental Research and Risk Assessment, 22, 647-660, (2008) · Zbl 1407.62346
[41] Pozdnoukhov A, Kanevski M, Timonin V (2007) Prediction of wind power density using machine learning algorithms. Proceedings of the 12th annual conference of international association for mathematical Geology
[42] Pozdnoukhov, A.; Foresti, L.; Kanevski, M., Data-driven topo-climatic mapping with machine learning methods, Natural Hazards, 3, 497-518, (2009)
[43] Rakotomamonjy, A.; Bach, FR; Canu, S.; Grandvalet, Y., Simple MKL, Journal of Machine Learning Research, 9, 2491-2521, (2008)
[44] Rätsch, G.; Sonnenburg, S.; Schäfer, C., Learning interpretable SVMs for biological sequence classification, BMC Bioinformatics, 7, s9, (2006) · Zbl 1119.92334
[45] Schaffner B, Remund J (eds) (2005) The alpine space wind map: modeling approach. Alpine Windharvest Report Series 7-2. Alpine windharvest partnership network
[46] Schölkopf B (2001) The kernel trick for distances. In: Leen TK, Dietterich TG, and Tresp V (eds) NIPS. MIT Press, Cambridge, pp 301-307
[47] Schölkopf B, Smola A (2002) Learning with Kernels. MIT Press, Cambridge
[48] Smola A-J, Schölkopf B (1998) A Tutorial on support vector regression. NeuroCOLT2 technical report series, NC2-TR-1998-030
[49] Sonnenburg, S.; Schaefer, G.; Rätsch, G.; Schölkopf, B., Large scale multiple kernel learning, Journal of Machine Learning Research, 7, 1531-1565, (2006) · Zbl 1222.90072
[50] Tuia D, Kanevski M (2008) Environmental monitoring network characterization and clustering. In: Kanevski (ed) Advanced mapping of environmental data. ISTE Wiley, NY, pp 19-47
[51] Tuia D, Camps-Valls G, Matasci G, Kanevski M (in press) Learning relevant image features with multiple kernel classification. IEEE Trans Geosci Remote Sens
[52] Vapnik V (1995) The nature of statistical learning theory. Springer, NY · Zbl 0833.62008
[53] Whiteman CD (2000) Mountain meteorology: fundamentals and applications. Oxford University Press, Oxford
[54] Wilson JP, Gallant JC (eds) (2000) Terrain analysis: principles and applications. Wiley, NY
[55] Zien, A.; Ong, CS, Multiclass multiple kernel learning, Proceedings of the 24th International Conference on Machine Learning, 227, 1191-1198, (2007)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.