MLlib

swMATH ID: 15430

Software Authors: Meng, Xiangrui; Bradley, Joseph; Yavuz, Burak; Sparks, Evan; Venkataraman, Shivaram; Liu, Davies; Freeman, Jeremy; Tsai, Db; Amde, Manish; Owen, Sean; Xin, Doris; Xin, Reynold; Franklin, Michael J.; Zadeh, Reza; Zaharia, Matei; Talwalkar, Ameet

Description: MLlib: machine learning in Apache Spark. Apache Spark is a popular open-source platform for large-scale data processing that is well-suited for iterative machine learning tasks. In this paper we present MLlib, Spark’s open-source distributed machine learning library. MLlib provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives. Shipped with Spark, MLlib supports several languages and provides a high-level API that leverages Spark’s rich ecosystem to simplify the development of end-to-end machine learning pipelines. MLlib has experienced a rapid growth due to its vibrant open-source community of over 140 contributors, and includes extensive documentation to support further growth and to let users quickly get up to speed.

Homepage: http://spark.apache.org/mllib/

Related Software: Apache Spark; Scikit; Spark; Hadoop; LIBSVM; MapReduce; TensorFlow; XGBoost; rpart; R; Mahout; UCI-ml; MXNet; LIBLINEAR; RCV1; WEKA; ElemStatLearn; randomForest; Apache Flink; GraphX

Cited in: 26 Publications

Standard Articles: 1 Publication describing the Software, including 1 Publication in zbMATH

MLlib: machine learning in Apache Spark.
Zbl 1360.68697
Meng, Xiangrui; Bradley, Joseph; Yavuz, Burak; Sparks, Evan; Venkataraman, Shivaram; Liu, Davies; Freeman, Jeremy; Tsai, Db; Amde, Manish; Owen, Sean; Xin, Doris; Xin, Reynold; Franklin, Michael J.; Zadeh, Reza; Zaharia, Matei; Talwalkar, Ameet
2016

Cited by 127 Authors
1 Amde, Manish
1 Anil, Robin
1 Arbabifard, Kamyar
1 Asencio-Cortes, Gualberto
1 Avalos, Omar
1 Azzag, Hanene
1 Babaee, Hessam
1 Bacciu, Davide
1 Benbernou, Salima
1 Benito, Rosa Maria
1 Berthold, Michael R.
1 Borboudakis, Giorgos
1 Bradley, Joseph
1 Caballero, Rafael E.
1 Capan, Gokhan
1 Carta, Antonio
1 Chen, Weizhu
1 Chen, Yun
1 Christophides, Vassilis
1 Das, Santak
1 Drost-Fromm, Isabel
1 Dunning, Ted
1 Duong, Tarn
1 Fernandez-Basso, Carlos
1 Fillbrunn, Alexander
1 Forte, Simone
1 Franklin, Michael J.
1 Freeman, Jeremy
1 Friedman, Ellen
1 Gairola, Kartik
1 Ghesmoune, Mohammed
1 Gnesi, Stefania
1 Gopal, Chidambara N.
1 Gotety, Ramya
1 Grant, Trevor
1 Greenwell, Brandon M.
1 Gudivada, Venkat N.
1 Iwen, Mark A.
1 Jagadale, Akshay
1 Jaggi, Martin
1 Jain, Pooja
1 Jordan, Michael Irwin
1 Kakarlapudi, Avinash V.
1 Kanavos, Andreas
1 Karniadakis, George Em
1 Katsogridakis, Pavlos
1 Kumar, Shishir
1 Langseth, Helge
1 Lebbah, Mustapha
1 Li, Jinxiang
1 Li, Zhengying
1 Lin, Qihang
1 Lin, Yi-Yang
1 Lingras, Pawan
1 Liu, Davies
1 Losada, Juan Carlos
1 Lu, Haihao
1 Ma, Chenxin
1 Madsen, Anders L.
1 Martín-Bautista, María-José
1 Martínez-Álvarez, Francisco
1 Martínez, Ana M.
1 Masegosa, Andrés R.
1 Masoumi, Behrooz
1 Mazumder, Rahul
1 Mehta, Tejas R.
1 Meng, Xiangrui
1 Nair, Divitha
1 Nielsen, Thomas D.
1 Nodarakis, Nikolaos
1 Okafor, Nonso
1 Ouziri, Mourad
1 Owen, Sean W.
1 Pande, Kartikeya
1 Pérez-Chacón, R.
1 Pratikakis, Polyvios
1 Qin, Wangchen
1 Quinn, Shannon L.
1 Qureshi, Shiban
1 Raissi, Maziar
1 Ramos-López, Darío
1 Ranjan, Paritosh
1 Robles, José Manuel
1 Roy, Asim Kumar
1 Rudraraju, Srinivasa B.
1 Ruiz, María Dolores
1 Salehi, Abbas
1 Salmerón, Antonio
1 Schelter, Sebastian
1 Semini, Laura
1 Sharma, Kirti
1 Sharma, Shashank
1 Shukla, Anju
1 Siebes, Arno P. J. M.
1 Sindhanuru, Hemanth
1 Singh, Harikesh
1 Singh, Suraj Bhan
1 Sioutas, Spyros
1 Smith, Virginia
1 Song, Ruiyang
...and 27 more Authors

Cited in 18 Serials
4 Journal of Machine Learning Research (JMLR)
3 International Journal of Approximate Reasoning
2 Machine Learning
1 Information Sciences
1 Computational Mechanics
1 SIAM Journal on Matrix Analysis and Applications
1 Applied Mathematical Modelling
1 SIAM Journal on Optimization
1 INFORMS Journal on Computing
1 Soft Computing
1 Journal of Combinatorial Optimization
1 Data Mining and Knowledge Discovery
1 Probability in the Engineering and Informational Sciences
1 Proceedings of the National Academy of Sciences, India. Section A. Physical Sciences
1 Journal of Applied Mathematics and Computing
1 Algorithms
1 Journal of Logical and Algebraic Methods in Programming
1 Chapman & Hall/CRC Data Science Series

Cited in 7 Fields
19 Computer science (68-XX)
10 Statistics (62-XX)
5 Operations research, mathematical programming (90-XX)
2 Game theory, economics, finance, and other social and behavioral sciences (91-XX)
1 Linear and multilinear algebra; matrix theory (15-XX)
1 Probability theory and stochastic processes (60-XX)
1 Numerical analysis (65-XX)