AUGEM swMATH ID: 17584 Software Authors: Q. Wang, X. Zhang, Y. Zhang, Q. Yi Description: AUGEM: Automatically generate high performance dense linear algebra kernels on x86 CPUs. Basic Liner algebra subprograms (BLAS) is a fundamental library in scientific computing. In this paper, we present a template-based optimization framework, AUGEM, which can automatically generate fully optimized assembly code for several dense linear algebra (DLA) kernels, such as GEMM, GEMV, AXPY and DOT, on varying multi-core CPUs without requiring any manual interference from developers. In particular, based on domain-specific knowledge about algorithms of the DLA kernels, we use a collection of parameterized code templates to formulate a number of commonly occurring instruction sequences within the optimized low-level C code of these DLA kernels. Then, our framework uses a specialized low-level C optimizer to identify instruction sequences that match the pre-defined code templates and thereby translates them into extremely efficient SSE/AVX instructions. The DLA kernels generated by our template-based approach surpass the implementations of Intel MKL and AMD ACML BLAS libraries, on both Intel Sandy Bridge and AMD Piledriver processors. Homepage: http://dl.acm.org/citation.cfm?id=2503219 Related Software: LAPACK; R; ATLAS; PARDISO; UMFPACK; CSparse; BLAS; BLIS; CUDA; ADOL-C; Eigen; PRIMME; CASTEP; CIRR; ARPACK; PETSc; SLEPc; ABINIT; hypre; Quantum Espresso Cited in: 11 Publications all top 5 Cited by 32 Authors 1 Bartelt, Matthias 1 Bientinesi, Paolo 1 Bloshanskaya, Lidia 1 Breuer, Alex 1 Dietzsch, Julian 1 Erlandson, Lucas 1 Gross, Michael B. 1 Igual, Francisco D. 1 Jónasson, Kristján 1 King, J. R. C. 1 Kruglyakov, Mikhail 1 Kuchta, Miroslav 1 Li, Ruipeng 1 Lind, Steven J. 1 Low, Tze Meng 1 Lumsdaine, Andrew 1 Mangion, Andrew Zammit 1 Mardal, Kent-Andre 1 Matthews, Devin A. 1 Melsted, Páll 1 Mortensen, Mikael 1 Nordaas, Magne 1 Quintana-Ortí, Enrique S. 1 Ragnarsson, Petur Orri 1 Rougier, Jonathan C. 1 Saad, Yousef 1 Sigurðsson, Sven Þ. 1 Smith, Tyler M. 1 Springer, Paul 1 Verschaeve, Joris C. G. 1 Xi, Yuanzhe 1 Yngvason, Hordur Freyr all top 5 Cited in 7 Serials 3 ACM Transactions on Mathematical Software 3 SIAM Journal on Scientific Computing 1 Journal of Computational Physics 1 Journal of Computational and Applied Mathematics 1 Mathematics and Computers in Simulation 1 Computational Statistics and Data Analysis 1 Mathematical Geosciences all top 5 Cited in 8 Fields 10 Numerical analysis (65-XX) 3 Linear and multilinear algebra; matrix theory (15-XX) 1 Statistics (62-XX) 1 Mechanics of deformable solids (74-XX) 1 Fluid mechanics (76-XX) 1 Optics, electromagnetic theory (78-XX) 1 Geophysics (86-XX) 1 Information and communication theory, circuits (94-XX) Citations by Year