swMATH ID: 
20213

Software Authors: 
Ernie Chan, Field G. Van Zee, Paolo Bientinesi, Enrique S. QuintanaOrti, Gregorio QuintanaOrti, Robert van de Geijn

Description: 
SuperMatrix: a multithreaded runtime scheduling system for algorithmsbyblocks. This paper describes SuperMatrix, a runtime system that parallelizes matrix operations for SMP and/or multicore architectures. We use this system to demonstrate how code described at a high level of abstraction can achieve high performance on such architectures while completely hiding the parallelism from the library programmer. The key insight entails viewing matrices hierarchically, consisting of blocks that serve as units of data where operations over those blocks are treated as units of computation. The implementation transparently enqueues the required operations, internally tracking dependencies, and then executes the operations utilizing outoforder execution techniques inspired by superscalar microarchitectures. This separation of concerns allows library developers to implement algorithms without concerning themselves with the parallelization aspect of the problem. Different heuristics for scheduling operations can be implemented in the runtime system independent of the code that enqueues the operations. Results gathered on a 16 CPU ccNUMA Itanium2 server demonstrate excellent performance. 
Homepage: 
http://dl.acm.org/citation.cfm?id=1345227

Related Software: 
ScaLAPACK;
LAPACK;
FLAME;
PLASMA;
MAGMA;
BLAS;
libflame;
PLAPACK;
MKL;
StarPU;
Algorithm 1022;
randUTV;
Algorithm 656;
Algorithm 679;
PETSc;
FLASH;
Matlab;
PHiPAC;
ESSL;
CTF

Cited in: 
5 Publications
