Gareev, Roman A.; Akimova, Elena N. Analytical modeling of matrix-vector multiplication on multicore processors. (English) Zbl 1527.65028 Math. Methods Appl. Sci. 45, No. 15, 8769-8799 (2022). MSC: 65F99 65Y05 PDFBibTeX XMLCite \textit{R. A. Gareev} and \textit{E. N. Akimova}, Math. Methods Appl. Sci. 45, No. 15, 8769--8799 (2022; Zbl 1527.65028) Full Text: DOI
Bos, Joppe W.; Ofner, Maximilian; Renes, Joost; Schneider, Tobias; van Vredendaal, Christine The matrix reloaded: multiplication strategies in FrodoKEM. (English) Zbl 1504.94108 Conti, Mauro (ed.) et al., Cryptology and network security. 20th international conference, CANS 2021, Vienna, Austria, December 13–15, 2021. Proceedings. Cham: Springer. Lect. Notes Comput. Sci. 13099, 72-91 (2021). MSC: 94A60 68W30 68P25 81P94 PDFBibTeX XMLCite \textit{J. W. Bos} et al., Lect. Notes Comput. Sci. 13099, 72--91 (2021; Zbl 1504.94108) Full Text: DOI
Van Zee, Field G.; Parikh, Devangi N.; Geijn, Robert A. Van De Supporting mixed-domain mixed-precision matrix multiplication within the BLIS framework. (English) Zbl 07467972 ACM Trans. Math. Softw. 47, No. 2, Article No. 12, 26 p. (2021). MSC: 65-XX PDFBibTeX XMLCite \textit{F. G. Van Zee} et al., ACM Trans. Math. Softw. 47, No. 2, Article No. 12, 26 p. (2021; Zbl 07467972) Full Text: DOI arXiv
Springer, Paul; Bientinesi, Paolo Design of a high-performance GEMM-like tensor-tensor multiplication. (English) Zbl 1484.65092 ACM Trans. Math. Softw. 44, No. 3, Article No. 28, 29 p. (2018). MSC: 65F99 15A72 65Y05 PDFBibTeX XMLCite \textit{P. Springer} and \textit{P. Bientinesi}, ACM Trans. Math. Softw. 44, No. 3, Article No. 28, 29 p. (2018; Zbl 1484.65092) Full Text: DOI arXiv
Huang, Jianyu; Matthews, Devin A.; van de Geijn, Robert A. Strassen’s algorithm for tensor contraction. (English) Zbl 1416.65117 SIAM J. Sci. Comput. 40, No. 3, C305-C326 (2018). MSC: 65F30 65Y20 15A69 PDFBibTeX XMLCite \textit{J. Huang} et al., SIAM J. Sci. Comput. 40, No. 3, C305--C326 (2018; Zbl 1416.65117) Full Text: DOI arXiv
Matthews, Devin A. High-performance tensor contraction without transposition. (English) Zbl 1379.65024 SIAM J. Sci. Comput. 40, No. 1, C1-C24 (2018). MSC: 65F30 15A69 65Y20 PDFBibTeX XMLCite \textit{D. A. Matthews}, SIAM J. Sci. Comput. 40, No. 1, C1--C24 (2018; Zbl 1379.65024) Full Text: DOI arXiv
Van Zee, Field G.; Smith, Tyler M. Implementing high-performance complex matrix multiplication via the 3m and 4m methods. (English) Zbl 1484.65093 ACM Trans. Math. Softw. 44, No. 1, Article No. 7, 36 p. (2017). MSC: 65F99 PDFBibTeX XMLCite \textit{F. G. Van Zee} and \textit{T. M. Smith}, ACM Trans. Math. Softw. 44, No. 1, Article No. 7, 36 p. (2017; Zbl 1484.65093) Full Text: DOI
Low, Tze Meng; Igual, Francisco D.; Smith, Tyler M.; Quintana-Orti, Enrique S. Analytical modeling is enough for high-performance BLIS. (English) Zbl 1369.65200 ACM Trans. Math. Softw. 43, No. 2, Article No. 12, 18 p. (2016). MSC: 65Y15 65Fxx PDFBibTeX XMLCite \textit{T. M. Low} et al., ACM Trans. Math. Softw. 43, No. 2, Article No. 12, 18 p. (2016; Zbl 1369.65200) Full Text: DOI Link