Algorithm 784: GEMM-based level 3 BLAS: Portability and Optimization Issues
1998 (English). In: ACM Transactions on Mathematical Software, Vol. 24, no. 3, pp. 303-316. Article in journal (Refereed), Published.
This companion article discusses portability and optimization issues of the GEMM-based level 3 BLAS model implementations and the performance evaluation benchmark. The software is provided in all four data types (single- and double-precision, real and complex) and is designed to be easy to implement and use on different platforms. Each of the GEMM-based routines has a few machine-dependent parameters that specify internal block sizes, cache characteristics, and branch points for alternative code sections. These parameters provide a means of tuning the routines to the characteristics of a given memory hierarchy.
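The idea of a GEMM-based routine with a tunable block size can be illustrated with a minimal sketch. It is not the published implementation: it only shows a symmetric rank-k update (C := alpha*A*A^T + beta*C, lower triangle) expressed mostly as calls to a general matrix multiply. The names `BLOCK_SIZE`, `gemm`, and `syrk_lower` are hypothetical, and the pure-Python `gemm` is an untuned stand-in for an optimized xGEMM kernel.

```python
# Hypothetical tunable parameter: rows/columns per diagonal block.
# A real implementation would set this per machine; 2 is arbitrary.
BLOCK_SIZE = 2


def gemm(alpha, a, b, beta, c, rows, cols):
    """Set c[i][j] = alpha * sum_k a[i][k] * b[k][j] + beta * c[i][j]
    for i in rows and j in cols (plain stand-in for an optimized GEMM)."""
    inner = len(b)
    for i in rows:
        for j in cols:
            acc = 0.0
            for k in range(inner):
                acc += a[i][k] * b[k][j]
            c[i][j] = alpha * acc + beta * c[i][j]


def syrk_lower(alpha, a, beta, c, block=BLOCK_SIZE):
    """C := alpha*A*A^T + beta*C, updating only the lower triangle of C."""
    n = len(a)
    a_t = [list(col) for col in zip(*a)]  # explicit transpose of A
    for start in range(0, n, block):
        stop = min(start + block, n)
        # Diagonal block: update only entries on or below the diagonal.
        for i in range(start, stop):
            gemm(alpha, a, a_t, beta, c, [i], range(start, i + 1))
        # The rectangle below the diagonal block is a single GEMM call.
        gemm(alpha, a, a_t, beta, c, range(stop, n), range(start, stop))
    return c
```

In this sketch, changing `block` alters how much of the work goes through the rectangular `gemm` calls but not the result, which is the sense in which such a parameter can be tuned per platform without affecting correctness.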
Identifiers: URN: urn:nbn:se:umu:diva-21955; ISBN: 0098-3500; OAI: oai:DiVA.org:umu-21955; DiVA: diva2:212213