Open this publication in new window or tab >>2016 (English)Report (Other academic)
Abstract [en]
A novel parallel formulation of Hessenberg-triangular reduction of a regular matrix pair on distributed memory computers is presented. The formulation is based on a sequential cache-blocked algorithm by Kågstrom, Kressner, E.S. Quintana-Ortí, and G. Quintana-Ortí (2008). A static scheduling algorithm is proposed that addresses the problem of underutilized processes caused by two-sided updates of matrix pairs based on sequences of rotations. Experiments using up to 961 processes demonstrate that the new algorithm is an improvement of the state of the art but also identifies factors that currently limit its scalability.
Place, publisher, year, edition, pages
Umeå: Department of Computing Science, Umeå University, 2016. p. 26
Series
Report / UMINF, ISSN 0348-0542 ; 16.10
National Category
Computational Mathematics
Identifiers
urn:nbn:se:umu:diva-120002 (URN)
2016-05-042016-05-042018-06-07Bibliographically approved