On aggressive early deflation in parallel variants of the QR algorithm
2012 (English). In: Applied parallel and scientific computing, Part I, Berlin, Heidelberg: Springer, 2012, pp. 1-10. Conference paper (Refereed)
The QR algorithm computes the Schur form of a matrix and is by far the most popular approach for solving dense nonsymmetric eigenvalue problems. Multishift and aggressive early deflation (AED) techniques have led to significantly more efficient sequential implementations of the QR algorithm during the last decade. More recently, these techniques have been incorporated in a novel parallel QR algorithm on hybrid distributed memory HPC systems. While leading to significant performance improvements, it has turned out that AED may become a computational bottleneck as the number of processors increases. In this paper, we discuss a two-level approach for performing AED in a parallel environment, where the lower level consists of a novel combination of AED with the pipelined QR algorithm implemented in the ScaLAPACK routine PDLAHQR. Numerical experiments demonstrate that this new implementation further improves the performance of the parallel QR algorithm.
Place, publisher, year, edition, pages
Berlin, Heidelberg: Springer, 2012. 1-10 p.
Series: Lecture Notes in Computer Science, ISSN 0302-9743 ; 7133
Identifiers
URN: urn:nbn:se:umu:diva-61792
ISI: 000309713800001
ISBN: 978-3-642-28150-1
OAI: oai:DiVA.org:umu-61792
DiVA: diva2:572434
Conference: 10th Nordic International Conference on Applied Parallel Computing - State of the Art in Scientific and Parallel Computing (PARA), June 6-9, 2010, Reykjavik, Iceland
Note: The ISSN in this record refers to the print version; an online version is also available.
ISSN: 0302-9743 (Print), 1611-3349 (Online)
Available from: 2012-11-27. Created: 2012-11-26. Last updated: 2013-03-13. Bibliographically approved