Umeå University's logo

umu.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Parallel robust solution of triangular linear systems
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.ORCID-id: 0000-0002-8444-6303
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.ORCID-id: 0000-0002-4675-7434
2019 (engelsk)Inngår i: Concurrency and Computation, ISSN 1532-0626, E-ISSN 1532-0634, Vol. 31, nr 19, artikkel-id e5064Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Triangular linear systems are central to the solution of general linear systems and the computation of eigenvectors. In the absence of floating‐point exceptions, substitution runs to completion and solves a system which is a small perturbation of the original system. If the matrix is well‐conditioned, then the normwise relative error is small. However, there are well‐conditioned systems for which substitution fails due to overflow. The robust solvers xLATRS from LAPACK extend the set of linear systems which can be solved by dynamically scaling the solution and the right‐hand side to avoid overflow. These solvers are sequential and apply to systems with a single right‐hand side. This paper presents algorithms which are blocked and parallel. A new task‐based parallel robust solver (Kiya) is presented and compared against both DLATRS and the non‐robust solvers DTRSV and DTRSM. When there are many right‐hand sides, Kiya performs significantly better than the robust solver DLATRS and is not significantly slower than the non‐robust solver DTRSM.

sted, utgiver, år, opplag, sider
John Wiley & Sons, 2019. Vol. 31, nr 19, artikkel-id e5064
Emneord [en]
Overflow protection, parallel algorithms, task-based parallelism, triangular linear systems
HSV kategori
Identifikatorer
URN: urn:nbn:se:umu:diva-154297DOI: 10.1002/cpe.5064ISI: 000486203400007Scopus ID: 2-s2.0-85056329436OAI: oai:DiVA.org:umu-154297DiVA, id: diva2:1271007
Merknad

Special Issue

Tilgjengelig fra: 2018-12-14 Laget: 2018-12-14 Sist oppdatert: 2023-03-07bibliografisk kontrollert
Inngår i avhandling
1. Towards efficient overflow-free solvers for systems of triangular type
Åpne denne publikasjonen i ny fane eller vindu >>Towards efficient overflow-free solvers for systems of triangular type
2019 (engelsk)Licentiatavhandling, med artikler (Annet vitenskapelig)
Abstract [en]

Triangular linear systems are fundamental in numerical linear algebra. A triangular linear system has a straight-forward and efficient solution strategy, namely forward substitution for lower triangular systems and backward substitution for upper triangular systems. Triangular systems, or, more generally, systems of triangular type occur frequently in algorithms for more complex problems. This thesis addresses three systems that involve linear systems of triangular type. The first system concerns quasi-triangular matrices. Quasi-triangular matrices are block triangular with 1-by-1 and 2-by-2 blocks on the diagonal. Quasi-triangular systems arise in the computation of eigenvectors from the real Schur form for the non-symmetric eigenvalue problem. This thesis contributes two algorithms for the eigenvector computation, which solve shifted quasi-triangular linear systems in an efficient and scalable way. The second system addresses scaled triangular linear systems. During the solution of a triangular linear system, the entries of the solution can grow. This growth can exceed the representable range of floating-point numbers. Such an overflow can be avoided by solving a scaled triangular system. The solution is scaled prior to every operation that would otherwise result in an overflow. After scaling, the operations can be executed safely. This thesis analyzes the scalability of a recently developed tiled, robust solver for scaled triangular systems, which ensures that at no point in the computation the overflow threshold is exceeded. The third system tackles the scaled continuous-time triangular Sylvester equation, which couples two quasi-triangular matrices. The solution process is prone to overflow. This thesis contributes a robust, tiled solver and demonstrates its practicability. These three systems can be addressed with a variation of forward or backward substitution. Compared to the highly optimized and scalable implementations of standard forward and backward substitution available in HPC libraries,the existing implementations of these three systems run at a smaller fraction of the peak performance. This thesis presents techniques to improve on the performance and robustness of the implementations of the three systems.

sted, utgiver, år, opplag, sider
Umeå: Department of computing science, Umeå University, 2019. s. 18
Serie
Report / UMINF, ISSN 0348-0542 ; 19.05
HSV kategori
Forskningsprogram
datalogi
Identifikatorer
urn:nbn:se:umu:diva-159436 (URN)978-91-7855-084-5 (ISBN)
Veileder
Tilgjengelig fra: 2019-05-28 Laget: 2019-05-28 Sist oppdatert: 2023-03-07bibliografisk kontrollert
2. Improving the efficiency of eigenvector-related computations
Åpne denne publikasjonen i ny fane eller vindu >>Improving the efficiency of eigenvector-related computations
2021 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Alternativ tittel[sv]
Förbättrad effektivitet för egenvektor-relaterade beräkningar
Abstract [en]

An effective strategy in dense linear algebra is the design of algorithms as tiled algorithms. Tiled algorithms that express the bulk of the computation as matrix-matrix operations (level-3 BLAS) have proven successful in achieving high performance on cache-based architectures. At the same time, tiled algorithms interoperate with dynamic data-driven execution models such as task parallelism and promise good parallel scalability.

This thesis applies the concept of tiled algorithms and task-centric execution to algorithms related to the computation of eigenvectors for the dense, non-symmetric eigenvalue problem. First, a standard algorithm for computing eigenvectors from the Schur form is recast such that all computational steps are rich in matrix-matrix operations. Second, inverse iteration on the Hessenberg matrix as an alternative approach to computing eigenvectors is addressed. An existing algorithm is revised to express the computationally most expensive step with matrix-matrix operations. Third, a task-parallel, tiled triangular Sylvester equation solver is amended to solve a larger class of problems. All algorithms have an enhanced performance, which is demonstrated through numerical experiments.

sted, utgiver, år, opplag, sider
Umeå: Umeå University, 2021. s. 27
Serie
Report / UMINF, ISSN 0348-0542 ; 21.05
Emneord
high-performance computing, standard non-symmetric eigenvalue problem, triangular Sylvester equation, tiled algorithms, task parallelism
HSV kategori
Identifikatorer
urn:nbn:se:umu:diva-185734 (URN)978-91-7855-577-2 (ISBN)978-91-7855-576-5 (ISBN)
Disputas
2021-09-20, MA316, MIT-huset, plan 3, Umeå, 10:00 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2021-08-30 Laget: 2021-07-04 Sist oppdatert: 2021-07-05bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Kjelgaard Mikkelsen, Carl ChristianSchwarz, Angelika BeatrixKarlsson, Lars

Søk i DiVA

Av forfatter/redaktør
Kjelgaard Mikkelsen, Carl ChristianSchwarz, Angelika BeatrixKarlsson, Lars
Av organisasjonen
I samme tidsskrift
Concurrency and Computation

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 619 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf