Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Benchmarking the Linear Algebra Awareness of TensorFlow and PyTorch
Rwth, Aachen University, Germany.
Rwth, Aachen University, Germany.
Rwth, Aachen University, Germany.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap. Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Högpresterande beräkningscentrum norr (HPC2N).ORCID-id: 0000-0002-4972-7097
2022 (Engelska)Ingår i: Proceedings: 2022 IEEE 36th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2022, IEEE, 2022, s. 924-933Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Linear algebra operations, which are ubiquitous in machine learning, form major performance bottlenecks. The High-Performance Computing community invests significant effort in the development of architecture-specific optimized kernels, such as those provided by the BLAS and LAPACK libraries, to speed up linear algebra operations. However, end users are progressively less likely to go through the error prone and time-consuming process of directly using said kernels; instead, frameworks such as TensorFlow (TF) and PyTorch (PyT), which facilitate the development of machine learning applications, are becoming more and more popular. Although such frameworks link to BLAS and LAPACK, it is not clear whether or not they make use of linear algebra knowledge to speed up computations. For this reason, in this paper we develop benchmarks to investigate the linear algebra optimization capabilities of TF and PyT. Our analyses reveal that a number of linear algebra optimizations are still missing; for instance, reducing the number of scalar operations by applying the distributive law, and automatically identifying the optimal parenthesization of a matrix chain. In this work, we focus on linear algebra computations in TF and PyT; we both expose opportunities for performance enhancement to the benefit of the developers of the frameworks and provide end users with guidelines on how to achieve performance gains.

Ort, förlag, år, upplaga, sidor
IEEE, 2022. s. 924-933
Nyckelord [en]
Linear Algebra, Machine Learning, Performance analysis
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:umu:diva-199008DOI: 10.1109/IPDPSW55747.2022.00150ISI: 000855041000111Scopus ID: 2-s2.0-85136223573ISBN: 9781665497473 (digital)ISBN: 9781665497480 (tryckt)OAI: oai:DiVA.org:umu-199008DiVA, id: diva2:1692576
Konferens
36th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2022, Lyon, France, 30 May - 03 June 2022
Tillgänglig från: 2022-09-02 Skapad: 2022-09-02 Senast uppdaterad: 2023-11-10Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Bientinesi, Paolo

Sök vidare i DiVA

Av författaren/redaktören
Bientinesi, Paolo
Av organisationen
Institutionen för datavetenskapHögpresterande beräkningscentrum norr (HPC2N)
Datavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 243 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf