Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Enabling mixed-precision in spectral element codes
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0009-0003-5512-254X
Université Paris-Saclay, UVSQ, LI-PaRAD, France.
KTH - Royal Institute of Technology, Sweden.
Show others and affiliations
2026 (English)In: Future Generation Computer Systems, ISSN 0167-739X, E-ISSN 1872-7115, Vol. 174, article id 107990Article in journal (Refereed) Published
Abstract [en]

Mixed-precision computing has the potential to significantly reduce the cost of exascale computations, but determining when and how to implement it in programs can be challenging. In this article, we propose a methodology for enabling mixed-precision with the help of computer arithmetic tools, roofline model, and computer arithmetic techniques. As case studies, we consider Nekbone (Nek5000 developers), a mini-application for the Computational Fluid Dynamics (CFD) solver Nek5000 (Fischer et al.), and a modern Neko (Jansson et al., 2024) CFD application. With the help of the Verificarlo (Denis et al., 2016) tool and computer arithmetic techniques, we introduce a strategy to address stagnation issues in the preconditioned Conjugate Gradient method in Nekbone and apply these insights to implement a mixed-precision version of Neko. We evaluate the derived mixed-precision versions of these codes by combining metrics in three dimensions: accuracy, time-to-solution, and energy-to-solution. Notably, mixed-precision in Nekbone reduces time-to-solution by roughly 1.62x and energy-to-solution by 2.43x on MareNostrum 5, while in the real-world Neko application, the gain is up to 1.3x in both time and energy, with the accuracy that matches double-precision results.

Place, publisher, year, edition, pages
Elsevier, 2026. Vol. 174, article id 107990
Keywords [en]
Computer arithmetic tool, Conjugate gradient, Energy-to-solution, Mixed-precision, Neko, Roofline model, Verificarlo
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-242183DOI: 10.1016/j.future.2025.107990Scopus ID: 2-s2.0-105009726439OAI: oai:DiVA.org:umu-242183DiVA, id: diva2:1983971
Available from: 2025-07-14 Created: 2025-07-14 Last updated: 2025-07-14Bibliographically approved

Open Access in DiVA

fulltext(1858 kB)60 downloads
File information
File name FULLTEXT01.pdfFile size 1858 kBChecksum SHA-512
2f839894bd2c29fa28d2ad0dd9963a349fb89054aeba8e538e7215639bd25cdee71db1be51a17cd1642a66e9e5797acd8a2aaa0dedba8e57fbced3cd94ff588e
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Chen, YanxiangBientinesi, PaoloIakymchuk, Roman

Search in DiVA

By author/editor
Chen, YanxiangBientinesi, PaoloIakymchuk, Roman
By organisation
Department of Computing Science
In the same journal
Future Generation Computer Systems
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 62 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 379 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf