From promise to practice: a study of common pitfalls behind the generalization gap in machine learning
Umeå University, Faculty of Science and Technology, Department of Computing Science. (Machine Learning)
Umeå University, Faculty of Science and Technology, Department of Computing Science. (Machine Learning)
Umeå University, Faculty of Medicine, Department of Diagnostics and Intervention.ORCID iD: 0000-0002-6321-8117
Umeå University, Faculty of Medicine, Department of Diagnostics and Intervention.ORCID iD: 0000-0002-8971-9788
2025 (English). In: Transactions on Machine Learning Research, E-ISSN 2835-8856. Article in journal (Refereed), Published.
Abstract [en]

Machine Learning (ML) offers great promise, but there is often a noticeable gap between the claims made in research papers and a model's practical performance in real-life applications. This gap can often be attributed to systematic errors and pitfalls that occur during the development of ML models. This study aims to identify these errors systematically. To this end, we break down the ML process into four main stages: data handling, model design, model evaluation, and reporting. Across these stages, we identify fourteen common pitfalls based on a comprehensive review of around 60 papers discussing either broad challenges or specific pitfalls within the ML pipeline. Using the Brain Tumor Segmentation (BraTS) dataset, we then perform three experiments to illustrate the impact of these pitfalls, providing examples of how they can skew results and affect outcomes. We also perform a review to study how frequently reporting on these pitfalls is unclear in ML research. The goal of this review is to assess whether authors have adequately addressed the pitfalls in their reports. For this, we review 126 randomly chosen papers on image segmentation from the ICCV (2013-2021) and MICCAI (2013-2022) conferences. The results show a notable oversight of these issues, with many of the papers lacking clarity on how the pitfalls are handled. This highlights an important gap in current reporting practices within the ML community. The code for the experiments is available at https://github.com/SG-Azar/BraTS-ML-Pitfalls-Experiments.

Place, publisher, year, edition, pages
Transactions on Machine Learning Research, 2025.
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:umu:diva-233898
Scopus ID: 2-s2.0-85219528926
OAI: oai:DiVA.org:umu-233898
DiVA, id: diva2:1926398
Funder
Swedish Childhood Cancer Foundation, MT2021-0012
Lions Cancerforskningsfond i Norr, LP 22-2319
Lions Cancerforskningsfond i Norr, LP 24-2367

Available from: 2025-01-10. Created: 2025-01-10. Last updated: 2025-03-18. Bibliographically approved.

Open Access in DiVA

No full text in DiVA

Authority records

Ghanbari Azar, Saeideh; Tronchin, Lorenzo; Simkó, Attila; Nyholm, Tufve; Löfstedt, Tommy
