Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
LatentAugment: data augmentation via guided manipulation of GAN's latent space
University Campus-Biomedico of Rome, Department of Engineering, Rome, Italy.
Umeå universitet, Medicinska fakulteten, Institutionen för diagnostik och intervention.ORCID-id: 0000-0002-2391-1419
Umeå universitet, Medicinska fakulteten, Institutionen för diagnostik och intervention. University Campus-Biomedico of Rome, Department of Engineering, Rome, Italy.ORCID-id: 0000-0003-2621-072X
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.ORCID-id: 0000-0001-7119-7646
2025 (Engelska)Ingår i: IEEE Transactions on Pattern Analysis and Machine Intelligence, ISSN 0162-8828, E-ISSN 1939-3539, Vol. 47, nr 12, s. 11519-11533Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Data Augmentation (DA) is a technique to increase the quantity and diversity of the training data, and by that alleviate overfitting and improve generalisation. However, standard DA produces synthetic data for augmentation with limited diversity. Generative Adversarial Networks (GANs) may unlock additional information in a dataset by generating synthetic samples having the appearance of real images. However, these models struggle to simultaneously address three key requirements: fidelity and high-quality samples; diversity and mode coverage; and fast sampling. Indeed, GANs generate high-quality samples rapidly, but have poor mode coverage, limiting their adoption in DA applications. We propose LatentAugment, a DA strategy that overcomes the low diversity of GANs, opening up for use in DA applications. Without external supervision, LatentAugment modifies latent vectors and moves them into latent space regions to maximise the synthetic images' diversity and fidelity. It is also agnostic to the dataset and the downstream task. A wide set of experiments shows that LatentAugment improves the generalisation of a deep model translating from MRI-to-CT beating both standard DA as well GAN-based sampling. We further demonstrate its effectiveness when translating from low-energy mammograms to dual-energy subtracted images in contrast-enhanced spectral mammography. Moreover, still in comparison with GAN-based sampling, LatentAugment synthetic samples show superior mode coverage and diversity. Code is available at: https://github.com/ltronchin/LatentAugment.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2025. Vol. 47, nr 12, s. 11519-11533
Nyckelord [en]
Computer vision, generalisation, Generative Adversarial Networks, image synthesis, medical imaging, mode coverage
Nationell ämneskategori
Medicinsk bildvetenskap Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:umu:diva-244092DOI: 10.1109/TPAMI.2025.3598866PubMedID: 40889306Scopus ID: 2-s2.0-105014973123OAI: oai:DiVA.org:umu-244092DiVA, id: diva2:1999914
Forskningsfinansiär
Lions Cancerforskningsfond i Norr, LP 18- 2182Lions Cancerforskningsfond i Norr, LP 22-2319National Academic Infrastructure for Supercomputing in Sweden (NAISS)Swedish National Infrastructure for Computing (SNIC)Tillgänglig från: 2025-09-22 Skapad: 2025-09-22 Senast uppdaterad: 2025-11-21Bibliografiskt granskad

Open Access i DiVA

fulltext(5380 kB)10 nedladdningar
Filinformation
Filnamn FULLTEXT02.pdfFilstorlek 5380 kBChecksumma SHA-512
701f01bc173fa05aeafabb0d1ff246c47b43da2eae0103d12a7ab82114c921dea42e723c5a545fb244899645f49617aecbecb63b7734dcd49074b5fc231293ad
Typ fulltextMimetyp application/pdf

Övriga länkar

Förlagets fulltextPubMedScopus

Person

Vu, Minh HoangSoda, PaoloLöfstedt, Tommy

Sök vidare i DiVA

Av författaren/redaktören
Vu, Minh HoangSoda, PaoloLöfstedt, Tommy
Av organisationen
Institutionen för diagnostik och interventionInstitutionen för datavetenskap
I samma tidskrift
IEEE Transactions on Pattern Analysis and Machine Intelligence
Medicinsk bildvetenskapDatavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 66 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
pubmed
urn-nbn

Altmetricpoäng

doi
pubmed
urn-nbn
Totalt: 256 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf