Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Question-Centric Model for Visual Question Answering in Medical Imaging
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics.ORCID iD: 0000-0002-2391-1419
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics.ORCID iD: 0000-0001-7119-7646
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics.ORCID iD: 0000-0002-8971-9788
ARTORG Center, University of Bern, 3008 Bern, Switzerland.ORCID iD: 0000-0001-6791-4753
2020 (English)In: IEEE Transactions on Medical Imaging, ISSN 0278-0062, E-ISSN 1558-254X, Vol. 39, no 9, p. 2856-2868Article in journal (Refereed) Published
Abstract [en]

Deep learning methods have proven extremely effective at performing a variety of medical image analysis tasks. With their potential use in clinical routine, their lack of transparency has however been one of their few weak points, raising concerns regarding their behavior and failure modes. While most research to infer model behavior has focused on indirect strategies that estimate prediction uncertainties and visualize model support in the input image space, the ability to explicitly query a prediction model regarding its image content offers a more direct way to determine the behavior of trained models. To this end, we present a novel Visual Question Answering approach that allows an image to be queried by means of a written question. Experiments on a variety of medical and natural image datasets show that by fusing image and question features in a novel way, the proposed approach achieves an equal or higher accuracy compared to current methods.

Place, publisher, year, edition, pages
IEEE, 2020. Vol. 39, no 9, p. 2856-2868
Keywords [en]
Visual question answering, deep learning, medical images, medical questions and answers
National Category
Computer graphics and computer vision Natural Language Processing Other Medical Sciences not elsewhere specified
Research subject
Computer Science; Computerized Image Analysis
Identifiers
URN: urn:nbn:se:umu:diva-174694DOI: 10.1109/TMI.2020.2978284ISI: 000566339800013Scopus ID: 2-s2.0-85090173541OAI: oai:DiVA.org:umu-174694DiVA, id: diva2:1463481
Funder
Cancerforskningsfonden i NorrlandRegion VästerbottenVinnovaAvailable from: 2020-09-02 Created: 2020-09-02 Last updated: 2025-02-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Vu, Minh HoangLöfstedt, TommyNyholm, Tufve

Search in DiVA

By author/editor
Vu, Minh HoangLöfstedt, TommyNyholm, TufveSznitman, Raphael
By organisation
Radiation Physics
In the same journal
IEEE Transactions on Medical Imaging
Computer graphics and computer visionNatural Language ProcessingOther Medical Sciences not elsewhere specified

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 287 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf