Ensemble of Streamlined Bilinear Visual Question Answering Models for the ImageCLEF 2019 Challenge in the Medical Domain
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics. ORCID iD: 0000-0002-2391-1419
ARTORG Center, University of Bern, Bern, Switzerland.
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics. ORCID iD: 0000-0002-8971-9788
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics. Umeå University, Faculty of Science and Technology, Department of Chemistry. ORCID iD: 0000-0001-7119-7646
2019 (English). In: CLEF 2019: Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum / [ed] Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, 2019, Vol. 2380. Conference paper, Published paper (Other academic)
Abstract [en]

This paper describes the contribution by participants from Umeå University, Sweden, in collaboration with the University of Bern, Switzerland, for the Medical Domain Visual Question Answering challenge hosted by ImageCLEF 2019. We proposed a novel Visual Question Answering approach that leverages a bilinear model to aggregate and synthesize extracted image and question features. While we did not make use of any additional training data, our model used an attention scheme to focus on the relevant input context and was further boosted by using an ensemble of trained models. We show here that the proposed approach performs at state-of-the-art levels, and provides an improvement over several existing methods. The proposed method was ranked 3rd in the Medical Domain Visual Question Answering challenge of ImageCLEF 2019.
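
The three ingredients named in the abstract (bilinear fusion of image and question features, attention over the input, and an ensemble of trained models) can be illustrated with a minimal NumPy sketch. This is not the authors' architecture: the low-rank (element-wise product) form of the bilinear model, the feature dimensions, the random weights standing in for trained parameters, and all function and variable names are assumptions made only for illustration.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def bilinear_fuse(v, q, U, V):
    """Low-rank bilinear fusion (assumed form): project image features v and
    question features q into a shared k-dim space and multiply element-wise."""
    return (v @ U) * (q @ V)

def attend(regions, q, Ua, Va, wa):
    """Score each image region against the question, then return the
    attention-weighted sum of the region features."""
    scores = bilinear_fuse(regions, q, Ua, Va) @ wa   # one score per region
    alpha = softmax(scores)                           # weights sum to 1
    return alpha @ regions

def predict(regions, q, p):
    """One model: attention, bilinear fusion, linear classifier, softmax."""
    v_att = attend(regions, q, p["Ua"], p["Va"], p["wa"])
    z = bilinear_fuse(v_att, q, p["U"], p["V"])
    return softmax(z @ p["W"])

def ensemble_predict(regions, q, models):
    """Ensemble by averaging the answer probabilities of all models."""
    return np.mean([predict(regions, q, p) for p in models], axis=0)

def random_model(rng, dv, dq, k, n_answers):
    """Hypothetical stand-in for a trained model: random weights."""
    return {
        "Ua": rng.normal(size=(dv, k)), "Va": rng.normal(size=(dq, k)),
        "wa": rng.normal(size=k),
        "U": rng.normal(size=(dv, k)), "V": rng.normal(size=(dq, k)),
        "W": rng.normal(size=(k, n_answers)),
    }

rng = np.random.default_rng(0)
dv, dq, k, n_regions, n_answers = 8, 6, 4, 3, 5   # illustrative sizes only
regions = rng.normal(size=(n_regions, dv))        # image region features
q = rng.normal(size=dq)                           # question feature vector
models = [random_model(rng, dv, dq, k, n_answers) for _ in range(3)]
probs = ensemble_predict(regions, q, models)
print(probs.shape, probs.sum())
```

Averaging probabilities is only one possible ensembling rule; the abstract does not say how the trained models were combined.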

Place, publisher, year, edition, pages
2019. Vol. 2380
National Category
Computer graphics and computer vision; Medical and Health Sciences
Identifiers
URN: urn:nbn:se:umu:diva-166758
OAI: oai:DiVA.org:umu-166758
DiVA, id: diva2:1381723
Conference
CLEF 2019 - Conference and Labs of the Evaluation Forum, Lugano, Switzerland, Sept 9-12, 2019
Available from: 2019-12-27. Created: 2019-12-27. Last updated: 2025-02-01. Bibliographically approved.

Open Access in DiVA

fulltext (951 kB), 271 downloads
File information
File name: FULLTEXT01.pdf
File size: 951 kB
Checksum (SHA-512):
ab1c4db1eb2dd39fdf1d3eacb34c6546f3e2761d99ea629e7cc617ab6bc321e91bbc6a0a1488395bac7719c52d1823defb9ae711eaa533599a997e2bb289473b
Type: fulltext
Mimetype: application/pdf

Authority records

Vu, Minh Hoang; Nyholm, Tufve; Löfstedt, Tommy

Total: 271 downloads
Total: 819 hits