Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical Reasoning
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0002-1112-2981
Örebro University, Sweden.
2022 (English)In: CEUR Workshop Proceedings / [ed] d'Avila Garcez A.; Jimenez-Ruiz E.; Jimenez-Ruiz E., CEUR-WS , 2022, Vol. 3212Conference paper, Published paper (Refereed)
Abstract [en]

We introduce CLEVR-Math, a multi-modal math word problems dataset consisting of simple math word problems involving addition/subtraction, represented partly by a textual description and partly by an image illustrating the scenario. The text describes actions performed on the scene that is depicted in the image. Since the question posed may not be about the scene in the image, but about the state of the scene before or after the actions are applied, the solver envision or imagine the state changes due to these actions. Solving these word problems requires a combination of language, visual and mathematical reasoning. We apply state-of-the-art neural and neuro-symbolic models for visual question answering on CLEVR-Math and empirically evaluate their performances. Our results show how neither method generalise to chains of operations. We discuss the limitations of the two in addressing the task of multi-modal word problem solving.

Place, publisher, year, edition, pages
CEUR-WS , 2022. Vol. 3212
Series
International Workshop on Neural-Symbolic Learning and Reasoning, ISSN 1613-0073
Keywords [en]
Math Word Problem Solving, Multimodal Reasoning, Neuro-Symbolic, Visual Question Answering
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:umu:diva-200100Scopus ID: 2-s2.0-85138703727OAI: oai:DiVA.org:umu-200100DiVA, id: diva2:1703389
Conference
16th International Workshop on Neural-Symbolic Learning and Reasoning, NeSy 2022, Windsor, UK, september 28-30, 2022.
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Available from: 2022-10-13 Created: 2022-10-13 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

fulltext(3742 kB)110 downloads
File information
File name FULLTEXT01.pdfFile size 3742 kBChecksum SHA-512
df48450b3450318137f480972774fbcd47acc4c664a762dca374153e5cb6b32c77420d7932700517d409291f5932648903af29734b2b3ce7fef2df77513ea3cc
Type fulltextMimetype application/pdf

Other links

ScopusPublishers fulltext

Authority records

Dahlgren Lindström, Adam

Search in DiVA

By author/editor
Dahlgren Lindström, Adam
By organisation
Department of Computing Science
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar
Total: 110 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 316 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf