Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
CareCorpus+: expanding and augmenting caregiver strategy data to support pediatric rehabilitation
Department of Computer Science, University of Illinois Chicago Institute for Population and Precision Health, University of Chicago.
Department of Occupational Therapy, University of Illinois Chicago.
Department of Occupational Therapy, University of Illinois Chicago.
Umeå University, Faculty of Science and Technology, Department of Computing Science. Department of Occupational Therapy, University of Illinois Chicago.ORCID iD: 0000-0003-1290-9441
Show others and affiliations
2024 (English)In: EMNLP 2024. The 2024 conference on empirical methods in natural language processing: proceedings of the conference, Association for Computational Linguistics, 2024, p. 6912-6927Conference paper, Published paper (Refereed)
Abstract [en]

Caregiver strategy classification in pediatric rehabilitation contexts is strongly motivated by real-world clinical constraints but highly underresourced and seldom studied in natural language processing settings. We introduce a large dataset of 3,062 caregiver strategies in this setting, a five-fold increase over the nearest contemporary dataset. These strategies are manually categorized into clinically established constructs with high agreement (κ=0.68-0.89). We also propose two techniques to further address identified data constraints. First, we manually supplement target task data with relevant public data from online child health forums. Next, we propose a novel data augmentation technique to generate synthetic caregiver strategies with high downstream task utility. Extensive experiments showcase the quality of our dataset. They also establish evidence that both the publicly available data and the synthetic strategies result in large performance gains, with relative F1 increases of 22.6% and 50.9%, respectively.

Place, publisher, year, edition, pages
Association for Computational Linguistics, 2024. p. 6912-6927
National Category
Computer and Information Sciences Occupational Therapy
Identifiers
URN: urn:nbn:se:umu:diva-232836DOI: 10.18653/v1/2024.emnlp-main.392Scopus ID: 2-s2.0-85217816157ISBN: 979-8-89176-164-3 (electronic)OAI: oai:DiVA.org:umu-232836DiVA, id: diva2:1920245
Conference
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), Miami, Florida, USA, November 12-16, 2024.
Funder
NIH (National Institutes of Health), 1K12 HD055931Available from: 2024-12-11 Created: 2024-12-11 Last updated: 2025-02-24Bibliographically approved

Open Access in DiVA

fulltext(1093 kB)44 downloads
File information
File name FULLTEXT01.pdfFile size 1093 kBChecksum SHA-512
a03611e58f54015dff701654f5fd3c94f5d40cd319272ca5a0324ace4e1d7d3386d8fdd528fb18690e277e43dca35dfe25724d931b526b45e1acd5f52fbce623
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Kaelin, Vera C.

Search in DiVA

By author/editor
Kaelin, Vera C.
By organisation
Department of Computing Science
Computer and Information SciencesOccupational Therapy

Search outside of DiVA

GoogleGoogle Scholar
Total: 44 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 199 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf