Umeå University's logo

umu.sePublikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Latent program modeling: Inferring latent problem-solving strategies from a PISA problem-solving task
Umeå universitet, Samhällsvetenskapliga fakulteten, Institutionen för tillämpad utbildningsvetenskap.ORCID-id: 0000-0002-6998-3397
2022 (engelsk)Inngår i: Journal of Educational Data Mining, E-ISSN 2157-2100, Vol. 14, nr 1, s. 46-80Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Response process data have the potential to provide a rich description of test-takers’ thinking processes. However, retrieving insights from these data presents a challenge for educational assessments and educational data mining as they are complex and not well annotated. The present study addresses this challenge by developing a computational model that simulates how different problem-solving strategies would behave while searching for a solution to a Program for International Student Assessment (PISA) 2012 problem-solving item, and uses n-gram processing of data together with a naïve Bayesian classifier to infer latent problem-solving strategies from the test-takers’ response process data. The retrieval of simulated strategies improved with increased n-gram length, reaching an accuracy of 0.72 on the original PISA task. Applying the model to generalized versions of the task showed that classification accuracy increased with problem size and the mean number of actions, reaching a classification accuracy of 0.90 for certain task versions. The strategy that was most efficient and effective in the PISA Traffic task evaluated paths based on the labeled travel time. However, in generalized versions of the task, a straight line strategy was more effective. When applying the classifier to empirical data, most test-takers were classified as using a random path strategy (46%). Test-takers classified as using the travel time strategy had the highest probability of solving the task (p̂ ≈ 1). The test-takers classified as using the random actions strategy had the lowest probability of solving the task (p̂ ≈ 0.11). The effect of (classified) strategy on general PISA problem-solving performance was overall weak, except for a negative effect for the random actions strategy (β ≈ −65, CI95% ≈ [−96, −36]). The study contributes with a novel approach to inferring latent problem-solving strategies from action sequences. The study also illustrates how simulations can provide valuable information about item design by exploring how changing item properties could affect the accuracy of inferences about unobserved problem-solving strategies.

sted, utgiver, år, opplag, sider
International Educational Data Mining Society , 2022. Vol. 14, nr 1, s. 46-80
Emneord [en]
computational cognitive modeling, educational assessment, PISA, problem-solving, process data
HSV kategori
Identifikatorer
URN: urn:nbn:se:umu:diva-203575DOI: 10.5281/zenodo.6686443Scopus ID: 2-s2.0-85145815648OAI: oai:DiVA.org:umu-203575DiVA, id: diva2:1728715
Tilgjengelig fra: 2023-01-19 Laget: 2023-01-19 Sist oppdatert: 2023-04-19bibliografisk kontrollert
Inngår i avhandling
1. Exploring and modeling response process data from PISA: inferences related to motivation and problem-solving
Åpne denne publikasjonen i ny fane eller vindu >>Exploring and modeling response process data from PISA: inferences related to motivation and problem-solving
2023 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Alternativ tittel[sv]
Modellering av responsprocessdata från PISA : inferenser relaterade till motivation och problemlösning
Abstract [en]

This thesis explores and models response process data from large-scale assessments, focusing on test-taking motivation, problem-solving strategies, and questionnaire response validity. It consists of four studies, all using data from PISA (Programme for International Student Assessment) data.

Study I processed and clustered log-file data to create a behavioral evaluation of students' effort applied to a PISA problem-solving item, and examined the relationship between students' behavioral effort, self-reported effort, and test performance. Results show that effort invested before leaving the task unsolved was positively related to performance, while effort invested before solving the tasks was not. Low effort before leaving the task unsolved was further related to lower self-reported effort. The findings suggest that test-taking motivation could only be validly measured from efforts exerted before giving up.

Study II used response process data to infer students' problem-solving strategies on a PISA problem-solving task, and investigated the efficiency of strategies and their relationship to PISA performance. A text classifier trained on data from a generative computational model was used to retrieve different strategies, reaching a classification accuracy of 0.72, which increased to 0.90 with item design changes. The most efficient strategies used information from the task environment to make plans. Test-takers classified as selecting actions randomly performed worse overall. The study concludes that computational modeling can inform score interpretation and item design.

Study III investigated the relationship between motivation to answer the PISA student questionnaire and test performance. Departing from the theory of satisficing in surveys a Bayesian finite mixture model was developed to assess questionnaire-taking motivation. Results showed that overall motivation was high, but decreased toward the end. The questionnaire-taking motivation was positively related to performance, suggesting that it could be a proxy for test-taking motivation, however, reading skills may affect the estimation.

Study IV examines the validity of composite scores assessing reading metacognition, using a Bayesian finite mixture model that jointly considers response times and sequential patterns in subitem responses. The results show that, the relatively high levels of satisficing (up to 30%) negatively biased composite scores. The study highlights the importance of considering response time data and subitem response patterns when the validity of scores from the student questionnaire.

In conclusion, response process data from international large-scale assessments can provide valuable insights into test-takers’ motivation, problem-solving strategies, and questionnaire validity.

sted, utgiver, år, opplag, sider
Umeå: Umeå University, 2023. s. 53
Serie
Academic dissertations at the department of Educational Measurement, ISSN 1652-9650 ; 15
Emneord
response processes, large-scale assessments, motivation, problem-solving, computational modeling, Bayesian modeling
HSV kategori
Forskningsprogram
beteendevetenskapliga mätningar
Identifikatorer
urn:nbn:se:umu:diva-206866 (URN)978-91-8070-058-0 (ISBN)978-91-8070-057-3 (ISBN)
Disputas
2023-05-17, Aula Biologica, Umeå, 10:00 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2023-04-26 Laget: 2023-04-19 Sist oppdatert: 2024-07-02bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Andre lenker

Forlagets fulltekstScopus

Person

Lundgren, Erik

Søk i DiVA

Av forfatter/redaktør
Lundgren, Erik
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric

doi
urn-nbn
Totalt: 124 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf