Measurement of alignment between standards and assessment
Umeå University, Faculty of Social Sciences, Department of Educational Measurement.
2008 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Many educational systems of today are standards-based and aim for alignment, i.e. consistency, among the components of the educational system: standards, teaching and assessment. To conclude whether the alignment is sufficiently high, analyses with a useful model are needed. This thesis investigates the usefulness of models for analyzing alignment between standards and assessments, with emphasis on one method: Bloom’s revised taxonomy. The thesis comprises an introduction and five articles that empirically investigate the usefulness of methods for alignment analyses.

In the first article, the usefulness of different models for analyzing alignment between standards and assessment is theoretically and empirically compared based on a number of criteria. The results show that Bloom’s revised taxonomy is the most useful model. The second article investigates the usefulness of Bloom’s revised taxonomy for interpretation of standards in mathematics with two differently composed panels of judges. One panel consisted of teachers and the other panel of assessment experts. The results show that Bloom’s revised taxonomy is useful for interpretation of standards, but that many standards are multi-categorized (placed in more than one category). The results also show higher levels of intra- and inter-judge consistency for assessment experts than for teachers. The third article further investigates the usefulness of Bloom’s revised taxonomy for analyses of alignment between standards and assessment. The results show that Bloom’s revised taxonomy is useful for analyses of both standards and assessments. The fourth article studies whether vague and general standards can explain the large proportion of multi-categorized standards in mathematics. The strategy was to divide a set of standards into smaller substandards and then compare the usefulness and inter-judge consistency for categorization with Bloom’s revised taxonomy for undivided and divided standards. The results show that vague and general standards do not explain the large proportion of multi-categorized standards. Another explanation is related to the nature of mathematics that often intertwines conceptual and procedural knowledge. This was also studied in the article and the results indicate that this is a probable explanation. The fifth article focuses on another aspect of alignment between standards and assessment, namely the alignment between performance standards and cut-scores for a specific assessment. The validity of two standard-setting methods, the Angoff method and the borderline-group method, was investigated. The results show that both methods derived reasonable and trustworthy cut-scores, but also that there are potential problems with these methods.

In the introductory part of the thesis, the empirical studies are summarized, contextualized and discussed. The discussion relates alignment to validity issues for assessments and relates the obtained empirical results to theoretical assumptions and applied implications. One conclusion of the thesis is that Bloom’s revised taxonomy is useful for analyses of alignment between standards and assessments. Another conclusion is that the two standard-setting methods derive reasonable and trustworthy results. It is preferable if an alignment model can be used both for alignment analyses and in ongoing practice for increasing alignment. Bloom’s revised taxonomy has the potential for being such an alignment model. This thesis has found this taxonomy useful for alignment analyses, but its usefulness for increasing alignment in ongoing practice has to be investigated.

Place, publisher, year, edition, pages
Umeå: Beteendevetenskapliga mätningar, 2008. p. 226
Series
Academic dissertations at the department of Educational Measurement, ISSN 1652-9650 ; 3
Keywords [en]
alignment, standards, assessment, Bloom's revised taxonomy, the Angoff method, the borderline-group method, usefulness, validity
National subject category
Manufacturing, Surface and Joining Technology
Identifiers
URN: urn:nbn:se:umu:diva-1865; ISBN: 978-91-7264-662-9 (print); OAI: oai:DiVA.org:umu-1865; DiVA, id: diva2:142244
Public defence
2008-10-24, S205, Samhällsvetarhuset, Umeå, 10:15 (English)
Available from: 2008-09-30 Created: 2008-09-30 Last updated: 2018-06-09. Bibliographically approved
List of papers
1. Alignment of standards and assessment: A theoretical and empirical study of methods for alignment
2008 (English). In: Electronic Journal of Research in Educational Psychology, ISSN 1699-5880, E-ISSN 1696-2095, Vol. 6, no. 3, p. 667-690. Article in journal (Refereed). Published
Abstract [en]

Introduction. In a standards-based school system, alignment of policy documents with standards and assessment is important. To be able to evaluate whether schools and students have reached the standards, the assessment should focus on the standards. Different models and methods can be used for measuring alignment, i.e. the correspondence between standards and assessment. Based on the assumption that a model must be able to include content and cognitive complexity, nine different models are identified and these models are then scrutinized with reference to defined theoretical criteria. The conclusion is that Bloom’s revised taxonomy and Porter’s taxonomy are the most appropriate models.

Method. Bloom’s revised taxonomy and Porter’s taxonomy are compared based on empirical data from standards and assessment in a chemistry course in upper secondary schools in Sweden. The comparison is based on five rules and on inter-rater reliability.

Results. Bloom’s revised taxonomy was more inclusive and exclusive than Porter’s taxonomy. The inter-rater reliability for classification of standards was significantly better for Bloom’s revised taxonomy than for Porter’s taxonomy.

Conclusion. Based on the five rules, the conclusion is that Bloom’s revised taxonomy is the best model.

Keywords
Alignment, standards, assessment, Bloom's revised taxonomy, Porter's taxonomy
Identifiers
urn:nbn:se:umu:diva-11292 (URN)
Available from: 2008-12-09 Created: 2008-12-09 Last updated: 2024-03-06. Bibliographically approved
2. Interpretation of standards with Bloom's revised taxonomy: a comparison of teachers and assessment experts
2009 (English). In: International Journal of Research and Method in Education, ISSN 1743-727X, E-ISSN 1743-7288, Vol. 32, no. 1, p. 39-51. Article in journal (Refereed). Published
Abstract [en]

In education, standards have to be interpreted for planning of teaching, for development of assessments and for alignment analysis. In most cases, it is important that there is agreement between individuals and organizations about how to interpret standards. However, there is a lack of studies of how consistent different groups of judges are when interpreting standards. In this study, the usefulness of Bloom’s revised taxonomy for interpreting standards in mathematics is evaluated, using different criteria. The results indicate that the taxonomy is an acceptable tool. The results also indicate that there are differences between the panel composed of teachers and the panel composed of assessment experts. The assessment experts were more consistent in their interpretation of standards. Limitations of the study and requirements for alignment analysis are discussed.

Place, publisher, year, edition, pages
London: Routledge, 2009
Keywords
standards, Bloom’s revised taxonomy, inter-judge consistency, intrajudge
National subject category
Pedagogy
Identifiers
urn:nbn:se:umu:diva-31051 (URN); 10.1080/17437270902749262 (DOI); 2-s2.0-70449578749 (Scopus ID)
Available from: 2010-01-27 Created: 2010-01-27 Last updated: 2023-03-24. Bibliographically approved
3. Alignment between standards and assessment: An evaluation of the usefulness of Bloom's revised taxonomy
(English). Manuscript (preprint) (Other academic)
Identifiers
urn:nbn:se:umu:diva-3507 (URN)
Available from: 2008-09-30 Created: 2008-09-30 Last updated: 2022-03-14
4. Interpretation of standards with Bloom's revised taxonomy: Does a division influence its usefulness?
(English). Manuscript (preprint) (Other academic)
Identifiers
urn:nbn:se:umu:diva-3508 (URN)
Available from: 2008-09-30 Created: 2008-09-30 Last updated: 2022-03-22
5. A comparison of two different methods for setting performance standards for a test with constructed-response items
2008 (English). In: Practical Assessment, Research, and Evaluation, E-ISSN 1531-7714, Vol. 13, no. 9, p. 12-. Article in journal (Refereed). Published
Abstract [en]

The trustworthiness of performance standards influences the credibility of criterion-referenced large-scale testing. In this paper, two standard-setting methods are evaluated and compared, when applied to a test with polytomously scored constructed-response items. A version of the Angoff method is chosen as representative of the class of test-centred standard-setting procedures and the borderline-group method represents the class of examinee-centred procedures. The evaluation is based on procedural, internal and external evidence. The results indicate that both methods provide reasonable and trustworthy approaches to standard setting, but also confirm some of the potential problems with these methods.

Place, publisher, year, edition, pages
College Park, Md.: ERIC Clearinghouse on Assessment and Evaluation and the Department of Measurement, Statistics, and Evaluation at the University of Maryland, 2008
Identifiers
urn:nbn:se:umu:diva-3509 (URN)
Available from: 2008-09-30 Created: 2008-09-30 Last updated: 2024-01-16. Bibliographically approved

Person

Näsström, Gunilla
