The curated UNESCO Courier 1.0: annotated corpora for digital research in the global humanities
Department of History of Science and Ideas, Uppsala University, Uppsala, Sweden.
Department of Media and Communication Studies, Malmö University, Malmö, Sweden.
Umeå University, Faculty of Arts, Humlab.
Umeå University, Faculty of Arts, Humlab.
2024 (English). In: Journal of Open Humanities Data, E-ISSN 2059-481X, Vol. 10, article id 20. Article in journal (Refereed). Published.
Abstract [en]

The monthly magazine of the United Nations Educational, Scientific and Cultural Organization, founded in 1948 as The UNESCO Courier, represents an extraordinary resource for research on global themes in the humanities. We present the Curated Courier 1.0, a package of digital text corpora, text analysis tools, and supplementary material that aims to make the complete archive of this publication from 1948 to 2020 machine-readable, accessible, and reusable for digital text analysis. One corpus compiles the text of all articles, which we carefully reconstructed and linked to a comprehensive curated metadata index while excluding additional text (masthead, photo captions, letters to the editor, and so on). A second corpus brings together the complete text of all issues. This article first presents the value of Courier as a source for digital research in the global humanities. Second, it outlines how we created the curated corpus and discusses some challenges we met. Third, it offers examples of tools researchers might use to explore and utilize the annotated corpus and discusses a few approaches that we have developed and tested.

Place, publisher, year, edition, pages
Ubiquity Press, 2024. Vol. 10, article id 20
Keywords [en]
global humanities, history, international organizations, text analysis, topic modeling, UNESCO
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:umu:diva-222305
DOI: 10.5334/johd.181
Scopus ID: 2-s2.0-85186412888
OAI: oai:DiVA.org:umu-222305
DiVA, id: diva2:1844939
Funder
Swedish Research Council, 2019-03278
Available from: 2024-03-15. Created: 2024-03-15. Last updated: 2024-03-15. Bibliographically approved.


Authority records

Mähler, Roger; Marklund, Andreas
