UNESCO’s Proceedings, 1945–2017: A Bilingual Digital Text CorpusShow others and affiliations
2025 (English)In: Journal of Open Humanities Data, E-ISSN 2059-481X, Vol. 11, article id 31Article in journal (Refereed) Published
Abstract [en]
The record of the meetings of UNESCO’s General Conference offers a valuable resource for research in the global humanities. We present a digital text corpus, including metadata and supplementary material, that makes the complete record of these meetings from 1946 to 2017 in English and/or French accessible in a machine-readable form that is suitable for digital text analysis. The corpus is stored on Zenodo; relevant code is available on GitHub. The corpus offers reuse potential for scholars interested in any of the countless issues that have been discussed and debated in UNESCO’s General Conference over more than seventy years, as well as to Natural Language Processing (NLP) developers interested in the challenges of language recognition and automated segmentation.
Place, publisher, year, edition, pages
Ubiquity Press, 2025. Vol. 11, article id 31
Keywords [en]
corpus design, digital text analysis, global humanities, international organizations, text corpus, transnational history
National Category
Natural Language Processing
Identifiers
URN: urn:nbn:se:umu:diva-239431DOI: 10.5334/johd.314ISI: 001484675700001Scopus ID: 2-s2.0-105005867352OAI: oai:DiVA.org:umu-239431DiVA, id: diva2:1962971
2025-06-022025-06-022026-01-19Bibliographically approved