Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
von Boer, Johan
Publications (4 of 4) Show all publications
Nylén, F., Skotare, T. & von Boer, J. (2025). The Visible Speech (VISP) platform: a secure infrastructure for the study of speech acts and spoken conversations. In: DHNB 2025: 9th Conference on Digital Humanities in the Nordic and Baltic Countries. Paper presented at 9th Conference on Digital Humanities in the Nordic and Baltic Countries (DHNB 2025), Tartu, Estonia, March 5-7, 2025.
Open this publication in new window or tab >>The Visible Speech (VISP) platform: a secure infrastructure for the study of speech acts and spoken conversations
2025 (English)In: DHNB 2025: 9th Conference on Digital Humanities in the Nordic and Baltic Countries, 2025Conference paper, Poster (with or without abstract) (Refereed)
Abstract [en]

Spoken language is central to many human interactions and provides the medium through which activities and events across many humanities and social sciences disciplines are studied. It is also the object of active study in itself. As central to humanity as spoken language is, regulations aimed at mitigating privacy concerns also affect the affordance for collaborations on a national or larger scale based on spoken materials.  

The Visible Speech (VISP) platform is a web-based research infrastructure at Humlab, Umeå University, designed to handle audio recordings of speech in compliance with the national implementation of GDPR and security requirements. VISP provides a centralized environment for research of all disciplines in which recordings of spoken language constitute the primary material, meeting both researchers’ needs for efficient workflows and legislators’ demands for secure data management.

One of VISP's primary advantages is its ability to facilitate research on audio recordings that now constitute personally identifiable information (PII) under the application of the GDPR in Sweden. These recordings may contain sensitive content or have been made in sensitive contexts, classifying them as sensitive PII under national legislation. Sensitive contents may occur in relation to, for instance, the ethnicity and religious beliefs of the speaker, and sensitive contexts may occur when the recording is made in a healthcare context or in a context where a person’s membership with a union organization is divulged. While the challenges in conducting larger research efforts on the types of materials are currently aggravated by the implementation of the GDPR locally in Sweden, it is currently not clear to what extent upcoming AI regulation will, in effect, migrate identical or similar constraints to research in other countries in the EU as well.

The VISP platform offers a unified environment for storage, controlled access, direct work, and reproducible speech signal processing. VISP is built on the foundation of earlier research efforts1,2 and includes a comprehensive set of speech and voice analysis procedures within one framework. Thus, national research groups can collectively store interviews or other spoken language recordings, have automatic transcriptions or other speech processing performed, and access the results for complementary manual annotation or analysis simultaneously and securely. Additionally, VISP facilitates the digital archiving of projects through a uniform, documented, and transparent directory structure, reducing barriers to making data available following the FAIR principles. Research projects dealing with sensitive personal data in audio recording form require review by the Ethical Approval Authority and may subsequently take advantage of the VISP facilities.

A significant feature of VISP is its integration with the Swedish Academic Identity Federation (SWAMID, connected to eduGAIN), which enables researchers across Sweden to have secure, federated logins. This national federated login system allows researchers to access project data and collaborate on material processing in ways that were previously not possible. Moreover, VISP supports projects by lowering the step in to digital signal processing and audio analysis of the collected audio signals. This capability allows researchers to perform hands-on processing and analysis without the risk of disseminating sensitive audio recordings. By leveraging SWAMID, VISP ensures that researchers can work seamlessly and securely on collected materials, enhancing collaborative efforts and data handling efficiency. By providing tools for direct manipulation and examination of audio data, VISP ensures that all stages of data handling, from collection to analysis, are conducted within a secure environment, thereby maintaining the integrity and confidentiality of sensitive information.

The work conducted within VISP is part of SweCLARIN, the Swedish node of the European Research Infrastructure Consortium (ERIC) CLARIN. SweCLARIN aims to develop and provide national and European infrastructure for speech and text-based e-science, offering extensive digitized materials and advanced language technology tools. By combining the advanced technologies developed by CLARIN ERIC partners1 with stringent security protocols and leveraging federated login systems, VISP enables efficient and secure research on audio recordings of speech. The VISP components are available for download and setup of local instances and for modification, and the framework, therefore, promises to provide an invaluable tool for researchers, facilitating unprecedented collaboration and data processing within the digital humanities on both a national and larger scale.

References:

[1] R. Winkelmann, J. Harrington, K. Jänsch, EMU-SDMS: Advanced speech database management and analysis in R, Comput. Speech Lang. 45 (2017) 392–410.

[2] R. Winkelmann, J. Harrington, EMU-SDMS: R Centric Semi-automatic Speech Database Processing and Analysis. In Sasha Calhoun, Paola Escudero, Marija Tabain & Paul Warren (eds.) Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019 (pp. 1317--1321).  Canberra, Australia: Australasian Speech Science and Technology Association Inc

National Category
Languages and Literature Other Humanities
Research subject
digital humanities
Identifiers
urn:nbn:se:umu:diva-235877 (URN)
Conference
9th Conference on Digital Humanities in the Nordic and Baltic Countries (DHNB 2025), Tartu, Estonia, March 5-7, 2025
Funder
Swedish Research Council, 2023-00161-16
Available from: 2025-02-24 Created: 2025-02-24 Last updated: 2025-02-24Bibliographically approved
Gholamrezaie, E., Buckland, P. I., Mähler, R., von Boer, J., Weegar, R., Sjölander, M. & Engqvist, C.-E. (2024). A swedish national infrastructure for interdisciplinary environmental research integrating archaeological and quaternary geological data. In: EGU General Assebly 2024: Programme. Paper presented at EGU General Assembly 2024, Vienna, Austria and Online, April 14-19, 2024. , Article ID EGU24-15957.
Open this publication in new window or tab >>A swedish national infrastructure for interdisciplinary environmental research integrating archaeological and quaternary geological data
Show others...
2024 (English)In: EGU General Assebly 2024: Programme, 2024, article id EGU24-15957Conference paper, Oral presentation with published abstract (Refereed)
National Category
Archaeology Earth and Related Environmental Sciences Computer Systems
Identifiers
urn:nbn:se:umu:diva-232681 (URN)10.5194/egusphere-egu24-15957 (DOI)
Conference
EGU General Assembly 2024, Vienna, Austria and Online, April 14-19, 2024
Note

Part of sub programme ESSI3 – Open Science Informatics for Earth and Space Sciences. 

Available from: 2024-12-05 Created: 2024-12-05 Last updated: 2025-01-31Bibliographically approved
Edvardsson, J., Hansson, A., Sjölander, M., von Boer, J., Buckland, P. I., Linderson, H., . . . Hammarlund, D. (2023). Old wood in a new light: an online dendrochronological database. International Journal of Wood Culture, 3(1-3), 442-463
Open this publication in new window or tab >>Old wood in a new light: an online dendrochronological database
Show others...
2023 (English)In: International Journal of Wood Culture, ISSN 2772-3194, Vol. 3, no 1-3, p. 442-463Article in journal (Refereed) Published
Abstract [en]

The Old Wood in a New Light database project focuses on the digitization and accessibility of the results of dendrochronological samples analyzed and archived at four Swedish university-based tree-ring laboratories at Lund University, Stockholm University, University of Gothenburg, and the Swedish University of Agricultural Sciences. Collaboration with the Environmental Archaeology Laboratory and Humlab at Umeå University enables long-term open access to data, raw data, and metadata. In this project, we (1) systematically undertake large-scale entry and open access publication of results from wood samples scientifically analyzed and archived by Swedish laboratories and the associated metadata, into the Strategic Environmental Archaeology Database (SEAD; www.sead.se) research data infrastructure, and (2) actively promote the database as a resource for new and ongoing interdisciplinary research initiatives. Including dendrochronological data in SEAD infrastructure allows interdisciplinary studies that combine major scientific and societal questions. Building on a pilot study of construction timber from southern Sweden and adaptation of SEAD digitization workflows, more than 70 000 samples archived at the four dendrochronological laboratories are now being handled in the project. The broad coverage of research networks, stakeholder interaction, and strategic support from the cultural heritage community is guaranteed owing to the ongoing collaboration between laboratories and an established international and multidisciplinary reference group.

Place, publisher, year, edition, pages
Brill Academic Publishers, 2023
Keywords
archaeology, cultural heritage, dendrochronology, open data, Sweden, timber, tree rings
National Category
History and Archaeology Agriculture, Forestry and Fisheries Climate Science Wood Science Geology
Research subject
Archaeology; environmental archaeology; Earth Sciences with Specialization Environmental Analysis; climate change; data science; cultural heritage
Identifiers
urn:nbn:se:umu:diva-201875 (URN)10.1163/27723194-bja10009 (DOI)
Projects
Old wood in a new lightSEAD
Funder
Riksbankens Jubileumsfond, IN20-0026Riksbankens Jubileumsfond, IN15-0231:1Swedish Research Council, 2007-7494Swedish Research Council, 2010-5976
Available from: 2022-12-21 Created: 2022-12-21 Last updated: 2025-02-01Bibliographically approved
Buckland, P. I., Sjölander, M., von Boer, J., Roger, M. & Linderholm, J. (2022). The intricate details of using research databases and repositories for environmental archaeology data. ArcheoLogica Data, 2, 15-29
Open this publication in new window or tab >>The intricate details of using research databases and repositories for environmental archaeology data
Show others...
2022 (English)In: ArcheoLogica Data, ISSN 2785-0307, Vol. 2, p. 15-29Article in journal (Refereed) Published
Abstract [en]

Environmental archaeology is a complex mix of empirical analysis and qualitative interpretation.It is increasingly data science oriented, and databases and online resources are becoming increasinglyimportant in large scale synthesis research on changes in climate, environments and human activities.Research funders, journals and universities place much emphasis on the use of data repositories toensure transparency and reusability in the research process. Although these are important, researchersthemselves, however, may have more use for research databases which are oriented more towardsadvanced querying and exploratory data analysis than conforming to archiving standards. This paperexplores the pros and cons of these different approaches. It also discusses and problematizes somekey concepts in research data management, including the definitions of data and metadata, along withthe FAIR principles. Research examples are provided from a broad field of environmental archaeologyand palaeoecology. In contrast to most publications, the developer’s perspective is also included, anda worked example using the Strategic Environmental Archaeology Database (SEAD) to investigate fossilbeetle data demonstrates the implementation of some of this in the real world. This example may befollowed online using the SEAD browser, and all described data downloaded from there. After providingboth encouragement and warnings on the use of digital resources for synthesis research, some suggestionsare made for moving forward.

Place, publisher, year, edition, pages
Sesto Fiorentino: All’Insegna del Giglio, 2022
Keywords
database, infrastructure, environment, archives, FAIR, Palaeoecology, geoarchaeology, research data infrastructure, linking data and metadata, interdisciplinary research.
National Category
Computer and Information Sciences Archaeology Geology Information Studies
Research subject
Archaeology; environmental archaeology; data science; Computer Science; library and information science
Identifiers
urn:nbn:se:umu:diva-194496 (URN)10.13131/unipi/2785-0668/a1cc-xt56 (DOI)
Available from: 2022-05-06 Created: 2022-05-06 Last updated: 2022-05-09Bibliographically approved
Organisations

Search in DiVA

Show all publications