Umeå universitets logga

umu.sePublikationer
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
von Boer, Johan
Publikationer (4 of 4) Visa alla publikationer
Nylén, F., Skotare, T. & von Boer, J. (2025). The Visible Speech (VISP) platform: a secure infrastructure for the study of speech acts and spoken conversations. In: DHNB 2025: 9th Conference on Digital Humanities in the Nordic and Baltic Countries. Paper presented at 9th Conference on Digital Humanities in the Nordic and Baltic Countries (DHNB 2025), Tartu, Estonia, March 5-7, 2025.
Öppna denna publikation i ny flik eller fönster >>The Visible Speech (VISP) platform: a secure infrastructure for the study of speech acts and spoken conversations
2025 (Engelska)Ingår i: DHNB 2025: 9th Conference on Digital Humanities in the Nordic and Baltic Countries, 2025Konferensbidrag, Poster (med eller utan abstract) (Refereegranskat)
Abstract [en]

Spoken language is central to many human interactions and provides the medium through which activities and events across many humanities and social sciences disciplines are studied. It is also the object of active study in itself. As central to humanity as spoken language is, regulations aimed at mitigating privacy concerns also affect the affordance for collaborations on a national or larger scale based on spoken materials.  

The Visible Speech (VISP) platform is a web-based research infrastructure at Humlab, Umeå University, designed to handle audio recordings of speech in compliance with the national implementation of GDPR and security requirements. VISP provides a centralized environment for research of all disciplines in which recordings of spoken language constitute the primary material, meeting both researchers’ needs for efficient workflows and legislators’ demands for secure data management.

One of VISP's primary advantages is its ability to facilitate research on audio recordings that now constitute personally identifiable information (PII) under the application of the GDPR in Sweden. These recordings may contain sensitive content or have been made in sensitive contexts, classifying them as sensitive PII under national legislation. Sensitive contents may occur in relation to, for instance, the ethnicity and religious beliefs of the speaker, and sensitive contexts may occur when the recording is made in a healthcare context or in a context where a person’s membership with a union organization is divulged. While the challenges in conducting larger research efforts on the types of materials are currently aggravated by the implementation of the GDPR locally in Sweden, it is currently not clear to what extent upcoming AI regulation will, in effect, migrate identical or similar constraints to research in other countries in the EU as well.

The VISP platform offers a unified environment for storage, controlled access, direct work, and reproducible speech signal processing. VISP is built on the foundation of earlier research efforts1,2 and includes a comprehensive set of speech and voice analysis procedures within one framework. Thus, national research groups can collectively store interviews or other spoken language recordings, have automatic transcriptions or other speech processing performed, and access the results for complementary manual annotation or analysis simultaneously and securely. Additionally, VISP facilitates the digital archiving of projects through a uniform, documented, and transparent directory structure, reducing barriers to making data available following the FAIR principles. Research projects dealing with sensitive personal data in audio recording form require review by the Ethical Approval Authority and may subsequently take advantage of the VISP facilities.

A significant feature of VISP is its integration with the Swedish Academic Identity Federation (SWAMID, connected to eduGAIN), which enables researchers across Sweden to have secure, federated logins. This national federated login system allows researchers to access project data and collaborate on material processing in ways that were previously not possible. Moreover, VISP supports projects by lowering the step in to digital signal processing and audio analysis of the collected audio signals. This capability allows researchers to perform hands-on processing and analysis without the risk of disseminating sensitive audio recordings. By leveraging SWAMID, VISP ensures that researchers can work seamlessly and securely on collected materials, enhancing collaborative efforts and data handling efficiency. By providing tools for direct manipulation and examination of audio data, VISP ensures that all stages of data handling, from collection to analysis, are conducted within a secure environment, thereby maintaining the integrity and confidentiality of sensitive information.

The work conducted within VISP is part of SweCLARIN, the Swedish node of the European Research Infrastructure Consortium (ERIC) CLARIN. SweCLARIN aims to develop and provide national and European infrastructure for speech and text-based e-science, offering extensive digitized materials and advanced language technology tools. By combining the advanced technologies developed by CLARIN ERIC partners1 with stringent security protocols and leveraging federated login systems, VISP enables efficient and secure research on audio recordings of speech. The VISP components are available for download and setup of local instances and for modification, and the framework, therefore, promises to provide an invaluable tool for researchers, facilitating unprecedented collaboration and data processing within the digital humanities on both a national and larger scale.

References:

[1] R. Winkelmann, J. Harrington, K. Jänsch, EMU-SDMS: Advanced speech database management and analysis in R, Comput. Speech Lang. 45 (2017) 392–410.

[2] R. Winkelmann, J. Harrington, EMU-SDMS: R Centric Semi-automatic Speech Database Processing and Analysis. In Sasha Calhoun, Paola Escudero, Marija Tabain & Paul Warren (eds.) Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019 (pp. 1317--1321).  Canberra, Australia: Australasian Speech Science and Technology Association Inc

Nationell ämneskategori
Språk och litteratur Annan humaniora och konst
Forskningsämne
digital humaniora
Identifikatorer
urn:nbn:se:umu:diva-235877 (URN)
Konferens
9th Conference on Digital Humanities in the Nordic and Baltic Countries (DHNB 2025), Tartu, Estonia, March 5-7, 2025
Forskningsfinansiär
Vetenskapsrådet, 2023-00161-16
Tillgänglig från: 2025-02-24 Skapad: 2025-02-24 Senast uppdaterad: 2025-02-24Bibliografiskt granskad
Gholamrezaie, E., Buckland, P. I., Mähler, R., von Boer, J., Weegar, R., Sjölander, M. & Engqvist, C.-E. (2024). A swedish national infrastructure for interdisciplinary environmental research integrating archaeological and quaternary geological data. In: EGU General Assebly 2024: Programme. Paper presented at EGU General Assembly 2024, Vienna, Austria and Online, April 14-19, 2024. , Article ID EGU24-15957.
Öppna denna publikation i ny flik eller fönster >>A swedish national infrastructure for interdisciplinary environmental research integrating archaeological and quaternary geological data
Visa övriga...
2024 (Engelska)Ingår i: EGU General Assebly 2024: Programme, 2024, artikel-id EGU24-15957Konferensbidrag, Muntlig presentation med publicerat abstract (Refereegranskat)
Nationell ämneskategori
Arkeologi Geovetenskap och relaterad miljövetenskap Datorsystem
Identifikatorer
urn:nbn:se:umu:diva-232681 (URN)10.5194/egusphere-egu24-15957 (DOI)
Konferens
EGU General Assembly 2024, Vienna, Austria and Online, April 14-19, 2024
Anmärkning

Part of sub programme ESSI3 – Open Science Informatics for Earth and Space Sciences. 

Tillgänglig från: 2024-12-05 Skapad: 2024-12-05 Senast uppdaterad: 2025-01-31Bibliografiskt granskad
Edvardsson, J., Hansson, A., Sjölander, M., von Boer, J., Buckland, P. I., Linderson, H., . . . Hammarlund, D. (2023). Old wood in a new light: an online dendrochronological database. International Journal of Wood Culture, 3(1-3), 442-463
Öppna denna publikation i ny flik eller fönster >>Old wood in a new light: an online dendrochronological database
Visa övriga...
2023 (Engelska)Ingår i: International Journal of Wood Culture, ISSN 2772-3194, Vol. 3, nr 1-3, s. 442-463Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

The Old Wood in a New Light database project focuses on the digitization and accessibility of the results of dendrochronological samples analyzed and archived at four Swedish university-based tree-ring laboratories at Lund University, Stockholm University, University of Gothenburg, and the Swedish University of Agricultural Sciences. Collaboration with the Environmental Archaeology Laboratory and Humlab at Umeå University enables long-term open access to data, raw data, and metadata. In this project, we (1) systematically undertake large-scale entry and open access publication of results from wood samples scientifically analyzed and archived by Swedish laboratories and the associated metadata, into the Strategic Environmental Archaeology Database (SEAD; www.sead.se) research data infrastructure, and (2) actively promote the database as a resource for new and ongoing interdisciplinary research initiatives. Including dendrochronological data in SEAD infrastructure allows interdisciplinary studies that combine major scientific and societal questions. Building on a pilot study of construction timber from southern Sweden and adaptation of SEAD digitization workflows, more than 70 000 samples archived at the four dendrochronological laboratories are now being handled in the project. The broad coverage of research networks, stakeholder interaction, and strategic support from the cultural heritage community is guaranteed owing to the ongoing collaboration between laboratories and an established international and multidisciplinary reference group.

Ort, förlag, år, upplaga, sidor
Brill Academic Publishers, 2023
Nyckelord
archaeology, cultural heritage, dendrochronology, open data, Sweden, timber, tree rings
Nationell ämneskategori
Historia och arkeologi Jordbruk, skogsbruk och fiske Klimatvetenskap Trävetenskap Geologi
Forskningsämne
arkeologi; miljöarkeologi; geovetenskap med inriktning mot miljöanalys; klimatförändringar; data science; kulturarv
Identifikatorer
urn:nbn:se:umu:diva-201875 (URN)10.1163/27723194-bja10009 (DOI)
Projekt
Old wood in a new lightSEAD
Forskningsfinansiär
Riksbankens Jubileumsfond, IN20-0026Riksbankens Jubileumsfond, IN15-0231:1Vetenskapsrådet, 2007-7494Vetenskapsrådet, 2010-5976
Tillgänglig från: 2022-12-21 Skapad: 2022-12-21 Senast uppdaterad: 2025-02-01Bibliografiskt granskad
Buckland, P. I., Sjölander, M., von Boer, J., Roger, M. & Linderholm, J. (2022). The intricate details of using research databases and repositories for environmental archaeology data. ArcheoLogica Data, 2, 15-29
Öppna denna publikation i ny flik eller fönster >>The intricate details of using research databases and repositories for environmental archaeology data
Visa övriga...
2022 (Engelska)Ingår i: ArcheoLogica Data, ISSN 2785-0307, Vol. 2, s. 15-29Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Environmental archaeology is a complex mix of empirical analysis and qualitative interpretation.It is increasingly data science oriented, and databases and online resources are becoming increasinglyimportant in large scale synthesis research on changes in climate, environments and human activities.Research funders, journals and universities place much emphasis on the use of data repositories toensure transparency and reusability in the research process. Although these are important, researchersthemselves, however, may have more use for research databases which are oriented more towardsadvanced querying and exploratory data analysis than conforming to archiving standards. This paperexplores the pros and cons of these different approaches. It also discusses and problematizes somekey concepts in research data management, including the definitions of data and metadata, along withthe FAIR principles. Research examples are provided from a broad field of environmental archaeologyand palaeoecology. In contrast to most publications, the developer’s perspective is also included, anda worked example using the Strategic Environmental Archaeology Database (SEAD) to investigate fossilbeetle data demonstrates the implementation of some of this in the real world. This example may befollowed online using the SEAD browser, and all described data downloaded from there. After providingboth encouragement and warnings on the use of digital resources for synthesis research, some suggestionsare made for moving forward.

Ort, förlag, år, upplaga, sidor
Sesto Fiorentino: All’Insegna del Giglio, 2022
Nyckelord
database, infrastructure, environment, archives, FAIR, Palaeoecology, geoarchaeology, research data infrastructure, linking data and metadata, interdisciplinary research.
Nationell ämneskategori
Data- och informationsvetenskap Arkeologi Geologi Biblioteks- och informationsvetenskap
Forskningsämne
arkeologi; miljöarkeologi; data science; datalogi; biblioteks- och informationsvetenskap
Identifikatorer
urn:nbn:se:umu:diva-194496 (URN)10.13131/unipi/2785-0668/a1cc-xt56 (DOI)
Tillgänglig från: 2022-05-06 Skapad: 2022-05-06 Senast uppdaterad: 2022-05-09Bibliografiskt granskad
Organisationer

Sök vidare i DiVA

Visa alla publikationer