Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
Publications (10 of 11) Show all publications
Khairova, N. & Vysotska, V. (Eds.). (2024). CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024. Paper presented at CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024. CEUR-WS
Open this publication in new window or tab >>CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024
2024 (English)Conference proceedings (editor) (Refereed)
Place, publisher, year, edition, pages
CEUR-WS, 2024. p. 583
Series
CEUR Workshop Proceedings (CEUR-WS), ISSN 1613-0073 ; 3722
National Category
Natural Language Processing
Identifiers
urn:nbn:se:umu:diva-228015 (URN)
Conference
CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024
Available from: 2024-07-22 Created: 2024-07-22 Last updated: 2025-02-07Bibliographically approved
Ericson, P., Khairova, N. & De Vos, M. (2024). Joint Postproceedings for the Workshops and Tutorials at the Third International Conference on Hybrid Human-Artificial Intelligence (HHAI) (preface). In: Petter Ericson, Nina Khairova, Marina De Vos (Ed.), CEUR Workshop Proceedings: . Paper presented at 3rd International Conference on Hybrid Human-Artificial Intelligence, HHAI-WS 2024, June 10-11, 2024, Malmö, Sweden (pp. I-III). CEUR-WS
Open this publication in new window or tab >>Joint Postproceedings for the Workshops and Tutorials at the Third International Conference on Hybrid Human-Artificial Intelligence (HHAI) (preface)
2024 (English)In: CEUR Workshop Proceedings / [ed] Petter Ericson, Nina Khairova, Marina De Vos, CEUR-WS , 2024, p. I-IIIConference paper, Published paper (Refereed)
Abstract [en]

This preface briefly presents the organisation and outcomes of the workshop and tutorial days of the Third International Conference on Hybrid Human-Artificial Intelligence (HHAI) 2024, introducing the conference topic and giving key highlights of the specifics of the proceedings.

Place, publisher, year, edition, pages
CEUR-WS, 2024
Series
International Conference on Hybrid Human-Artificial Intelligence, ISSN 1613-0073 ; 3825
Keywords
hybrid human artificial intelligence, hybrid intelligence
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-232600 (URN)2-s2.0-85210320416 (Scopus ID)
Conference
3rd International Conference on Hybrid Human-Artificial Intelligence, HHAI-WS 2024, June 10-11, 2024, Malmö, Sweden
Available from: 2024-12-09 Created: 2024-12-09 Last updated: 2024-12-09Bibliographically approved
Khairova, N., Kupriianov, Y., Vorzhevitina, A. & Shanidze, O. (2024). Models for effective categorization and classification of texts into specific thematic groups (using gender and criminal themes as examples). In: Nina Khairova; Victoria Vysotska (Ed.), CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024. Paper presented at CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024 (pp. 37-49). CEUR-WS, IV
Open this publication in new window or tab >>Models for effective categorization and classification of texts into specific thematic groups (using gender and criminal themes as examples)
2024 (English)In: CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024 / [ed] Nina Khairova; Victoria Vysotska, CEUR-WS , 2024, Vol. IV, p. 37-49Conference paper, Published paper (Refereed)
Abstract [en]

An analysis of existing automated methods for text classification, used to develop an effective approach for automated text classification by thematic groups in the context of information related to criminal and gender themes, was conducted. Based on the analysis of classification methods, an algorithm for classifying texts by types of crime and gender was developed, information-linguistic and software for the task of distributing texts into thematic groups were developed, and the effectiveness of the developed application was assessed.

Place, publisher, year, edition, pages
CEUR-WS, 2024
Series
CEUR Workshop Proceedings (CEUR-WS), ISSN 1613-0073 ; 3722
Keywords
categorization, Classification, criminal justice theme, gender criminology, gender stereotypes, social practices, thematic groups
National Category
Natural Language Processing
Identifiers
urn:nbn:se:umu:diva-227967 (URN)2-s2.0-85198743336 (Scopus ID)
Conference
CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024
Available from: 2024-07-22 Created: 2024-07-22 Last updated: 2025-02-07Bibliographically approved
Khairova, N. & Vysotska, V. (2024). Preface: computational linguistics workshop. In: Nina Khairova; Victoria Vysotska (Ed.), CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024. Paper presented at CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12-13, 2024. CEUR-WS, IV, Article ID preface.
Open this publication in new window or tab >>Preface: computational linguistics workshop
2024 (English)In: CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024 / [ed] Nina Khairova; Victoria Vysotska, CEUR-WS , 2024, Vol. IV, article id prefaceConference paper, Published paper (Refereed)
Abstract [en]

This document is the preface of the Computational Linguistics Workshop of the 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS 2024), April 12–13, 2024, held in Lviv, Ukraine.

Place, publisher, year, edition, pages
CEUR-WS, 2024
Series
CEUR Workshop Proceedings (CEUR-WS), ISSN 1613-0073 ; 3722
Keywords
computer lexicography, corpus technologies, natural language processing, NLP, ontologies
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-227956 (URN)2-s2.0-85198720383 (Scopus ID)
Conference
CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12-13, 2024
Available from: 2024-07-22 Created: 2024-07-22 Last updated: 2024-07-22Bibliographically approved
Khairova, N., Holyk, Y., Sytnikov, D., Mishcheriakov, Y. & Shanidze, N. (2024). Topic modelling of ukraine war-related news using latent dirichlet allocation with collapsed Gibbs sampling. In: ISW-CoLInS 2024. Intelligent Systems Workshop at CoLInS 2024: Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Systems. Volume III: Intelligent Systems Workshop. Paper presented at 8th International Conference on Computational Linguistics and Intelligent Systems, Lviv, Ukraine, April 12-13, 2024 (pp. 1-15). CEUR-WS
Open this publication in new window or tab >>Topic modelling of ukraine war-related news using latent dirichlet allocation with collapsed Gibbs sampling
Show others...
2024 (English)In: ISW-CoLInS 2024. Intelligent Systems Workshop at CoLInS 2024: Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Systems. Volume III: Intelligent Systems Workshop, CEUR-WS , 2024, p. 1-15Conference paper, Published paper (Refereed)
Abstract [en]

The context of this research is the application of topic modeling to war-related news in the context of the Ukraine war. The objective of the research is to use Latent Dirichlet Allocation (LDA) with Collapsed Gibbs sampling to identify distinct content groups in war-related news. The method used in the research involves data scraping from a Ukrainian news website, data preprocessing, and applying the LDA with Collapsed Gibbs algorithm to infer the latent topics within the corpus. The results of the research include the identification of twelve distinct topics and the corresponding keywords that characterize each topic. The analysis of the results provides insights into the context of each topic, such as discussions on safety measures during wartime, consequences of military actions, and reports on military casualties. The research concludes that the application of LDA with Collapsed Gibbs is a valuable tool for identifying and understanding the context of war-related news. However, there may be discrepancies between the results of the model and human interpretation, which may be due to limitations in the results, model parameters, and the presence of noise data. Future research should focus on optimizing model parameters, filtering noise data, and improving the analysis of topic context to enhance the reliability and interpretability of the results.

Place, publisher, year, edition, pages
CEUR-WS, 2024
Series
CEUR workshop proceedings, ISSN 1613-0073 ; 3688
Keywords
Latent Dirichlet Allocation, Topic modeling, Ukraine war
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-225937 (URN)2-s2.0-85195141693 (Scopus ID)
Conference
8th International Conference on Computational Linguistics and Intelligent Systems, Lviv, Ukraine, April 12-13, 2024
Available from: 2024-06-12 Created: 2024-06-12 Last updated: 2024-07-22Bibliographically approved
Khairova, N., Rizun, N., Alexopoulos, C., Ciesielska, M., Lukashevskyi, A. & Redozub, I. (2024). Understanding the Ukrainian migrants challenges in the EU: a topic modeling approach. In: Hsin-Chung Liao; David Duenas Cid; Marie Anne Macadar; Flavia Bernardini (Ed.), dg.o '24: Proceedings of the 25th Annual International Conference on Digital Government Research: . Paper presented at dg.o 2024: 25th Annual International Conference on Digital Government Research, Taipei, Taiwan, June 11-14, 2024 (pp. 196-205). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Understanding the Ukrainian migrants challenges in the EU: a topic modeling approach
Show others...
2024 (English)In: dg.o '24: Proceedings of the 25th Annual International Conference on Digital Government Research / [ed] Hsin-Chung Liao; David Duenas Cid; Marie Anne Macadar; Flavia Bernardini, Association for Computing Machinery (ACM), 2024, p. 196-205Conference paper, Published paper (Refereed)
Abstract [en]

Confronted with the aggression against Ukraine in 2022, Europe faces one of the most important humanitarian challenges - the migration of war refugees from Ukraine, most of them women with children and the elderly. Both international institutions such as the European Union and the United Nations, but also national governments and, above all, local governments, which are the main providers of services and resources for refugees, are taking a number of measures to meet the needs. The extraordinary nature and extensive humanitarian needs pose exceptional challenges for both governments and Non-Governmental Organizations (NGOs) as well as civil society. The European countries adopted distinct reception procedures to accommodate war refugees in their territories. The purpose of this paper is to examine the challenges of war refugees from Ukraine and gain an understanding of how they vary across selected European countries. Using a text analytics approach such as BERTopic topic modeling, we analyzed text messages published on Telegram channels from February 2022 to September 2023, revealing 12 challenges facing Ukrainian migrants. Furthermore, our study delves into these challenges distribution across 6 major European countries with significant migrant populations, providing insights into regional differences. Additionally, temporal changes in 8 narrative themes in discussions of Ukrainian migration, extracted from official government websites, were examined. Together, this research contributes (1) to demonstrating how analytics-driven methodology can potentially be used to extract in-depth knowledge from textual data freely available on social media; and (2) to a deeper understanding of the various issues affecting the adaptation of Ukrainian migrants in European countries. The study also provides recommendations to improve programs and policies to better support the successful integration of Ukrainian migrants in host countries.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2024
Series
ACM International Conference Proceeding Series
Keywords
European countries, Migration challenges, Social media, Topic Modelling, Ukraine
National Category
International Migration and Ethnic Relations Computer and Information Sciences
Identifiers
urn:nbn:se:umu:diva-226180 (URN)10.1145/3657054.3657252 (DOI)2-s2.0-85195266600 (Scopus ID)9798400709883 (ISBN)
Conference
dg.o 2024: 25th Annual International Conference on Digital Government Research, Taipei, Taiwan, June 11-14, 2024
Available from: 2024-06-20 Created: 2024-06-20 Last updated: 2024-07-22Bibliographically approved
Khairova, N., Galassi, A., Scudo, F. L., Ivasiuk, B. & Redozub, I. (2024). Unsupervised approach for misinformation detection in Russia-Ukraine war news. In: Nina Khairova; Victoria Vysotska (Ed.), CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024. Paper presented at CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024 (pp. 21-36). CEUR-WS, IV
Open this publication in new window or tab >>Unsupervised approach for misinformation detection in Russia-Ukraine war news
Show others...
2024 (English)In: CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024 / [ed] Nina Khairova; Victoria Vysotska, CEUR-WS , 2024, Vol. IV, p. 21-36Conference paper, Published paper (Refereed)
Abstract [en]

The Russian-Ukrainian war has attracted considerable global attention; however, fake news often obstructs the formation of public opinion and disseminates false information. To address this issue, we have curated the RUWA dataset, comprising over 16,500 news articles covering the pivotal events of the Russian invasion of Ukraine. These articles were sourced from established outlets in the USA, EU, Asia, Ukraine, and Russia, spanning the period from February to September 2022. The paper explores the use of semantic similarity to compare different aspects of articles from various web sources that cover the same events of the war. This unsupervised machine learning approach becomes crucial when obtaining annotated datasets is practically impossible due to the lack of real fact-checking during the ongoing war. The research goal is to uncover the potential of employing semantic similarity measures as a viable approach for detecting misinformation in news articles.

Place, publisher, year, edition, pages
CEUR-WS, 2024
Series
CEUR Workshop Proceedings (CEUR-WS), ISSN 1613-0073 ; 3722
Keywords
dataset, fake news detection, Misinformation issues, Russian-Ukraine war, semantic similarity
National Category
Natural Language Processing Media and Communication Studies
Identifiers
urn:nbn:se:umu:diva-227968 (URN)2-s2.0-85198728913 (Scopus ID)
Conference
CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024
Projects
Humane AI Net
Funder
EU, Horizon 2020, 952026
Available from: 2024-07-22 Created: 2024-07-22 Last updated: 2025-02-11Bibliographically approved
Khairova, N., Mamyrbayev, O., Rizun, N., Razno, M. & Galiya, Y. (2023). A parallel corpus-based approach to the crime event extraction for low-resource languages. IEEE Access, 11, 54093-54111
Open this publication in new window or tab >>A parallel corpus-based approach to the crime event extraction for low-resource languages
Show others...
2023 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 11, p. 54093-54111Article in journal (Refereed) Published
Abstract [en]

These days, a lot of crime-related events take place all over the world. Most of them are reported in news portals and social media. Crime-related event extraction from the published texts can allow monitoring, analysis, and comparison of police or criminal activities in different countries or regions. Existing approaches to event extraction mainly suggest processing texts in English, French, Chinese, and some other resource-rich and well-annotated languages. This paper presents a parallel corpus-based approach that follows a closed-domain event extraction methodology to event extraction from web news articles in low-resource languages. To identify the event, its arguments, and the arguments' roles in the source-language part of the corpus we utilize an enhanced pattern-based method that involves the multilingual synonyms dictionary with knowledge about crime-related concepts and logic-linguistic equations. The event extraction from the target-language part of the corpus uses a cross-lingual crime-related event extraction transfer technique that is based on supplementary knowledge about the semantic similarity patterns of the considered pair of languages. The presented approach does not require a preliminarily annotated corpus for training making it more attractive to low-resource languages and allows extracting TRANSFER, CRIME, and POLICE types of events and their seven subtypes from various topics of news articles simultaneously. Implementation of our approach for the Russian-Kazakh parallel corpus of news portals articles allowed obtaining the F1-measure of crime-related event extraction of over 82% for the source language and 63% for the target language.

Place, publisher, year, edition, pages
IEEE, 2023
Keywords
crime analysis, Cross-lingual transfer, event extraction, low-resource language, natural language processing, parallel corpus, semantic annotation
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-211831 (URN)10.1109/ACCESS.2023.3281680 (DOI)001005528500001 ()2-s2.0-85161064304 (Scopus ID)
Available from: 2023-07-11 Created: 2023-07-11 Last updated: 2024-07-22Bibliographically approved
Khairova, N., Hamon, T., Grabar, N. & Burov, Y. (Eds.). (2023). Colins 2023, computational linguistics and intelligent systems 2023: proceedings of the 7th international conference on computational linguistics and intelligent systems. Volume II: computational linguistics workshop, Kharkiv, Ukraine, April 20-21, 2023. Paper presented at COLINS-2023: 7th International Conference on Computational Linguistics and Intelligent Systems, Kharkiv, Ukraine, April 20–21, 2023. CEUR-WS
Open this publication in new window or tab >>Colins 2023, computational linguistics and intelligent systems 2023: proceedings of the 7th international conference on computational linguistics and intelligent systems. Volume II: computational linguistics workshop, Kharkiv, Ukraine, April 20-21, 2023
2023 (English)Conference proceedings (editor) (Refereed)
Place, publisher, year, edition, pages
CEUR-WS, 2023. p. 545
Series
CEUR Workshop Proceedings (CEUR-WS), ISSN 1613-0073 ; 3396
National Category
Natural Language Processing
Identifiers
urn:nbn:se:umu:diva-228016 (URN)
Conference
COLINS-2023: 7th International Conference on Computational Linguistics and Intelligent Systems, Kharkiv, Ukraine, April 20–21, 2023
Available from: 2024-07-22 Created: 2024-07-22 Last updated: 2025-02-07Bibliographically approved
Ybytayeva, G., Mamyrbayev, O., Khairova, N., Rizun, N., Berdali, S. & Mukhsina, K. (2023). Creating a Thesaurus "Crime-Related Web Content" Based on a Multilingual Corpus. In: Nina Khairova; Thierry Hamon; Natalia Grabar; Yevhen Burov (Ed.), CoLInS 2023, Computational Linguistics and Intelligent Systems 2023: Proceedings of the 7th International Conference on Computational Linguistics and Intelligent Systems. Volume II: Computational Linguistics Workshop. Paper presented at 7th International Conference on Computational Linguistics and Intelligent Systems, Kharkiv, Ukraine, April 20-21, 2023 (pp. 77-87). CEUR-WS
Open this publication in new window or tab >>Creating a Thesaurus "Crime-Related Web Content" Based on a Multilingual Corpus
Show others...
2023 (English)In: CoLInS 2023, Computational Linguistics and Intelligent Systems 2023: Proceedings of the 7th International Conference on Computational Linguistics and Intelligent Systems. Volume II: Computational Linguistics Workshop / [ed] Nina Khairova; Thierry Hamon; Natalia Grabar; Yevhen Burov, CEUR-WS , 2023, p. 77-87Conference paper, Published paper (Refereed)
Abstract [en]

An overview of the most common ontological resources and methods of their construction and application is given. For purposes of scientific research we analyzed the characteristics of ontologies in the public domain and corpus containing criminal context. Additionally, we have recently developed a Flask-based web application that generates ontologies using the Anytree library.

The authors also developed a multilingual basic ontology called "Illegal Web content" based on a corpus of texts in criminal context in English, Ukrainian, Kazakh and Russian languages. The development of this ontology was motivated by the need for effective analysis and prevention of criminal activities based on textual information disseminated on the internet. The newly developed web application allows users to create ontologies by importing text files in different languages, and then automatically generates an ontology based on the text. The application is user-friendly, and allows users to customize the ontology by adding or removing nodes, changing the labels of nodes and edges, and setting the weight of edges.

Overall, the development of the "Illegal Web content" ontology and the web application represents a significant contribution to the field of ontology development and text processing for criminal investigation and prevention. The main characteristics of the Web application, including its ease of use and customizability, make it a valuable tool for researchers and practitioners alike.

Place, publisher, year, edition, pages
CEUR-WS, 2023
Series
CEUR Workshop proceedings, ISSN 1613-0073
Keywords
criminal topics, Kazakh-Russian parallel corpus, Multilingual basic ontology, multilingual corpus, Web application
National Category
Computer Sciences Natural Language Processing
Identifiers
urn:nbn:se:umu:diva-209572 (URN)2-s2.0-85160814416 (Scopus ID)
Conference
7th International Conference on Computational Linguistics and Intelligent Systems, Kharkiv, Ukraine, April 20-21, 2023
Available from: 2023-06-12 Created: 2023-06-12 Last updated: 2025-02-01Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-9826-0286

Search in DiVA

Show all publications