Umeå University's logo

umu.sePublications
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-6th-edition.csl
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Models for effective categorization and classification of texts into specific thematic groups (using gender and criminal themes as examples)
Umeå University, Faculty of Science and Technology, Department of Computing Science. National Technical University “Kharkiv Polytechnic Institute”, Kharkiv, Ukraine.ORCID iD: 0000-0002-9826-0286
National Technical University “Kharkiv Polytechnic Institute”, Kharkiv, Ukraine.
National Technical University “Kharkiv Polytechnic Institute”, Kharkiv, Ukraine.
National Technical University “Kharkiv Polytechnic Institute”, Kharkiv, Ukraine.
2024 (English)In: CLW-CoLInS 2024, computational linguistics workshop at Colins 2024: proceedings of the 8th international conference on computational linguistics and intelligent systems. Volume IV: computational linguistics workshop, Lviv, Ukraine, April 12-13, 2024 / [ed] Nina Khairova; Victoria Vysotska, CEUR-WS , 2024, Vol. IV, p. 37-49Conference paper, Published paper (Refereed)
Abstract [en]

An analysis of existing automated methods for text classification, used to develop an effective approach for automated text classification by thematic groups in the context of information related to criminal and gender themes, was conducted. Based on the analysis of classification methods, an algorithm for classifying texts by types of crime and gender was developed, information-linguistic and software for the task of distributing texts into thematic groups were developed, and the effectiveness of the developed application was assessed.

Place, publisher, year, edition, pages
CEUR-WS , 2024. Vol. IV, p. 37-49
Series
CEUR Workshop Proceedings (CEUR-WS), ISSN 1613-0073 ; 3722
Keywords [en]
categorization, Classification, criminal justice theme, gender criminology, gender stereotypes, social practices, thematic groups
National Category
Natural Language Processing
Identifiers
URN: urn:nbn:se:umu:diva-227967Scopus ID: 2-s2.0-85198743336OAI: oai:DiVA.org:umu-227967DiVA, id: diva2:1885255
Conference
CLW-2024: Computational Linguistics Workshop at 8th International Conference on Computational Linguistics and Intelligent Systems (CoLInS-2024), Lviv, Ukraine, April 12–13, 2024
Available from: 2024-07-22 Created: 2024-07-22 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

fulltext(724 kB)125 downloads
File information
File name FULLTEXT01.pdfFile size 724 kBChecksum SHA-512
745d2674192ca8cb0e4a693572e1e812b27f5ddc1a8b69bf20c6013db7075c78785330aa295bf7505f9e2397da541b46110ed8585d2876b1730a99c0df668e7d
Type fulltextMimetype application/pdf

Other links

ScopusProceedings

Authority records

Khairova, Nina

Search in DiVA

By author/editor
Khairova, Nina
By organisation
Department of Computing Science
Natural Language Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 125 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 320 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-6th-edition.csl
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf