Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Sentiment Analysis for Swedish: The Impact of Emojis on Sentiment Analysis of Swedish Informal Texts
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2023 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

This study investigates the use of emojis in sentiment analysis for the Swedish language, with the objective to assess if emojis improve the performance of the model. Sentiment analysis is an NLP classification task aimed at extracting people's opinions, sentiments, and attitudes from language. Though sentiment analysis as a research area has made a lot of progress recently, there are still some challenges to overcome. In this work, two of these challenges were considered; the analysis of a non-English language and the impact of emojis. These areas were explored through creating a sentiment annotated dataset of Swedish texts containing emojis, and creating a Swedish sentiment analysis model for evaluation. The sentiment analysis model created, SweVADER, was based on the English Lexicon-based model VADER. 

The best performing SweVADER model achieved an accuracy of 0.53 and an F1-score of 0.47. Furthermore, the presence of emojis improved the analysis for most models, but not by much. The results indicate that the use of emojis can improve the sentiment analysis, but there were other features affecting the results as well. The sentiment lexicon used plays a key role, and pre-processing techniques like stemming could affect the performance too. A takeaway from this study is that emojis contain important sentiment information, and should not be disregarded. Furthermore, emojis are useful when analyzing texts, if there is a lack of linguistic resources for the language in question.

Place, publisher, year, edition, pages
2023. , p. 34
Series
UMNAD ; 1379
Keywords [en]
sentiment analysis, NLP, natural language processing, emojis, swedish
National Category
Computer Sciences Engineering and Technology
Identifiers
URN: urn:nbn:se:umu:diva-209069OAI: oai:DiVA.org:umu-209069DiVA, id: diva2:1762861
External cooperation
Daresay by Knightec
Subject / course
Examensarbete i Interaktionsteknik och design
Educational program
Master of Science Programme in Interaction Technology and Design - Engineering
Supervisors
Examiners
Available from: 2023-06-08 Created: 2023-06-05 Last updated: 2023-06-08Bibliographically approved

Open Access in DiVA

sentiment-analysis-swedish-lovisa-berggren(1061 kB)515 downloads
File information
File name FULLTEXT01.pdfFile size 1061 kBChecksum SHA-512
186dfff44e2d3fda1eab4e0c9579620d79f40b57b69fa7f26182dc94b0c0302157e3a0851c381c129d7d4877fd4c816a2d00d889e470933ab61590258c32576f
Type fulltextMimetype application/pdf

By organisation
Department of Computing Science
Computer SciencesEngineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 515 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1278 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf