Change search
ReferencesLink to record
Permanent link

Direct link
Capturing Curiosity: A Comparison of Machine Learning Algorithms for Classification of Curiosity in Text
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2016 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

The amount of text available to us on a daily basis, in the form of blogs, news articles, and social media updates, is larger then it has ever been. Being able to analyze large amounts of text and being able to determine its overall sentiment by using machinelearning algorithms has been a large area of research during the last few decades. This thesis will attempt to build on that work by looking at three different algorithms – Naive Bayes, Support Vector Machine and J48 Decision Tree, and evaluating their performance on the special problem of identifying curiosity in text. It also examines differences in result depending on how the feature selection is performed. The results indicate that Naive Bayes performs the best at the task.

Place, publisher, year, edition, pages
2016. , 16 p.
UMNAD, 1056
National Category
Engineering and Technology
URN: urn:nbn:se:umu:diva-126893OAI: diva2:1038664
Educational program
Bachelor of Science Programme in Computing Science
Available from: 2016-10-19 Created: 2016-10-19 Last updated: 2016-11-30Bibliographically approved

Open Access in DiVA

fulltext(283 kB)8 downloads
File information
File name FULLTEXT01.pdfFile size 283 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Computing Science
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 8 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 111 hits
ReferencesLink to record
Permanent link

Direct link