Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Named Entity Recognition for Detecting Trends in Biomedical Literature
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2024 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The number of publications in the biomedical field increases exponentially, which makes the task of keeping up with current research more and more difficult. However, rapid advances in the field of Natural Language Processing (NLP) offer possible solutions to this problem. In this thesis we focus on investigating three main questions of importance for utilizing the field of NLP, or more specifically the two subfields Named Entity Recognition (NER) and Large Language Models (LLM), to help solve this problem. The questions are; comparing LLM performance to NER models on NER-tasks, the importance of normalization, and how the analysis is affected by the availability of data. We find for the first question that the two models offer a reasonably comparable performance for the specific task we are looking at. For the second question, we find that normalization plays a substantial role in improving the results for tasks involving data synthesis and analysis. Lastly, for the third question, we find that it is important to have access to full papers in most cases since important information can be hidden outside of the abstracts.

Place, publisher, year, edition, pages
2024. , p. 58
Series
UMNAD ; 1453
Keywords [en]
NLP NER CHO
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-222394OAI: oai:DiVA.org:umu-222394DiVA, id: diva2:1845028
External cooperation
Sartorius Stedim Data Analytics
Educational program
Master of Science Programme in Computing Science and Engineering
Supervisors
Examiners
Available from: 2024-03-18 Created: 2024-03-15 Last updated: 2024-03-18Bibliographically approved

Open Access in DiVA

fulltext(1857 kB)368 downloads
File information
File name FULLTEXT01.pdfFile size 1857 kBChecksum SHA-512
5a48603b0b28178e276c740397a43f13e6d8bb1457209419846e206a17d7ebc2d2795ad6c6ede5d7d87af592055458de2d8559ff37a966b6524021099e1287ef
Type fulltextMimetype application/pdf

By organisation
Department of Computing Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 369 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 731 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf