umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
EVALUATING RECORD LINKAGE METHODS FOR MANIFOLD IDENTITY DETECTION
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2019 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Record Linkage is the process of linking two or more records in a database to the same real life entity. Th‘ese records do not share a common identifi€er. ‘This makes connecting them to each other a difficult task since they can only be linked based on similarities in their data. Th‘is data can also contain errors due to misspellings or missing €fields further increasing the difficulty of the task. In this thesis, common methods for comparing records and fi€nding duplicates are presented. Methods for increasing the performance and reducing the computer power needed are also presented to show how record-linkage can be used with big amounts of data. Built on this knowledge, several experiments comparing these methods have been conducted, using data from two benchmark data sets including Freely Extensible Biomedical Record Linkage (FEBRL) and the North Carolina Voter Registration (NCVR) data set. ‘The results presented show that di‚fferent types of similarity measures can have similar performance, and that supervised methods provide be‹tter prediction rates than unsupervised methods. Finally, suggestions for future work and improvements are given.

Place, publisher, year, edition, pages
2019. , p. 50
Series
UMNAD ; 1204
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:umu:diva-164720OAI: oai:DiVA.org:umu-164720DiVA, id: diva2:1366355
External cooperation
Skatteverket
Educational program
Master of Science Programme in Computing Science and Engineering
Supervisors
Examiners
Available from: 2019-10-29 Created: 2019-10-29 Last updated: 2019-10-29Bibliographically approved

Open Access in DiVA

fulltext(1809 kB)22 downloads
File information
File name FULLTEXT01.pdfFile size 1809 kBChecksum SHA-512
4c8bfc20474b4369d2b107949835b38bbb7149415e3e943089f130555c942e90d3f33c00e8f5aac054367d8b8cf73a7ae52efe816ef1afd21bdd3dabe5b2f7b5
Type fulltextMimetype application/pdf

By organisation
Department of Computing Science
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 22 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 160 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf