umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Textual Summarization of Scientific Publications and Usage Patterns
Umeå University, Faculty of Science and Technology, Department of Computing Science. (mcs10aok@cs.umu.se)
2012 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

In this study, we propose textual summarization for scientific publications and mobile phone usage patterns. Textual summarization is a process that takes a source document or set of related documents, identifying the most salient information and conveying it in less space than the original text. The increasing availability of information has necessitated deep research for textual summarization within Information Retrieval and the Natural Language Processing (NLP) area because textual summaries are easier to read, and provide to access to large repositories of content data in an efficient way. For example, snippets in web search are helpful for users as textual summaries. While there exists summarization tools for textual summarization, either they are not adapted to scientific collection of documents or they summarize short form of text such as news. In the first part of this study, we adapt the MEAD 3.11 summarization tool [19] to propose a method for building summaries of a set of related scientific articles by exploiting the structure of scientific publications in order to focus on some parts that are known to be the most informative in such documents. In the second part, we generate a natural language statement that describes a more readable form of a given symbolic pattern extracted from Nokia Challenge data. The reason is that the availability of mobile phone usage details enables new opportunities to provide a better understanding of the interest of user populations in mobile phone applications. For evaluating the first part of study, we make use of Amazon Mechanical Turk (Mturk) to validate summarization output.

Place, publisher, year, edition, pages
2012.
Series
UMNAD, ISSN 931
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:umu:diva-61840OAI: oai:DiVA.org:umu-61840DiVA: diva2:572240
Educational program
Master's Programme in Computing Science
Uppsok
Technology
Supervisors
Examiners
Available from: 2012-11-27 Created: 2012-11-27 Last updated: 2012-12-14Bibliographically approved

Open Access in DiVA

fulltext(4814 kB)529 downloads
File information
File name FULLTEXT02.pdfFile size 4814 kBChecksum SHA-512
3b53c44223f098fc73afc5376c05c191c7f8bb81a7d3d8adfa6e67c0f0cd434d5866d4425cc92b5e979f12c2e7e8f08301c4349a5860ddf70e51e0c4293957ec
Type fulltextMimetype application/pdf

By organisation
Department of Computing Science
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 537 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 283 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf