Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Distributed representation of n-gram statistics for boosting self-organizing maps with hyperdimensional computing
Umeå University, Faculty of Medicine, Department of Radiation Sciences, Radiation Physics.ORCID iD: 0000-0002-1313-0934
Show others and affiliations
2019 (English)In: Perspectives of system informatics / [ed] Nikolaj Bjørner, Irina Virbitskaite, Andrei Voronkov, Cham: Springer, 2019, p. 64-79Conference paper, Published paper (Refereed)
Abstract [en]

This paper presents an approach for substantial reduction of the training and operating phases of Self-Organizing Maps in tasks of 2-D projection of multi-dimensional symbolic data for natural language processing such as language classification, topic extraction, and ontology development. The conventional approach for this type of problem is to use n-gram statistics as a fixed size representation for input of Self-Organizing Maps. The performance bottleneck with n-gram statistics is that the size of representation and as a result the computation time of Self-Organizing Maps grows exponentially with the size of n-grams. The presented approach is based on distributed representations of structured data using principles of hyperdimensional computing. The experiments performed on the European languages recognition task demonstrate that Self-Organizing Maps trained with distributed representations require less computations than the conventional n-gram statistics while well preserving the overall performance of Self-Organizing Maps.

Place, publisher, year, edition, pages
Cham: Springer, 2019. p. 64-79
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 11964
Keywords [en]
Self-organizing maps, n-gram statistics, Hyperdimensional computing, Symbol strings
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:umu:diva-169610DOI: 10.1007/978-3-030-37487-7_6ISI: 000612725600006Scopus ID: 2-s2.0-85077499893ISBN: 978-3-030-37486-0 (print)ISBN: 978-3-030-37487-7 (electronic)OAI: oai:DiVA.org:umu-169610DiVA, id: diva2:1422893
Conference
12th International Andrei P. Ershov Informatics Conference, PSI 2019, Novosibirsk, Russia, July 2–5, 2019
Available from: 2020-04-09 Created: 2020-04-09 Last updated: 2023-09-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wiklund, Urban

Search in DiVA

By author/editor
Wiklund, Urban
By organisation
Radiation Physics
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 324 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf