Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Beyond Adjacency Pairs: Hierarchical Clustering of Long Sequences for Human-Machine Dialogues
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0002-3036-6519
2020 (English)In: Proceedings of the First Workshop on Computational Approaches to Discourse / [ed] Chloé Braud, Christian Hardmeier, Junyi Jessy Li, Annie Louis, Michael Strube, 2020, p. 11-19Conference paper, Published paper (Refereed)
Abstract [en]

This work proposes a framework to predict sequences in dialogues, using turn based syntactic features and dialogue control functions. Syntactic features were extracted using dependency parsing, while dialogue control functions were manually labelled. These features were transformed using tf-idf and word embedding; feature selection was done using Principal Component Analysis (PCA). We ran experiments on six combinations of features to predict sequences with Hierarchical Agglomerative Clustering. An analysis of the clustering results indicate that using word embeddings and syntactic features, significantly improved the results.

Place, publisher, year, edition, pages
2020. p. 11-19
Keywords [en]
Syntactic Features, Semnatic Features, Communicative Features, Agglomerative Clustering, Dialogue Structuring, Longer Regularities, N-gram of Features, Dependency Prsing, Dimension Reduction.
National Category
Computer Systems
Research subject
Computer Science; human-computer interaction; Linguistics
Identifiers
URN: urn:nbn:se:umu:diva-175486DOI: 10.18653/v1/2020.codi-1.2OAI: oai:DiVA.org:umu-175486DiVA, id: diva2:1471769
Conference
Computational Approaches to Discourse (CODI), held in conjunction with Empirical Methods in Natural language processing (EMNLP), Virtual meeting, November 16-20, 2020
Available from: 2020-09-29 Created: 2020-09-29 Last updated: 2021-02-11Bibliographically approved

Open Access in DiVA

fulltext(794 kB)266 downloads
File information
File name FULLTEXT01.pdfFile size 794 kBChecksum SHA-512
b19f5c04f56d60d321555c4b2780467de11c808c14890b0667dea08d6ebc05c2b260f76e8340e6dcc7aed0177620e6f94b3a1d2d4a27e37d8c5a56e609d93500
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Tewari, Maitreyee

Search in DiVA

By author/editor
Tewari, Maitreyee
By organisation
Department of Computing Science
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 266 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 793 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf