Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Speaking to see: A feasibility study of voice-assisted visual search
Umeå universitet, Samhällsvetenskapliga fakulteten, Institutionen för informatik. University of Bergen, Department of Information Science and Media Studies, Bergen, Norway.
Umeå universitet, Samhällsvetenskapliga fakulteten, Institutionen för informatik.
2011 (Engelska)Ingår i: Human-Computer Interaction - INTERACT 2011 - 13th IFIP TC 13 International Conference, Lisbon, Portugal, September 5-9, 2011, Proceedings, Part I / [ed] Pedro F. Campos, T. C. Nicholas Graham, Joaquim A. Jorge, Nuno Jardim Nunes, Philippe A. Palanque, and Marco Winckler, Springer Publishing Company, 2011, s. 444-451Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The paper presents the concept, implementation, and a feasibility study of a user interface technique, named VAVS ("voice-assisted visual search"). VAVS employs user's voice input for assisting the user in searching for objects of interest in complex displays. User voice input is compared with attributes of visually presented objects and, if there is a match, the matching object is highlighted to help the user visually locate the object. The paper discusses differences between, on the one hand, VAVS and, on the other hand, voice commands and multimodal input techniques. An interactive prototype implementing the VAVS concept and employing a standard voice recognition program is described. The paper reports an empirical study, in which an object location task was carried out with and without VAVS. It was found that the VAVS condition was associated with higher performance and use satisfaction. The paper concludes with a discussion of directions for future work. 

Ort, förlag, år, upplaga, sidor
Springer Publishing Company, 2011. s. 444-451
Serie
Lecture Notes in Computer Science, ISSN 1611-3349 ; 6946
Nyckelord [en]
multimodal input, visual search, voice command, Voice recognition
Nationell ämneskategori
Människa-datorinteraktion (interaktionsdesign)
Forskningsämne
data- och systemvetenskap
Identifikatorer
URN: urn:nbn:se:umu:diva-51213DOI: 10.1007/978-3-642-23774-4_37Scopus ID: 2-s2.0-80052821044ISBN: 978-3-642-23773-7 (tryckt)ISBN: 978-3-642-23774-4 (digital)OAI: oai:DiVA.org:umu-51213DiVA, id: diva2:477061
Konferens
13th IFIP TC 13 International Conference, Lisbon, Portugal, September 5-9, 2011
Tillgänglig från: 2012-01-12 Skapad: 2012-01-12 Senast uppdaterad: 2023-07-31Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Kaptelinin, Victor

Sök vidare i DiVA

Av författaren/redaktören
Kaptelinin, Victor
Av organisationen
Institutionen för informatik
Människa-datorinteraktion (interaktionsdesign)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 291 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf