Change search
ReferencesLink to record
Permanent link

Direct link
Cues for hesitation in speech synthesis
Centrum för talteknologi, KTH.
Centrum för talteknologi, KTH.
Umeå University, Faculty of Arts, Comparative Literature and Scandinavian Languages.
2006 (English)In: Proceedings of Interspeech 2006: Ninth International Conference of Spoken Language Processing, Pittsburgh, USA, 2006, 1300-1303 p.Conference paper (Refereed)
Abstract [en]

The current study investigates acoustic correlates to perceived hesitation based on previous work showing that pause duration and final lengthening both contribute to the perception of hesitation. It is the total duration increase that is the valid cue rather than the contribution by either factor. The present experiment using speech synthesis was designed to evaluate F0 slope and presence vs. absence of creaky voice before the inserted hesitation in addition to durational cues. The manipulations occurred in two syntactic positions, within a phrase and between two phrases, respectively. The results showed that in addition to durational increase, variation of both F0 slope and creaky voice had perceptual effects, although to a much lesser degree. The results have a bearing on efforts to model spontaneous speech including disfluencies, to be explored, for example, in spoken dialogue systems.

Place, publisher, year, edition, pages
2006. 1300-1303 p.
URN: urn:nbn:se:umu:diva-18691OAI: diva2:174505
Available from: 2009-02-23 Created: 2009-02-23 Last updated: 2010-12-20Bibliographically approved

Open Access in DiVA

No full text

Other links

Search in DiVA

By author/editor
Strangert, Eva
By organisation
Comparative Literature and Scandinavian Languages

Search outside of DiVA

GoogleGoogle ScholarTotal: 11 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 45 hits
ReferencesLink to record
Permanent link

Direct link