Cross-language speaker identification using spectral moments
Umeå University, Faculty of Arts, Philosophy and Linguistics.
2004 (English). In: Proceedings of the XVIIth Swedish Phonetics Conference FONETIK 2004, Stockholm University, 2004, 76-79 p. Chapter in book (Other academic)
Place, publisher, year, edition, pages
Stockholm University, 2004. 76-79 p.
National Category
General Language Studies and Linguistics
URN: urn:nbn:se:umu:diva-2282
OAI: diva2:140213
Available from: 2007-05-03 Created: 2007-05-03 Last updated: 2013-04-16 Bibliographically approved
In thesis
1. That voice sounds familiar: factors in speaker recognition
2007 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Humans have the ability to recognize other humans by voice alone. This is important both socially and for the robustness of speech perception. This thesis contains a set of eight studies that investigate how different factors affect speaker recognition and how these factors can help explain how listeners perceive and evaluate speaker identity. The first study is a review paper giving an overview of emotion decoding and encoding research. The second study compares the relative importance of the emotional tone in the voice and the emotional content of the message. A mismatch between these was shown to affect decoding speed. The third study investigates dialect as a factor in speaker recognition and shows, using a bidialectal speaker as the target voice to control all other variables, that the dominance of dialect cannot be overcome. The fourth paper investigates whether imitated stage dialects are as perceptually dominant as natural dialects. It was found that a professional actor could disguise his voice successfully by imitating a dialect, yet that a listener's proficiency in a language or accent can reduce susceptibility to a dialect imitation. Papers five to seven focus on automatic techniques for speaker separation. Paper five shows that a method developed for Australian English diphthongs produced comparable results when applied to a Swedish glide + vowel transition. The sixth and seventh papers investigate a speaker separation technique developed for American English. It was found that the technique could be used to separate Swedish speakers and that it is robust against professional imitations. Paper eight investigates how age and hearing affect earwitness reliability. This study shows that a senior citizen with corrected hearing can be as reliable an earwitness as a younger adult with no hearing problem, but suggests that any deterioration in a witness's general cognitive skills needs to be considered when assessing a senior citizen's earwitness evidence.
On the basis of the studies, a model of speaker recognition is presented, based on the face recognition model by Bruce and Young (1986; British Journal of Psychology, 77, pp. 305-327) and the voice recognition model by Belin, Fecteau and Bédard (2004; Trends in Cognitive Sciences, 8, pp. 129-134). The merged and modified model handles both familiar and unfamiliar voices. The findings presented in this thesis, in particular the findings of the individual papers in Part II, have implications for criminal cases in which speaker recognition forms a part. The findings feed directly into the growing body of forensic phonetic and forensic linguistic research.

Place, publisher, year, edition, pages
Umeå: Filosofi och lingvistik, 2007. 160 p.
Keywords: speaker recognition, accent, emotions, hearing, spectral moments, formant transitions, dialect
National Category
Human Computer Interaction
urn:nbn:se:umu:diva-1106 (URN)
978-91-7264-311-6 (ISBN)
Public defence
2007-05-24, Hörsal F, Humanisthuset, Umeå, 10:00
Available from: 2007-05-03 Created: 2007-05-03 Last updated: 2013-04-09 Bibliographically approved

By author/editor
Eriksson, Erik J