Exploitation of Phase Information for Speaker Recognition
Refereed conference paper presented and published in conference proceedings


Abstract: Auditory experiments show that the human ear is insensitive to phase information when perceiving the phonetic content of a speech signal. However, the discarded phase information may provide useful acoustic cues for identifying individual speakers, which is especially useful for speaker recognition systems that operate on degraded magnitude information in adverse conditions. This paper is therefore motivated to derive phase-related features for reliable speaker recognition. A pertinent representation of the most dominant primary frequencies present in the speech signal is first built. It is then applied to frames of the speech signal to derive effective speaker-discriminative features. Through a set of specifically designed experiments on synthetic vowels, it is observed that the proposed features are capable of differentiating the formants and pitch harmonics present in the signal from other components, and of expressing the vocal particularities in formants of various shapes. When combined with standard cepstral parameters, these phase-related features are shown to clearly reduce the identification error rate and the equal error rate of a Gaussian mixture model-based speaker recognition system.
Authors: Wang N, Ching PC, Lee T
Conference name: 11th Annual Conference of the International Speech Communication Association 2010
Conference start date: 26.09.2010
Conference end date: 30.09.2010
Conference location: Makuhari
Conference country/region: Japan
Description: Organized by the International Speech Communication Association
Publication year: 2010
Month: 1
Day: 1
Publisher: ISCA-INST SPEECH COMMUNICATION ASSOC
Pages: 2126 - 2129
ISBN: 978-1-61782-123-3
Language: British English
Keywords: phase information; speaker recognition
Web of Science subject categories: Engineering; Engineering, Electrical & Electronic; Telecommunications

Last updated: 2020-20-10 at 01:14