Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-level ASR Posterior Features
Refereed conference paper presented and published in conference proceedings


引用次數
替代計量分析
.

其它資訊
摘要Most previous studies on acoustic assessment of disordered voice were focused on extracting perturbation features from isolated vowels produced with steady-state phonation. Natural speech, however, is considered to be more preferable in the aspects of flexibility, effectiveness and reliability for clinical practice. This paper presents an investigation on applying automatic speech recognition (ASR) technology to disordered voice assessment of Cantonese speakers. A DNN-based ASR system is trained using phonetically-rich continuous utterances from normal
speakers. It was found that frame-level phone posteriors obtained from the ASR system are strongly correlated with the severity level of voice disorder. Phone posteriors in utterances with severe disorder exhibit significantly larger variation than those with mild disorder. A set of utterance-level posterior features are computed to quantify such variation for pattern recognition purpose. An SVM based classifier is used to classify an input utterance into the categories of mild, moderate and severe disorder. The two-class classification accuracy for mild and severe disorders is 90.3%, and significant confusion between mild and moderate disorders is observed. For some of the subjects
with severe voice disorder, the classification results are highly inconsistent among individual utterances. Furthermore, short utterances tend to have more classification errors.
Index Terms: disordered voice, continuous speech, speech recognition, acoustic posteriors
著者Liu Y. Y., Lee T., Ching P. C., Law T., Lee K. Y. S.
會議名稱INTERSPEECH 2017
會議開始日20.08.2017
會議完結日24.08.2017
會議地點Stockholm
會議國家/地區瑞典
會議論文集題名Proceedings of Interspeech 2017
出版年份2017
月份8
日期20
出版地Sweden
頁次2680 - 2684
語言美式英語
關鍵詞disordered voice, continuous speech, speech recognition, acoustic posteriors

上次更新時間 2020-14-10 於 03:46