Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling
Refereed conference paper presented and published in conference proceedings

Times Cited
Altmetrics Information

Other information
AbstractFor acoustical assessment of pathological speech, naturally spoken sentences are believed to be most suitable from the perspectives of both patients and clinicians. This is a challenging problem, as the extraction of pathology-dependent features is not straightforward. Previous research showed that features derived from lattice posteriors and decoding results of automatic speech recognition (ASR) could be used to quantifying various types of speech impairments. This paper describes a novel feature that can be derived from phone posterior probabilities generated by an ASR system. The Kullback-Leibler (KL) divergence is used to measure the phone-level distortion between unimpaired and impaired speakers. A Cantonese ASR system is trained with a combination of normal and impaired speech corpora. The multi-task learning approach is applied in order to incorporate different speech characteristics. Experimental results show that the proposed KL divergence feature is effective in the continuous speech based assessment of different pathologies, including voice disorder and post-stroke aphasia. The KL divergence feature is found to outperform conventional acoustic features and supra-segmental duration features, and is complementary to text features in quantifying language impairment. Index Terms: disordered speech assessment, voice disorders, aphasia, continuous speech, KL divergence, ASR, multi-task learning.
All Author(s) ListYuanyuan Liu, Ying Qin, Siyuan Feng, Tan Lee, P.C. Ching
Name of Conference11th International Symposium on Chinese Spoken Language Processing (ISCSLP)
Start Date of Conference26/11/2018
End Date of Conference29/11/2018
Place of ConferenceTaipei
Country/Region of ConferenceTaiwan
Proceedings TitleProceedings of ISCSLP 2018
Place of PublicationTaipei
Pages61 - 65
LanguagesEnglish-United States
Keywordsdisordered speech assessment, voice disorders, aphasia, continuous speech, KL divergence, ASR, multi-task learning

Last updated on 2020-23-11 at 01:58