On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition
Refereed conference paper presented and published in conference proceedings
已正式接受出版
全文
沒有全文檔案提供 |
其它資訊
摘要Accurate recognition of dysarthric and elderly speech re- main challenging tasks to date. Speaker-level heterogeneity attributed to accent or gender, when aggregated with age and speech impairment, create large diversity among these speak- ers. Scarcity of speaker-level data limits the practical use of data-intensive model based speaker adaptation methods. To this end, this paper proposes two novel forms of data-efficient. feature based on-the-fly speaker adaptation methods: variance- regularized spectral basis embedding (SVR) and spectral fea- ture driven f-LHUC transforms. Experiments conducted on UASpeech dysarthric and DementiaBank Pitt elderly speech corpora suggest the proposed on-the-fly speaker adaptation ap- proaches consistently outperform baseline iVector adapted hy- brid DNN/TDNN and E2E Conformer systems by statistically significant WER reduction of 2.48%-2.85% absolute (7.92%- 8.06% relative), and offline model based LHUC adaptation by 1.82% absolute (5.63% relative) respectively.
出版社接受日期18.05.2023
著者Mengzhe Geng, Xurong Xie, Rongfeng Su, Jianwei Yu, Zengrui Jin, Tianzi Wang, Shujie Hu, Zi Ye, Helen Meng, Xunying Liu
會議名稱ISCA Interspeech2023
會議開始日20.08.2023
會議完結日24.08.2023
會議地點Dublin
會議國家/地區愛爾蘭
出版年份2023
語言美式英語