Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders
Refereed conference paper presented and published in conference proceedings

Other information
Abstract: This paper presents an enhanced pipeline system for automated screening of neurocognitive disorders, such as Alzheimer’s disease (AD), using spoken language technologies. To ensure local relevance, the pipeline is applied to two-way interactions between clinical assessors and older adult participants in spoken Cantonese, the predominant language used in Hong Kong. The pipeline includes: (i) speaker diarization using speaker-turn-aware scoring to capture the temporal structure of conversations; (ii) ASR using XLS-R wav2vec 2.0 models further pre-trained on Cantonese speech data and fine-tuned; (iii) language modelling using RoBERTa with further fine-tuning; and (iv) AD screening with neural network classification. A reference benchmark is obtained using the ADReSS corpus, where no diarization is needed, and the partial pipeline attained a competitive detection accuracy of 87.5%.
Acceptance Date: 18/05/2023
All Author(s) List: Helen Meng, Brian Mak, Man-Wai Mak, Helene Fung, Xianmin Gong, Timothy Kwok, Xunying Liu, Vincent C. T. Mok, Patrick Wong, Jean Woo, Xixin Wu, Ka Ho Wong, Shensheng Xu, Naijun Zheng, Ranzo Huang, Jiawen Kang, Xiaoquan Ke, Junan Li, Jinchao Li, Yi Wang
Name of Conference: ISCA Interspeech 2023
Start Date of Conference: 20/08/2023
End Date of Conference: 24/08/2023
Place of Conference: Dublin
Country/Region of Conference: Ireland
Year: 2023
Languages: English-United States

Last updated on 2023-10-10 at 09:37