Cross-lingual Speaker Adaptation via Gaussian Component Mapping
Refereed conference paper presented and published in conference proceedings


Abstract: This paper focuses on using acoustic information from an existing source language (Cantonese) to implement speaker adaptation for a new target language (English). Speaker-independent (SI) model mapping between Cantonese and English is investigated at three levels of acoustic units: phones, states, and Gaussian mixture components. With the model mapping in place, cross-lingual speaker adaptation can be performed. The performance of the proposed cross-lingual speaker adaptation system is determined by the effectiveness of the model mapping and of the speaker adaptation. Experimental results show that model mapping effectiveness increases as the mapping units become finer, and that speaker adaptation effectiveness depends on model mapping effectiveness. Mapping between Gaussian mixture components proves effective for various speech recognition tasks. Compared with the SI English recognizer, a relative error reduction of 10.12% on English words is achieved using a small amount (4 minutes) of Cantonese adaptation data.
Authors: Cao HW, Lee T, Ching PC
Conference name: 11th Annual Conference of the International Speech Communication Association 2010
Conference start date: 26.09.2010
Conference end date: 30.09.2010
Conference location: Makuhari
Conference country/region: Japan
Detailed description: Organized by the International Speech Communication Association
Publication year: 2010
Month: 1
Day: 1
Publisher: ISCA-INST SPEECH COMMUNICATION ASSOC
Pages: 869-872
ISBN: 978-1-61782-123-3
Language: British English
Keywords: cross-lingual; model mapping; speaker adaptation; speech recognition
Web of Science subject categories: Computer Science; Computer Science, Artificial Intelligence; Engineering; Engineering, Electrical & Electronic

Last updated: 2020-10-24 at 01:33