Using cross-syllable units for Cantonese speech synthesis
Refereed conference paper presented and published in conference proceedings


Full Text

Times Cited

Other information
AbstractMonosyllables have been widely accepted as the basic units for concatenative speech synthesis of Chinese dialects. However, concatenating individual syllables is not adequate to produce highly natural synthetic speech because of the improper coupling at syllable boundaries. This paper describes a preliminary research of using cross-syllable units for Cantonese speech synthesis. The acoustic inventory contains 1,725 cross-syllable units, which are excised from properly selected and recorded carrier words. TD-PSOLA is employed for prosodic modification of synthetic speech. The results of subjective listening tests reveal that the proposed use of cross-syllable units has potential in producing highly natural synthetic speech, although the currently achieved performance is only fair. Substantial improvement is anticipated with better smoothing technique for waveform concatenation and greater coverage of context-dependent variation of the acoustic units.
All Author(s) ListLaw K.M., Lee T.
Name of Conference6th International Conference on Spoken Language Processing, ICSLP 2000
Start Date of Conference16/10/2000
End Date of Conference20/10/2000
Place of ConferenceBeijing
Country/Region of ConferenceChina
Detailed descriptionvol.2
Year2000
Month1
Day1
ISBN7801501144
LanguagesEnglish-United Kingdom

Last updated on 2020-06-09 at 01:34