Multi-scale audio indexing for Chinese spoken document retrieval
Refereed conference paper presented and published in conference proceedings


Other information
Abstract: The advent of the information age has brought massive digital libraries of multimedia content. This development creates a high demand for information indexing and retrieval technologies, and the capability of browsing through audio archives is much desired. This paper reports on our initial attempt to use syllable units for Chinese spoken document retrieval. Our experiments are based on 1801 news stories from local television broadcasts in Cantonese, a monosyllabic Chinese dialect with a rich tonal structure. Results show that indexing with overlapping bi-syllables (tonal syllables) mapped from text delivers the reference retrieval performance, with an average inverse rank (AIR) of 0.830. Retrieval based on overlapping bi-syllables (base syllables) recognized from audio achieved an AIR of 0.460.
All Author(s) List: Meng H.M., Lo W.K., Li Y.C., Ching P.C.
Name of Conference: 6th International Conference on Spoken Language Processing, ICSLP 2000
Start Date of Conference: 16/10/2000
End Date of Conference: 20/10/2000
Place of Conference: Beijing
Country/Region of Conference: China
Year: 2000
Month: 1
Day: 1
ISBN: 7801501144
Languages: English-United Kingdom

Last updated on 2020-06-09 at 01:34