ACE: Automatic Colloquialism, Typographical and Orthographic Errors Detection for Chinese Language
Other conference paper


Other information
Abstract: We present a system called ACE for Automatic Colloquialism and Errors detection for written Chinese. ACE combines an N-gram model with a rule-based model. Although it currently focuses on detecting colloquial Cantonese (a dialect of Chinese), it can be extended to detect other dialects. We chose Cantonese because it has many interesting properties, such as a unique grammar system and a huge number of colloquial terms, that make the detection task extremely challenging. We conducted experiments using real data and synthetic data. The results indicated that ACE is highly reliable and effective.
All Author(s) List: Shichao Dong, Gabriel Pui Cheong Fung, Binyang Li, Baolin Peng, Ming Liao, Jia Zhu, Kam-Fai Wong
Name of Conference: The 26th International Conference on Computational Linguistics: System Demonstrations (COLING 2016)
Start Date of Conference: 11/12/2016
End Date of Conference: 16/12/2016
Place of Conference: Osaka
Country/Region of Conference: Japan
Year: 2016
Languages: English-United States

Last updated on 2018-18-01 at 11:20