The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank
Refereed conference paper presented and published in conference proceedings


Full Text

Times Cited
Web of Science2WOS source URL (as at 27/02/2021) Click here for the latest count

Other information
AbstractThe lack of open discourse corpus for Chinese brings limitations for many natural language processing tasks. In this work, we present the first open discourse treebank for Chinese, namely, the Discourse Treebank for Chinese (DTBC). At the current stage, we annotated explicit intra-sentence discourse connectives, their corresponding arguments and senses for all 890 documents of the Chinese Treebank 5. We started by analysing the characteristics of discourse annotation for Chinese, adapted the annotation scheme of Penn Discourse Treebank 2 (PDTB2) to Chinese language while maintaining the compatibility as far as possible. We made adjustments to 3 essential aspects according to the previous study of Chinese linguistics. They are sense hierarchy, argument scope and semantics of arguments. Agreement study showed that our annotation scheme could achieve highly reliable results.
All Author(s) ListZhou LJ, Li BY, Wei ZY, Wong KF
Name of Conference9th International Conference on Language Resources and Evaluation (LREC)
Start Date of Conference26/05/2014
End Date of Conference31/05/2014
Place of ConferenceReykjavik
Country/Region of ConferenceIceland
Detailed descriptionorganized by The European Language Resources Association,
Year2014
Month1
Day1
PublisherEUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
Pages942 - 949
eISBN978-2-9517408-8-4
LanguagesEnglish-United Kingdom
KeywordsChinese Discourse; Discourse Annotation; Explicit Discourse Connectives
Web of Science Subject CategoriesLanguage & Linguistics; Linguistics

Last updated on 2021-27-02 at 23:45