Tree topological features for unlexicalized parsing
Refereed conference paper presented and published in conference proceedings



Other information
Abstract: Because unlexicalized parsing lacks word token information, it is important to investigate novel parsing features that improve accuracy. This paper studies a set of tree topological (TT) features, which quantitatively describe the shape of the tree dominated by each non-terminal node. The features are useful in capturing linguistic notions such as grammatical weight and syntactic branching, which are important factors in syntactic processing but have been overlooked in the parsing literature. When used in an ensemble classifier-based model, TT features significantly improve the parsing accuracy of our unlexicalized parser. Further, the ease of estimating TT feature values makes them easy to incorporate into virtually any mainstream parser.
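Authors: Chan S.W.K., Cheung L.Y.L., Chong M.W.C.
Conference name: 23rd International Conference on Computational Linguistics, Coling 2010
Conference start date: 23.08.2010
Conference end date: 27.08.2010
Conference location: Beijing
Conference country/region: China
Detailed description: organized by Chinese Information Processing Society of China
Year of publication: 2010
Month: 12
Day: 1
Volume: 2
Pages: 117-125
Language: British English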

Last updated: 28.07.2020 at 00:59