Chinese Comma Disambiguation for Discourse Analysis

13 years 9 months ago

Download aclweb.org

The Chinese comma signals the boundary of discourse units and also anchors discourse relations between adjacent text spans. In this work, we propose a discourse structureoriented classiﬁcation of the comma that can be automatically extracted from the Chinese Treebank based on syntactic patterns. We then experimented with two supervised learning methods that automatically disambiguate the Chinese comma based on this classiﬁcation. The ﬁrst method integrates comma classiﬁcation into parsing, and the second method adopts a “post-processing” approach that extracts features from automatic parses to train a classiﬁer. The experimental results show that the second approach compares favorably against the ﬁrst approach.

Yaqin Yang, Nianwen Xue

Real-time Traffic

ACL 2012 | Chinese Treebank | Computational Linguistics | Discourse Relations | Syntactic Patterns |

claim paper

Post Info
More Details (n/a)

Added	29 Sep 2012
Updated	29 Sep 2012
Type	Journal
Year	2012
Where	ACL
Authors	Yaqin Yang, Nianwen Xue

Comments (0)

Sciweavers

Chinese Comma Disambiguation for Discourse Analysis

ACL 2012 | Chinese Treebank | Computational Linguistics | Discourse Relations | Syntactic Patterns |

Explore & Download

Productivity Tools

Sciweavers