Using a Partially Annotated Corpus to Build a Dependency Parser for Japanese

16 years 28 days ago

Download www.aclweb.org

Abstract. We explore the use of a partially annotated corpus to build a dependency parser for Japanese. We examine two types of partially annotated corpora. It is found that a parser trained with a corpus that does not have any grammatical tags for words can demonstrate an accuracy of 87.38%, which is comparable to the current state-of-the-art accuracy on the Kyoto University Corpus. In contrast, a parser trained with a corpus that has only dependency annotations for each two adjacent bunsetsus (chunks) shows moderate performance. Nonetheless, it is notable that features based on character n-grams are found very useful for a dependency parser for Japanese.

Manabu Sassano

Real-time Traffic

Current State-of-the-art Accuracy | Dependency Parser | IJCNLP 2005 | Kyoto University Corpus | Natural Language Processing |

claim paper

» Training Conditional Random Fields Using Incomplete Annotations

» FrAG a Hybrid Constraint Grammar Parser for French

» Mining Parenthetical Translations from the Web by Word Alignment

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	IJCNLP
Authors	Manabu Sassano

Comments (0)

Sciweavers

Using a Partially Annotated Corpus to Build a Dependency Parser for Japanese

Current State-of-the-art Accuracy | Dependency Parser | IJCNLP 2005 | Kyoto University Corpus | Natural Language Processing |

Explore & Download

Productivity Tools

Sciweavers