Period Disambiguation with Maxent Model


Abstract. This paper presents our recent work on period disambiguation, the kernel problem in sentence boundary identification, using the maximum entropy (Maxent) model. We conduct a number of experiments on the PTB-II WSJ corpus to investigate how the context window, the feature space, and lexical information such as abbreviated and sentence-initial words affect learning performance. Such lexical information can be acquired automatically from a training corpus by a learner. Our experimental results show that extending the feature space to integrate these two kinds of lexical information eliminates 93.52% of the remaining errors from the baseline Maxent model, achieving an F-score of 99.8227%.
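The two ideas in the abstract, acquiring lexical information from a training corpus and describing each candidate period by a context window plus lexical cues, can be illustrated with a minimal sketch. It is not the authors' feature set: the function names `acquire_lexicon` and `period_features`, the feature names, the window size of 2, and the assumed tokenization (sentence-final periods as separate tokens, abbreviations keeping their period) are all hypothetical choices made for illustration. The resulting feature dictionaries would then be passed to a Maxent classifier, which is not shown here.

```python
def acquire_lexicon(sentences):
    """Collect lexical information from gold-segmented training sentences.

    `sentences` is a list of token lists, one list per sentence.
    Assumption: a token ending in '.' that is not the last token of its
    sentence is treated as an abbreviation (a non-boundary period).
    """
    abbreviations = set()
    initial_words = set()
    for tokens in sentences:
        if not tokens:
            continue
        initial_words.add(tokens[0])
        for tok in tokens[:-1]:
            if len(tok) > 1 and tok.endswith("."):
                abbreviations.add(tok.lower())
    return abbreviations, initial_words


def period_features(tokens, i, abbreviations, initial_words, window=2):
    """Build a feature dict for the candidate period at tokens[i]."""
    feats = {}
    # Context-window features: the words around the candidate token.
    for offset in range(-window, window + 1):
        j = i + offset
        word = tokens[j].lower() if 0 <= j < len(tokens) else "<PAD>"
        feats[f"w[{offset}]={word}"] = 1
    # Lexical features drawn from the acquired word lists.
    nxt = tokens[i + 1] if i + 1 < len(tokens) else "<PAD>"
    feats["is_abbrev"] = int(tokens[i].lower() in abbreviations)
    feats["next_is_initial"] = int(nxt in initial_words)
    feats["next_is_capitalized"] = int(nxt[:1].isupper())
    return feats


# Toy training data (invented for this example, not from the WSJ corpus).
train = [
    ["Mr.", "Smith", "arrived", "."],
    ["He", "left", "at", "5", "p.m.", "today", "."],
]
abbreviations, initial_words = acquire_lexicon(train)

stream = ["He", "met", "Mr.", "Smith", "."]
features = period_features(stream, 2, abbreviations, initial_words)
```

In this toy run the period in "Mr." is flagged as belonging to a known abbreviation, while the following word "Smith" is capitalized but was never seen sentence-initially in training. A classifier can weigh such conflicting cues against each other, which is the kind of evidence combination a Maxent model is suited to.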

Chunyu Kit, Xiaoyue Liu


Feature Space | IJCNLP 2005 | Lexical Information | Natural Language Processing | PTB-II WSJ Corpus



Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	IJCNLP
Authors	Chunyu Kit, Xiaoyue Liu
