Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

184

ANLP
2000

92views more ANLP 2000»

Tagging Sentence Boundaries

15 years 8 months ago

Tagging Sentence Boundaries

Download acl.ldc.upenn.edu

In this paper we tackle sentence boundary disambiguation through a part-of-speech (POS) tagging framework. We describe necessary changes in text tokenization and the implementation of a POS tagger and provide results of an evaluation of this system on two corpora. We also describe an extension of the traditional POS tagging by combining it with the document-centered approach to proper name identification and abbreviation handling. This made the resulting system robust to domain and topic shifts.

Andrei Mikheev

Real-time Traffic

ANLP 2000 | POS Tagger | Sentence Boundary Disambiguation | Text Tokenization |

claim paper

Related Content

» A Maximum Entropy Approach to Identifying Sentence Boundaries

» The Prague Dependency Treebank Crossing the Sentence Boundary

» Local context templates for Chinese constituent boundary prediction

» Semantic Role Tagging for Chinese at the Lexical Level

» A Unified Tagging Approach to Text Normalization

» A New Prosodic Phrasing Model for Chinese TTS Systems

» Performance evaluation for text processing of noisy inputs

» Dependency Parsing of Japanese Spoken Monologue Based on Clause Boundaries

» AutoTagTCG A Framework for Automatic Thai CG Tagging

Post Info
More Details (n/a)

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2000
Where	ANLP
Authors	Andrei Mikheev

Comments (0)