Sciweavers

487 search results - page 22 / 98
» Segmentation Standard for Chinese Natural Language Processin...
Sort
View
CICLING
2001
Springer
14 years 29 days ago
Contextual Rules for Text Analysis
In this paper we describe a rule-based formalism for the analysis and labelling of texts segments. The rules are contextual rewriting rules with a restricted form of negation. They...
Dina Wonsever, Jean-Luc Minel
COLING
1994
13 years 9 months ago
Encoding standards for large text resources: The Text Encoding Initiative
The Text Encoding Initiative (TEl) is an international project established in 1988 to develop guidelines for the preparation and interchange of electronic texts for research, and t...
Nancy Ide
FINTAL
2006
14 years 4 days ago
Improving Phrase-Based Statistical Translation Through Combination of Word Alignments
This paper investigates the combination of word-alignments computed with the competitive linking algorithm and well-established IBM models. New training methods for phrase-based st...
Boxing Chen, Marcello Federico
EMNLP
2009
13 years 6 months ago
Unsupervised Tokenization for Machine Translation
Training a statistical machine translation starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creat...
Tagyoung Chung, Daniel Gildea
EMNLP
2006
13 years 10 months ago
A Hybrid Markov/Semi-Markov Conditional Random Field for Sequence Segmentation
Markov order-1 conditional random fields (CRFs) and semi-Markov CRFs are two popular models for sequence segmentation and labeling. Both models have advantages in terms of the typ...
Galen Andrew