Sciweavers

111 search results - page 10 / 23
» Word Segmentation of Vietnamese Texts: a Comparison of Appro...
Sort
View
COLING
2002
13 years 7 months ago
An Agent-based Approach to Chinese Named Entity Recognition
Chinese NE (Named Entity) recognition is a difficult problem because of the uncertainty in word segmentation and flexibility in language structure. This paper proposes the use of ...
Shiren Ye, Tat-Seng Chua, Jimin Liu
ICDAR
2009
IEEE
14 years 1 months ago
Word-Based Adaptive OCR for Historical Books
The aim of this work is to propose a new approach to the recognition of historical texts by providing an adaptive mechanism that automatically tunes itself to a specific book. Th...
Vladimir Kluzner, Asaf Tzadok, Yuval Shimony, Euge...
ACL
2006
13 years 8 months ago
A Collaborative Framework for Collecting Thai Unknown Words from the Web
We propose a collaborative framework for collecting Thai unknown words found on Web pages over the Internet. Our main goal is to design and construct a Webbased system which allow...
Choochart Haruechaiyasak, Chatchawal Sangkeettraka...
AIMSA
2008
Springer
14 years 1 months ago
Using Text Segmentation to Enhance the Cluster Hypothesis
An alternative way to tackle Information Retrieval, called Passage Retrieval, considers text fragments independently rather than assessing global relevance of documents. In such a ...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
CICLING
2010
Springer
13 years 2 months ago
A Chunk-Driven Bootstrapping Approach to Extracting Translation Patterns
Abstract. We present a linguistically-motivated sub-sentential alignment system that extends the intersected IBM Model 4 word alignments. The alignment system is chunk-driven and r...
Lieve Macken, Walter Daelemans