Sciweavers

722 search results
» On the use of words and n-grams for Chinese information retr...
CLEF
2006
Springer
13 years 11 months ago
A First Approach to CLIR Using Character N-Grams Alignment
Abstract. This paper describes the technique for translation of character n-grams we developed for our participation in CLEF 2006. This solution avoids the need for word normalizat...
Jesús Vilares, Michael P. Oakes, John Tait
CICLING
2010
Springer
13 years 11 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents – which includes text reuse and its unacknowledged commitment, plagiarism – has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
ICTAI
2007
IEEE
14 years 1 month ago
Webpage Genre Identification Using Variable-Length Character n-Grams
An important factor for discriminating between webpages is their genre (e.g., blogs, personal homepages, e-shops, online newspapers, etc.). Webpage genre identification has a great...
Ioannis Kanaris, Efstathios Stamatatos
ICDAR
2011
IEEE
12 years 7 months ago
Character n-Gram Spotting in Document Images
In this paper, we present a novel approach to search and retrieve from document image collections, without explicit recognition. Existing recognition-free approaches such as wor...
M. Sudha Praveen, K. Pramod Sankar, C. V. Jawahar
IRAL
2000
ACM
13 years 11 months ago
Construction of a Chinese-English WordNet and its application to CLIR
This paper integrates five linguistic resources, including Cilin, a Chinese-English dictionary, ASBC corpus, SemCor, and WordNet, to construct a Chinese-English WordNet. The resul...
Hsin-Hsi Chen, Chi-Ching Lin, Wen-Cheng Lin