Sciweavers

290 search results - page 41 / 58
» Document normalization revisited
Sort
View
IJCAI
1997
13 years 8 months ago
Toward Structured Retrieval in Semi-structured Information Spaces
A semi-structured information space consists of multiple collections of textual documents containing fielded or tagged sections. The space can be highly heterogeneous, because eac...
Scott B. Huffman, Catherine Baudin
ECIR
2010
Springer
13 years 5 months ago
Query Aspect Based Term Weighting Regularization in Information Retrieval
Abstract. Traditional retrieval models assume that query terms are independent and rank documents primarily based on various term weighting strategies including TF-IDF and document...
Wei Zheng, Hui Fang
JCIT
2010
112views more  JCIT 2010»
13 years 2 months ago
A Preprocessing Method for NaXi Pictograph Character Recognition
Preprocessing, a major component of Character Recognition System, has direct effect on the recognition system by its performance. A preprocessing method for NaXi Pictograph Charac...
Hai Guo, Jing-ying Zhao, Ming-jun Da
AAAI
2011
12 years 7 months ago
Exploiting Phase Transition in Latent Networks for Clustering
In this paper, we model the pair-wise similarities of a set of documents as a weighted network with a single cutoff parameter. Such a network can be thought of an ensemble of unwe...
Vahed Qazvinian, Dragomir R. Radev
SIGMOD
2005
ACM
154views Database» more  SIGMOD 2005»
14 years 7 months ago
Lazy XML Updates: Laziness as a Virtue of Update and Structural Join Efficiency
XML documents are normally stored as plain text files. Hence, the natural and most convenient way to update XML documents is to simply edit the text files. But efficient query eva...
Barbara Catania, Wen Qiang Wang, Beng Chin Ooi, Xi...