Sciweavers

224 search results - page 9 / 45
» Semi-parametric and Non-parametric Term Weighting for Inform...
Sort
View
ITCC
2003
IEEE
14 years 1 months ago
A Method for Calculating Term Similarity on Large Document Collections
We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...
Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva
ANLP
1994
86views more  ANLP 1994»
13 years 9 months ago
Robust Text Processing in Automated Information Retrieval
We report on the results of a series of experiments with a prototype text retrieval system which uses relatively advanced natural language processing techniques in order to enhanc...
Tomek Strzalkowski
CLEF
2006
Springer
13 years 11 months ago
Amharic-English Information Retrieval
We describe Amharic-English cross lingual information retrieval experiments in the adhoc bilingual tracs of the CLEF 2006. The query analysis is supported by morphological analysi...
Atelach Alemu Argaw, Lars Asker
SIGIR
2006
ACM
14 years 1 months ago
Measuring similarity of semi-structured documents with context weights
In this work, we study similarity measures for text-centric XML documents based on an extended vector space model, which considers both document content and structure. Experimenta...
Christopher C. Yang, Nan Liu
SIGIR
1995
ACM
13 years 11 months ago
Probability Kinematics in Information Retrieval
We analyse the kinematics of probabilistic term weights at retrieval time for di erent Information Retrieval models. We present four models based on di erent notions of probabilis...
Fabio Crestani, C. J. van Rijsbergen