Sciweavers

2524 search results - page 42 / 505
» Numerical document queries
Sort
View
ITCC
2003
IEEE
14 years 3 months ago
A Method for Calculating Term Similarity on Large Document Collections
We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...
Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva
PVLDB
2010
135views more  PVLDB 2010»
13 years 8 months ago
SXPath - Extending XPath towards Spatial Querying on Web Documents
Querying data from presentation formats like HTML, for purposes such as information extraction, requires the consideration of tree structures as well as the consideration of spati...
Ermelinda Oro, Massimo Ruffolo, Steffen Staab
RIAO
2007
13 years 11 months ago
Capturing Sentence Prior for Query-Based Multi-Document Summarization
In this paper, we have considered a real world information synthesis task, generation of a fixed length multi document summary which satisfies a specific information need. This...
Jagadeesh Jagarlamudi, Prasad Pingali, Vasudeva Va...
SIGIR
2011
ACM
13 years 22 days ago
Faster top-k document retrieval using block-max indexes
Large search engines process thousands of queries per second over billions of documents, making query processing a major performance bottleneck. An important class of optimization...
Shuai Ding, Torsten Suel
ICDAR
1997
IEEE
14 years 2 months ago
Document image database retrieval and browsing using texture analysis
A system is presented that uses texture to retrieve and browse images stored in a large document image database. A method of graphically generating a candidate search image is use...
John F. Cullen, Jonathan J. Hull, Peter E. Hart