Sciweavers

168 search results - page 5 / 34
» Efficient Search in Document Image Collections
Sort
View
DAS
2006
Springer
14 years 2 months ago
Efficient Word Retrieval by Means of SOM Clustering and PCA
Abstract. We propose an approach for efficient word retrieval from printed documents belonging to Digital Libraries. The approach combines word image clustering (based on Self Orga...
Simone Marinai, Stefano Faini, Emanuele Marino, Gi...
SIGMOD
2010
ACM
199views Database» more  SIGMOD 2010»
13 years 8 months ago
Keyword search across databases and documents
Given the continuous growth of databases and the abundance of diverse files in modern IT environments, there is a pressing need to integrate keyword search on heterogeneous inform...
Carlos Garcia-Alvarado, Carlos Ordonez
ICDAR
2011
IEEE
12 years 10 months ago
Towards Searchable Digital Urdu Libraries - A Word Spotting Based Retrieval Approach
—Libraries in South Asia hold huge collections of valuable printed documents in Urdu and it is of interest to digitize these collections to make them more accessible. The unavail...
Ali Abidi, Imran Siddiqi, Khurram Khurshid
SIGIR
2008
ACM
13 years 11 months ago
SpotSigs: robust and efficient near duplicate detection in large web collections
Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...
Martin Theobald, Jonathan Siddharth, Andreas Paepc...
IJDAR
2008
136views more  IJDAR 2008»
13 years 11 months ago
Matching word images for content-based retrieval from printed document images
As large quantity of document images is getting archived by the digital libraries, there is a need for an efficient search strategies to make them available as per users informatio...
Million Meshesha, C. V. Jawahar