Sciweavers

2763 search results - page 171 / 553
» Retrieval of Ottoman documents
Sort
View
ICTIR
2009
Springer
15 years 10 months ago
Modeling the Score Distributions of Relevant and Non-relevant Documents
Empirical modeling of the score distributions associated with retrieved documents is an essential task for many retrieval applications. In this work, we propose modeling the releva...
Evangelos Kanoulas, Virgiliu Pavlu, Keshi Dai, Jav...
HICSS
1997
IEEE
89views Biometrics» more  HICSS 1997»
15 years 8 months ago
Text Types in Hypermedia
The discipline of narratology has long recognized the need to classify documents as instances of different text types. We have discovered that classification is as applicable to h...
Stephen W. Smoliar, James D. Baker
SIGIR
2009
ACM
15 years 10 months ago
Brute force and indexed approaches to pairwise document similarity comparisons with MapReduce
This paper explores the problem of computing pairwise similarity on document collections, focusing on the application of “more like this” queries in the life sciences domain. ...
Jimmy J. Lin
CIKM
2003
Springer
15 years 9 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ICDAR
2003
IEEE
15 years 9 months ago
Graphics Recognition - from Re-engineering to Retrieval
In this paper, we discuss how the focus in document analysis, generally speaking, and in graphics recognition more specifically, has moved from re-engineering problems to indexin...
Karl Tombre, Bart Lamiroy