Sciweavers

258 search results - page 24 / 52
» Efficient techniques for document sanitization
TKDE
1998
13 years 8 months ago
Efficient Data Mining for Path Traversal Patterns
In this paper, we explore a new data mining capability that involves mining path traversal patterns in a distributed information-providing environment where documents or objects...
Ming-Syan Chen, Jong Soo Park, Philip S. Yu
KI
2002
Springer
13 years 8 months ago
Employing Text Mining for Semantic Tagging in DIAsDEM
Both public and private organizations have been accumulating large volumes of electronically available text documents for the past years. However, to turn text archives into profi...
Karsten Winkler, Myra Spiliopoulou
WWW
2005
ACM
14 years 9 months ago
Improving Web search efficiency via a locality based static pruning method
The unarguably fast, and continuous, growth of the volume of indexed (and indexable) documents on the Web poses a great challenge for search engines. This is true regarding not on...
Edleno Silva de Moura, Célia Francisca dos ...
ICDAR
2009
IEEE
13 years 6 months ago
Low Cost Correction of OCR Errors Using Learning in a Multi-Engine Environment
We propose a low cost method for the correction of the output of OCR engines through the use of human labor. The method employs an error estimator neural network that learns to as...
Ahmad Abdulkader, Mathew R. Casey
ISESE
2003
IEEE
14 years 1 month ago
An Experimental Evaluation of Inspection and Testing for Detection of Design Faults
The two most common strategies for verification and validation, inspection and testing, are in a controlled experiment evaluated in terms of their fault detection capabilities. Th...
Carina Andersson, Thomas Thelin, Per Runeson, Nina...