Sciweavers

466 search results - page 3 / 94
» Scalable Feature Extraction from Noisy Documents
Sort
View
DAS
2006
Springer
13 years 9 months ago
Extraction and Analysis of Document Examiner Features from Vector Skeletons of Grapheme 'th'
Abstract. This paper presents a study of 25 structural features extracted from samples of grapheme `th' that correspond to features commonly used by forensic document examiner...
Vladimir Pervouchine, Graham Leedham
JCDL
2003
ACM
160views Education» more  JCDL 2003»
14 years 20 days ago
Automatic Document Metadata Extraction Using Support Vector Machines
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadat...
Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zh...
PKDD
2001
Springer
127views Data Mining» more  PKDD 2001»
13 years 12 months ago
Sentence Filtering for Information Extraction in Genomics, a Classification Problem
In some domains, Information Extraction (IE) from texts requires syntactic and semantic parsing. This analysis is computationally expensive and IE is potentially noisy if it applie...
Claire Nedellec, Mohamed Ould Abdel Vetah, Philipp...
ICDAR
2005
IEEE
14 years 1 months ago
A Segmentation-free Approach for Keyword Search in Historical Typewritten Documents
In this paper, we propose a novel segmentation-free approach for keyword search in historical typewritten documents combining image preprocessing, synthetic data creation, word sp...
Basilios Gatos, Thomas Konidaris, Kostas Ntzios, I...
DOCENG
2009
ACM
14 years 1 months ago
Web document text and images extraction using DOM analysis and natural language processing
: © Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing Parag Mulendra Joshi, Sam Liu HP Laboratories HPL-2009-187 Web page text extraction,...
Parag Mulendra Joshi, Sam Liu