Sciweavers

466 search results - page 48 / 94
» Scalable Feature Extraction from Noisy Documents
Sort
View
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
14 years 2 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
BMCBI
2008
131views more  BMCBI 2008»
13 years 10 months ago
K-OPLS package: Kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space
Background: Kernel-based classification and regression methods have been successfully applied to modelling a wide variety of biological data. The Kernel-based Orthogonal Projectio...
Max Bylesjö, Mattias Rantalainen, Jeremy K. N...
FEGC
2010
307views Biometrics» more  FEGC 2010»
13 years 10 months ago
Visual sentence-phrase-based document representation for effective and efficient content-based image retrieval
Abstract. Having effective and efficient methods to get access to desired images is essential nowadays with the huge amount of digital images. This paper presents an analogy betwee...
Ismail Elsayad, Jean Martinet, Thierry Urruty, Cha...
ICDAR
2011
IEEE
12 years 9 months ago
Character n-Gram Spotting in Document Images
—In this paper, we present a novel approach to search and retrieve from document image collections, without explicit recognition. Existing recognition-free approaches such as wor...
M. Sudha Praveen, K. Pramod Sankar, C. V. Jawahar
DAS
2004
Springer
14 years 3 months ago
Automatic Fax Routing
Abstract. We present a system for automatic FAX routing which processes incoming FAX images and forwards them to the correct email alias. The system first performs optical charact...
Paul A. Viola, James Rinker, Martin Law