Sciweavers

2137 search results - page 40 / 428
» Extraction of Structural Information from the Web
Sort
View
IJDAR
2011
114views more  IJDAR 2011»
13 years 2 months ago
Setting up a competition framework for the evaluation of structure extraction from OCR-ed books
Abstract. This paper describes the setup of the Book Structure Extraction competition run at ICDAR 2009. The goal of the competition was to evaluate and compare automatic technique...
Antoine Doucet, Gabriella Kazai, Bodin Dresevic, A...
WWW
2004
ACM
14 years 8 months ago
OntoMiner: bootstrapping ontologies from overlapping domain specific web sites
In this paper, we present automated techniques for bootstrapping and populating specialized domain ontologies by organizing and mining a set of relevant overlapping Web sites prov...
Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...
28
Voted
DILS
2009
Springer
14 years 2 months ago
Site-Wide Wrapper Induction for Life Science Deep Web Databases
We present a novel approach to automatic information extraction from Deep Web Life Science databases using wrapper induction. Traditional wrapper induction techniques focus on lear...
Saqib Mir, Steffen Staab, Isabel Rojas
IJCAI
2007
13 years 9 months ago
What You Seek Is What You Get: Extraction of Class Attributes from Query Logs
Within the larger area of automatic acquisition of knowledge from the Web, we introduce a method for extracting relevant attributes, or quantifiable properties, for various class...
Marius Pasca, Benjamin Van Durme
DAS
2010
Springer
13 years 5 months ago
Information extraction by finding repeated structure
Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...
Evgeniy Bart, Prateek Sarkar