Sciweavers

2137 search results - page 40 / 428
» Extraction of Structural Information from the Web
Sort
View
153
Voted
IJDAR
2011
114views more  IJDAR 2011»
14 years 10 months ago
Setting up a competition framework for the evaluation of structure extraction from OCR-ed books
Abstract. This paper describes the setup of the Book Structure Extraction competition run at ICDAR 2009. The goal of the competition was to evaluate and compare automatic technique...
Antoine Doucet, Gabriella Kazai, Bodin Dresevic, A...
136
Voted
WWW
2004
ACM
16 years 4 months ago
OntoMiner: bootstrapping ontologies from overlapping domain specific web sites
In this paper, we present automated techniques for bootstrapping and populating specialized domain ontologies by organizing and mining a set of relevant overlapping Web sites prov...
Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...
139
Voted
DILS
2009
Springer
15 years 10 months ago
Site-Wide Wrapper Induction for Life Science Deep Web Databases
We present a novel approach to automatic information extraction from Deep Web Life Science databases using wrapper induction. Traditional wrapper induction techniques focus on lear...
Saqib Mir, Steffen Staab, Isabel Rojas
120
Voted
IJCAI
2007
15 years 5 months ago
What You Seek Is What You Get: Extraction of Class Attributes from Query Logs
Within the larger area of automatic acquisition of knowledge from the Web, we introduce a method for extracting relevant attributes, or quantifiable properties, for various class...
Marius Pasca, Benjamin Van Durme
125
Voted
DAS
2010
Springer
15 years 1 months ago
Information extraction by finding repeated structure
Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...
Evgeniy Bart, Prateek Sarkar