Sciweavers

1541 search results - page 11 / 309
» Extracting Web Data Using Instance-Based Learning
Sort
View
ISCIS
2009
Springer
14 years 1 months ago
PopulusLog: People information database
—Information about individuals on publicly available web sites stands as a valuable, yet unorganized, data source. Turning such an enormous data source into a “database” is h...
Ali Cakmak, Mustafa Kirac, Gultekin Özsoyoglu
DASFAA
2005
IEEE
123views Database» more  DASFAA 2005»
13 years 10 months ago
Automatic Data Extraction from Data-Rich Web Pages
Abstract. Extracting data from web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. In this paper, we propose a...
Dongdong Hu, Xiaofeng Meng
AAAI
2006
13 years 10 months ago
Table Extraction Using Spatial Reasoning on the CSS2 Visual Box Model
Tables on web pages contain a huge amount of semantically explicit information, which makes them a worthwhile target for automatic information extraction and knowledge acquisition...
Wolfgang Gatterbauer, Paul Bohunsky
SAINT
2005
IEEE
14 years 2 months ago
Learning Logic Wrappers for Information Extraction from the Web
This paper discusses a methodology for applying general-purpose first-order inductive learning to extract information from Web documents structured as unranked ordered trees. The...
Costin Badica, Elvira Popescu, Amelia Badica
PKDD
2007
Springer
143views Data Mining» more  PKDD 2007»
14 years 2 months ago
Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction
Textual patterns have been used effectively to extract information from large text collections. However they rely heavily on textual redundancy in the sense that facts have to be m...
Sebastian Blohm, Philipp Cimiano