Sciweavers

1541 search results - page 12 / 309
» Extracting Web Data Using Instance-Based Learning
WWW
2009
ACM
14 years 1 month ago
Extracting data records from the web using tag path clustering
Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...
Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...
CIKM
2009
Springer
14 years 1 month ago
Data extraction from the web using wild card queries
This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task o...
Davood Rafiei, Haobin Li
JMLR
2008
13 years 8 months ago
Dynamic Hierarchical Markov Random Fields for Integrated Web Data Extraction
Existing template-independent web data extraction approaches adopt highly ineffective decoupled strategies--attempting to do data record detection and attribute labeling in two se...
Jun Zhu, Zaiqing Nie, Bo Zhang, Ji-Rong Wen
ICDM
2007
IEEE
14 years 3 months ago
Extracting Author Meta-Data from Web Using Visual Features
Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...
Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles
WISE
2007
Springer
14 years 3 months ago
Using Clustering and Edit Distance Techniques for Automatic Web Data Extraction
Manuel Álvarez, Alberto Pan, Juan Raposo, F...