Sciweavers

92 search results - page 10 / 19
» HTML Pattern Generator--Automatic Data Extraction from Web P...
Sort
View
EKAW
2010
Springer
13 years 7 months ago
RDFauthor: Employing RDFa for Collaborative Knowledge Engineering
Abstract. In this paper we present RDFauthor, an approach for authoring information that adheres to the RDF data model. RDFauthor completely hides syntax as well as RDF and ontolog...
Sebastian Tramp, Norman Heino, Sören Auer, Ph...
AUSAI
2003
Springer
14 years 1 months ago
Semi-Automatic Construction of Metadata from a Series of Web Documents
Metadata plays an important role in discovering, collecting, extracting and aggregating Web data. This paper proposes a method of constructing metadata for a specific topic. The m...
Sachio Hirokawa, Eisuke Itoh, Tetsuhiro Miyahara
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
14 years 9 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
AAAI
2006
13 years 10 months ago
Automatic Wrapper Generation Using Tree Matching and Partial Tree Alignment
This paper is concerned with the problem of structured data extraction from Web pages. The objective of the research is to automatically segment data records in a page, extract da...
Yanhong Zhai, Bing Liu
ICDM
2007
IEEE
149views Data Mining» more  ICDM 2007»
14 years 2 months ago
Extracting Author Meta-Data from Web Using Visual Features
Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...
Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles