Sciweavers

391 search results - page 6 / 79
» Finding and Extracting Data Records from Web Pages
Sort
View
IPM
2007
149views more  IPM 2007»
13 years 8 months ago
Web page title extraction and its application
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...
Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...
CIKM
2010
Springer
13 years 7 months ago
Mapping web pages to database records via link paths
In this paper we propose a new knowledge management task which aims to map Web pages to their corresponding records in a structured database. For example, the DBLP database contai...
Tim Weninger, Fabio Fumarola, Jiawei Han, Donato M...
FLAIRS
2001
13 years 9 months ago
Syntactic Folding and its Application to the Information Extraction from Web Pages
Thepaper deals with investigations concerning potential structures of documentsthat will be subject to automated information extraction. The focus is on folding principles and the...
Jörg Herrmann
JCDL
2005
ACM
84views Education» more  JCDL 2005»
14 years 2 months ago
Finding a catalog: generating analytical catalog records from well-structured digital texts
One of the criticisms library users often make of catalogs is that they rarely include information below the bibliographic level. It is generally impossible to search a catalog fo...
David M. Mimno, Alison Jones, Gregory Crane
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
14 years 2 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...