Search Sciweavers | Sciweavers

391 search results - page 6 / 79

» Finding and Extracting Data Records from Web Pages

134

click to vote

IPM
2007

149views more IPM 2007»

Web page title extraction and its application

15 years 3 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

claim paper

Read More »

116

click to vote

CIKM
2010
Springer

115views Information Technology» more CIKM 2010»

Mapping web pages to database records via link paths

15 years 2 months ago

Download www.cs.uiuc.edu

In this paper we propose a new knowledge management task which aims to map Web pages to their corresponding records in a structured database. For example, the DBLP database contai...

Tim Weninger, Fabio Fumarola, Jiawei Han, Donato M...

claim paper

Read More »

146

click to vote

FLAIRS
2001

131views Artificial Intelligence» more FLAIRS 2001»

Syntactic Folding and its Application to the Information Extraction from Web Pages

15 years 5 months ago

Download www.aaai.org

Thepaper deals with investigations concerning potential structures of documentsthat will be subject to automated information extraction. The focus is on folding principles and the...

Jörg Herrmann

claim paper

Read More »

163

click to vote

JCDL
2005
ACM

84views Education» more JCDL 2005»

Finding a catalog: generating analytical catalog records from well-structured digital texts

15 years 9 months ago

Download www.cs.umass.edu

One of the criticisms library users often make of catalogs is that they rarely include information below the bibliographic level. It is generally impossible to search a catalog fo...

David M. Mimno, Alison Jones, Gregory Crane

claim paper

Read More »

144

click to vote

SYNASC
2006
IEEE

211views Algorithms» more SYNASC 2006»

HTML Pattern Generator--Automatic Data Extraction from Web Pages

15 years 10 months ago

Download www.informatik.tu-cottbus.de

Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...

Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...

claim paper

Read More »

« Prev « First page 6 / 79 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers