Search Sciweavers | Sciweavers

8479 search results - page 16 / 1696

» Data Extraction from Web Data Sources

192

click to vote

SIGMOD
2003
ACM

190views Database» more SIGMOD 2003»

Extracting Structured Data from Web Pages

15 years 12 months ago

Download infolab.stanford.edu

Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...

Arvind Arasu, Hector Garcia-Molina

claim paper

Read More »

197

click to vote

ESWS
2010
Springer

201views Internet Technology» more ESWS 2010»

An Unsupervised Approach for Acquiring Ontologies and RDF Data from Online Life Science Databases

15 years 8 months ago

Download www.uni-koblenz.de

In the Linked Open Data cloud one of the largest data sets, comprising of 2.5 billion triples, is derived from the Life Science domain. Yet this represents a small fraction of the ...

Saqib Mir, Steffen Staab, Isabel Rojas

claim paper

Read More »

183

click to vote

ICDM
2007
IEEE

149views Data Mining» more ICDM 2007»

Extracting Author Meta-Data from Web Using Visual Features

16 years 1 months ago

Download www.cse.psu.edu

Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...

Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles

claim paper

Read More »

195

click to vote

CIKM
2007
Springer

134views Information Technology» more CIKM 2007»

The role of documents vs. queries in extracting class attributes from text

16 years 26 days ago

Download www.cs.jhu.edu

Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources o...

Marius Pasca, Benjamin Van Durme, Nikesh Garera

claim paper

Read More »

186

click to vote

SYNASC
2006
IEEE

211views Algorithms» more SYNASC 2006»

HTML Pattern Generator--Automatic Data Extraction from Web Pages

16 years 21 days ago

Download www.informatik.tu-cottbus.de

Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...

Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...

claim paper

Read More »

« Prev « First page 16 / 1696 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers