Search Sciweavers | Sciweavers

391 search results - page 25 / 79

» Finding and Extracting Data Records from Web Pages

WWW
2005
ACM

153 views | Internet Technology

METEOR: metadata and instance extraction from object referral lists on the web

16 years 4 months ago

Download www2005.org

The Web has established itself as the largest public data repository ever available. Even though the vast majority of information on the Web is formatted to be easily readable by ...

Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...

PVLDB
2008

141 views

WebTables: exploring the power of tables on the web

15 years 3 months ago

Download turing.cs.washington.edu

The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...

Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...

ER
2007
Springer

142 views | Database

Automatic Hidden-Web Table Interpretation by Sibling Page Comparison

15 years 10 months ago

Download www.deg.byu.edu

The longstanding problem of automatic table interpretation still eludes us. Its solution would not only be an aid to table processing applications such as large volume table conve...

Cui Tao, David W. Embley

LREC
2010

216 views | Education

BlogBuster: A Tool for Extracting Corpora from the Blogosphere

15 years 5 months ago

Download www.lrec-conf.org

This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...

Georgios Petasis, Dimitrios Petasis

COMAD
2009

142 views | Knowledge Management

Querying for relations from the semi-structured Web

15 years 5 months ago

Download www.cse.iitb.ac.in

We present a class of web queries whose result is a multi-column relation instead of a collection of unstructured documents as in standard web search. The user specifies the query...

Sunita Sarawagi
