Search Sciweavers | Sciweavers

2137 search results - page 8 / 428

Extraction of Structural Information from the Web

WEBI 2005, Springer (Internet Technology)

Automated Metadata and Instance Extraction from News Web Sites

In this paper, we present automated techniques for extracting metadata instance information by organizing and mining a set of news Web sites. We develop algorithms that detect and...

Srinivas Vadrevu, Saravanakumar Nagarajan, Fatih G...

WWW 2011, ACM (Internet Technology)

HyLiEn: a hybrid approach to general list extraction on the web

We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...

Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...

CLEF 2010, Springer (Information Technology)

Person Attribute Extraction from the Textual Parts of Web Pages

We present the RGAI systems which participated in the third Web People Search Task challenge. The chief characteristics of our approach are that we focus on the raw textual parts o...

István Nagy, Richárd Farkas

WWW 2010, ACM (Internet Technology)

Exploiting content redundancy for web information extraction

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

KDD 2008, ACM (Data Mining)

Information extraction from Wikipedia: moving down the long tail

Not only is Wikipedia a comprehensive source of quality information, it has several kinds of internal structure (e.g., relational summaries known as infoboxes), which enable self-...

Fei Wu, Raphael Hoffmann, Daniel S. Weld
