Search Sciweavers | Sciweavers

2677 search results - page 7 / 536

» Extracting Structured Data from Web Pages

197

click to vote

IJSI
2008

115views more IJSI 2008»

Towards Knowledge Acquisition from Semi-Structured Content

15 years 6 months ago

Download www.ijsi.org

Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

claim paper

Read More »

161

Voted

SAINT
2005
IEEE

120views Internet Technology» more SAINT 2005»

Learning Logic Wrappers for Information Extraction from the Web

16 years 7 days ago

Download software.ucv.ro

This paper discusses a methodology for applying general-purpose ﬁrst-order inductive learning to extract information from Web documents structured as unranked ordered trees. The...

Costin Badica, Elvira Popescu, Amelia Badica

claim paper

Read More »

189

click to vote

WWW
2011
ACM

298views Internet Technology» more WWW 2011»

HyLiEn: a hybrid approach to general list extraction on the web

15 years 1 months ago

Download www.cs.uiuc.edu

We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...

Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...

claim paper

Read More »

174

click to vote

CLEF
2010
Springer

164views Information Technology» more CLEF 2010»

Person Attribute Extraction from the Textual Parts of Web Pages

15 years 6 months ago

Download www.clef2010.org

We present the RGAI systems which participated in the third Web People Search Task challenge. The chief characteristics of our approach are that we focus on the raw textual parts o...

István Nagy, Richárd Farkas

claim paper

Read More »

187

click to vote

ITCC
2005
IEEE

105views Information Technology» more ITCC 2005»

Elimination of Redundant Information for Web Data Mining

16 years 7 days ago

Download eprints.utas.edu.au

These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...

Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang

claim paper

Read More »

« Prev « First page 7 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers