Search Sciweavers | Sciweavers

85 search results - page 6 / 17

» Extracting Content Structure for Web Pages Based on Visual R...

159

click to vote

WWW
2010
ACM

188views Internet Technology» more WWW 2010»

Exploiting content redundancy for web information extraction

15 years 3 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

claim paper

Read More »

139

click to vote

WIRI
2005
IEEE

117views Internet Technology» more WIRI 2005»

Extended Link Analysis for Extracting Spatial Information Hubs

15 years 8 months ago

Download www.db.itc.nagoya-u.ac.jp

Recently, web mining that tries to ﬁnd useful knowledge from the vast amount of web pages has attracted a lot of research interests. Besides, it is becoming an essential task to...

Jianwei Zhang 0002, Yoshiharu Ishikawa, Hiroyuki K...

claim paper

Read More »

140

click to vote

LREC
2008

160views Education» more LREC 2008»

Automatic Extraction of Textual Elements from News Web Pages

15 years 4 months ago

Download www.lrec-conf.org

In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...

Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany

claim paper

Read More »

136

click to vote

WWW
2009
ACM

213views Internet Technology» more WWW 2009»

Extracting article text from the web with maximum subsequence segmentation

16 years 3 months ago

Download www2009.org

Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...

Jeff Pasternack, Dan Roth

claim paper

Read More »

152

click to vote

IJSI
2008

115views more IJSI 2008»

Towards Knowledge Acquisition from Semi-Structured Content

15 years 3 months ago

Download www.ijsi.org

Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

claim paper

Read More »

« Prev « First page 6 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers