Search Sciweavers | Sciweavers

543 search results - page 11 / 109

» Exploiting content redundancy for web information extraction

JUCS
2008

Recognising Informative Web Page Blocks Using Visual Segmentation for Efficient Information Extraction

15 years 5 months ago

Download www.jucs.org

Abstract: As web sites are getting more complicated, the construction of web information extraction systems becomes more troublesome and time-consuming. A common theme is the diffi...

Jinbeom Kang, Joongmin Choi

WIRI
2005
IEEE

Extended Link Analysis for Extracting Spatial Information Hubs

15 years 11 months ago

Download www.db.itc.nagoya-u.ac.jp

Recently, web mining that tries to find useful knowledge from the vast amount of web pages has attracted a lot of research interests. Besides, it is becoming an essential task to...

Jianwei Zhang 0002, Yoshiharu Ishikawa, Hiroyuki K...

WWW
2005
ACM

An information extraction engine for web discussion forums

15 years 11 months ago

Download www.www2005.org

In this poster, we present an information extraction engine for web-based forums. The engine analyzes the HTML files crawled from web forums, deduces the wrapper (template) of the...

Hanny Yulius Limanto, Nguyen Ngoc Giang, Vo Tan Tr...

WWW
2005
ACM

Hybrid semantic tagging for information extraction

16 years 6 months ago

Download www.www2005.org

The semantic web is expected to have an impact at least as big as that of the existing HTML based web, if not greater. However, the challenge lays in creating this semantic web an...

Ronen Feldman, Binyamin Rosenfeld, Moshe Fresko, B...

AIRWEB
2008
Springer

Web spam identification through content and hyperlinks

15 years 7 months ago

Download airweb.cse.lehigh.edu

We present an algorithm, witch, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it simultaneously exploits the structure of the Web graph as we...

Jacob Abernethy, Olivier Chapelle, Carlos Castillo
