Search Sciweavers | Sciweavers

391 search results - page 20 / 79

» Finding and Extracting Data Records from Web Pages

142

click to vote

IPM
2006

146views more IPM 2006»

Dictionary-based text categorization of chemical web pages

15 years 4 months ago

Download chemport.ipe.ac.cn

A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...

Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...

claim paper

Read More »

140

click to vote

APWEB
2006
Springer

161views Internet Technology» more APWEB 2006»

Image Description Mining and Hierarchical Clustering on Data Records Using HR-Tree

15 years 7 months ago

Download eelab.sjtu.edu.cn

Since we can hardly get semantics from the low-level features of the image, it is much more difficult to analyze the image than textual information on the Web. Traditionally, textu...

Congle Zhang, Sheng Huang, Gui-Rong Xue, Yong Yu

claim paper

Read More »

144

click to vote

WWW
2008
ACM

214views Internet Technology» more WWW 2008»

16 years 4 months ago

Efficient similarity joins for near duplicate detection

Download www2008.org

With the increasing amount of data and the need to integrate data from multiple data sources, a challenging issue is to find near duplicate records efficiently. In this paper, we ...

Chuan Xiao, Wei Wang 0011, Xuemin Lin, Jeffrey Xu ...

claim paper

Read More »

164

click to vote

AIIA
2003
Springer

163views Artificial Intelligence» more AIIA 2003»

Preprocessing and Mining Web Log Data for Web Personalization

15 years 9 months ago

Download www.di.unipi.it

We describe the web usage mining activities of an on-going project, called ClickWorld3 , that aims at extracting models of the navigational behaviour of a web site users. The model...

Miriam Baglioni, U. Ferrara, Andrea Romei, Salvato...

claim paper

Read More »

146

click to vote

WWW
2011
ACM

298views Internet Technology» more WWW 2011»

HyLiEn: a hybrid approach to general list extraction on the web

14 years 11 months ago

Download www.cs.uiuc.edu

We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...

Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...

claim paper

Read More »

« Prev « First page 20 / 79 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers