Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

83

WISE
2002
Springer

favoriteEmaildiscussreport

122views Internet Technology» more WISE 2002»

A Unified Framework for Clustering Heterogeneous Web Objects

14 years 12 months ago

A Unified Framework for Clustering Heterogeneous Web Objects

Download research.microsoft.com

In this paper, we introduce a novel framework for clustering web data which is often heterogeneous in nature. As most existing methods often integrate heterogeneous data into a unified feature space, their flexibilities to explore and adjust contributing effect from different heterogeneous information are compromised. In contrast, our framework enables separate clustering of homogeneous data in the entire process based on their respective features, and a layered structure with link information is used to iteratively project and propagate the clustered results between layers until it converges. Our experimental results show that such a scheme not only effectively overcomes the problem of data sparseness caused by the high dimensional link space but also improves the clustering accuracy significantly. We achieve 19% and 41% performance increases when clustering web-pages and users based on a semi-synthetic web log. Finally, we show a real clustering result based on UC Berkeley's we...

Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma

Real-time Traffic

Clustering Accuracy | Heterogeneous Data | Internet Technology | Separate Clustering | WISE 2002 |

claim paper

Related Content

» Similarity spreading a unified framework for similarity calculation of interrelated object...

» Specifying a WSECA Working Framework for Ubiquitous Web Services in ObjectProcess Methodol...

» Edge Weight Regularization over Multiple Graphs for Similarity Learning

» ClusterBased Computing with Active Persistent Objects on the Web

» A probabilistic framework for relational clustering

» A Unified Resource Scheduling Framework for Heterogeneous Computing Environments

» Link fusion a unified link analysis framework for multitype interrelated data objects

» WebSplitter a unified XML framework for multidevice collaborative Web browsing

» Mining clickthrough data for collaborative web search

Post Info
More Details (n/a)

Added	16 Jul 2010
Updated	16 Jul 2010
Type	Conference
Year	2002
Where	WISE
Authors	Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma

Comments (0)