Data Mining | Sciweavers

211

KDD
2005
ACM

137views Data Mining» more KDD 2005»

Pattern-based similarity search for microarray data

16 years 8 months ago

One fundamental task in near-neighbor search as well as other similarity matching efforts is to find a distance function that can efficiently quantify the similarity between two o...

Haixun Wang, Jian Pei, Philip S. Yu

claim paper

Read More »

232

click to vote

KDD
2005
ACM

194views Data Mining» more KDD 2005»

Web object indexing using domain knowledge

16 years 8 months ago

Download research.microsoft.com

Web object is defined to represent any meaningful object embedded in web pages (e.g. images, music) or pointed to by hyperlinks (e.g. downloadable files). Users usually search for...

Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiya...

claim paper

Read More »

132

click to vote

KDD
2005
ACM

111views Data Mining» more KDD 2005»

Finding partial orders from unordered 0-1 data

16 years 8 months ago

Download www.cs.helsinki.fi

Antti Ukkonen, Mikael Fortelius, Heikki Mannila

claim paper

Read More »

223

click to vote

KDD
2005
ACM

130views Data Mining» more KDD 2005»

Regression error characteristic surfaces

16 years 8 months ago

Download www.liaad.up.pt

This paper presents a generalization of Regression Error Characteristic (REC) curves. REC curves describe the cumulative distribution function of the prediction error of models an...

Luís Torgo

claim paper

Read More »

211

click to vote

KDD
2005
ACM

185views Data Mining» more KDD 2005»

Mining comparable bilingual text corpora for cross-language information integration

16 years 8 months ago

Download sifaka.cs.uiuc.edu

Integrating information in multiple natural languages is a challenging task that often requires manually created linguistic resources such as a bilingual dictionary or examples of...

Tao Tao, ChengXiang Zhai

claim paper

Read More »

238

click to vote

KDD
2005
ACM

125views Data Mining» more KDD 2005»

Email data cleaning

16 years 8 months ago

Download research.microsoft.com

Addressed in this paper is the issue of `email data cleaning' for text mining. Many text mining applications need take emails as input. Email data is usually noisy and thus i...

Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang

claim paper

Read More »

199

click to vote

KDD
2005
ACM

135views Data Mining» more KDD 2005»

A hybrid unsupervised approach for document clustering

16 years 8 months ago

Download www.surdeanu.name

We propose a hybrid, unsupervised document clustering approach that combines a hierarchical clustering algorithm with Expectation Maximization. We developed several heuristics to ...

Mihai Surdeanu, Jordi Turmo, Alicia Ageno

claim paper

Read More »

201

click to vote

KDD
2005
ACM

181views Data Mining» more KDD 2005»

16 years 8 months ago

Evaluating similarity measures: a large-scale study in the orkut social network

Download research.google.com

Online information services have grown too large for users to navigate without the help of automated tools such as collaborative filtering, which makes recommendations to users ba...

Ellen Spertus, Mehran Sahami, Orkut Buyukkokten

claim paper

Read More »

223

click to vote

KDD
2005
ACM

192views Data Mining» more KDD 2005»

Modeling and predicting personal information dissemination behavior

16 years 8 months ago

Download delivery.acm.org

In this paper, we propose a new way to automatically model and predict human behavior of receiving and disseminating information by analyzing the contact and content of personal c...

Xiaodan Song, Ching-Yung Lin, Belle L. Tseng, Ming...

claim paper

Read More »

172

click to vote

KDD
2005
ACM

86views Data Mining» more KDD 2005»

Probabilistic workflow mining

16 years 8 months ago

Download www.cs.cmu.edu

In several organizations, it has become increasingly popular to document and log the steps that makeup a typical business process. In some situations, a normative workflow model o...

Ricardo Silva, Jiji Zhang, James G. Shanahan

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers