Sciweavers

ICDE
2009
IEEE

A Rule-Based Classification Algorithm for Uncertain Data

15 years 2 months ago
A Rule-Based Classification Algorithm for Uncertain Data
Abstract-- Data uncertainty is common in real-world applications due to various causes, including imprecise measurement, network latency, outdated sources and sampling errors. These kinds of uncertainty have to be handled cautiously, or else the mining results could be unreliable or even wrong. In this paper, we propose a new rule-based classification and prediction algorithm called uRule for classifying uncertain data. This algorithm introduces new measures for generating, pruning and optimizing rules. These new measures are computed considering uncertain data interval and probability distribution function. Based on the new measures, the optimal splitting attribute and splitting value can be identified and used for classification and prediction. The proposed uRule algorithm can process uncertainty in both numerical and categorical data. Our experimental results show that uRule has excellent performance even when data is highly uncertain.
Biao Qin, Yuni Xia, Sunil Prabhakar, Yi-Cheng Tu
Added 20 Oct 2009
Updated 20 Oct 2009
Type Conference
Year 2009
Where ICDE
Authors Biao Qin, Yuni Xia, Sunil Prabhakar, Yi-Cheng Tu
Comments (0)