Sciweavers

APBC
2006
13 years 10 months ago
Analyzing Inconsistency Toward Enhancing Integration of Biological Molecular Databases
The rapid growth of biological databases not only provides biologists with abundant data but also presents a big challenge in relation to the analysis of data. Many data analysis...
Yi-Ping Phoebe Chen, Qingfeng Chen
ACSW
2004
13 years 10 months ago
A Framework for Privacy Preserving Classification in Data Mining
Nowadays organizations all over the world are dependent on mining gigantic datasets. These datasets typically contain delicate individual information, which inevitably gets expose...
Zahidul Islam, Ljiljana Brankovic
ACSW
2004
13 years 10 months ago
Early Assessment of Classification Performance
The ability to distinguish between objects is fundamental to learning and intelligent behavior in general. The difference between two things is the information we seek; the pr...
Bostjan Brumen, Izidor Golob, Hannu Jaakkola, Tatj...
SDM
2007
SIAM
13 years 10 months ago
Performance of Recommendation Systems in Dynamic Streaming Environments
In this paper, we study the behavior of collaborative filtering based recommendations under evolving user profile scenarios. We propose a systematic validation methodology that ...
Olfa Nasraoui, Jeff Cerwinske, Carlos Rojas, Fabio...
SDM
2007
SIAM
13 years 10 months ago
Summarizing Review Scores of "Unequal" Reviewers
A frequently encountered problem in decision making is the following review problem: review a large number of objects and select a small number of the best ones. An example is sel...
Hady Wirawan Lauw, Ee-Peng Lim, Ke Wang
SDM
2007
SIAM
13 years 10 months ago
Semi-supervised Feature Selection via Spectral Analysis
Feature selection is an important task in effective data mining. A new challenge to feature selection is the so-called “small labeled-sample problem” in which labeled data is...
Zheng Zhao, Huan Liu
SDM
2007
SIAM
13 years 10 months ago
Semi-Supervised Dimensionality Reduction
Dimensionality reduction is among the keys in mining high-dimensional data. This paper studies semi-supervised dimensionality reduction. In this setting, besides abundant unlabeled...
Daoqiang Zhang, Zhi-Hua Zhou, Songcan Chen
SDM
2007
SIAM
13 years 10 months ago
On Demand Phenotype Ranking through Subspace Clustering
High throughput biotechnologies have enabled scientists to collect a large number of genetic and phenotypic attributes for a large collection of samples. Computational methods are...
Xiang Zhang, Wei Wang 0010, Jun Huan
SDM
2007
SIAM
13 years 10 months ago
HACS: Heuristic Algorithm for Clustering Subsets
The term consideration set is used in marketing to refer to the set of items a customer thought about purchasing before making a choice. While consideration sets are not directly ...
Ding Yuan, W. Nick Street
SDM
2007
SIAM
13 years 10 months ago
Fast Best-Match Shape Searching in Rotation Invariant Metric Spaces
Object recognition and content-based image retrieval systems rely heavily on the accurate and efficient identification of shapes. A fundamental requirement in the shape analysis ...
Dragomir Yankov, Eamonn J. Keogh, Li Wei, Xiaopeng...