Sciweavers

IEAAIE
2009
Springer
13 years 10 months ago
Hiding Predictive Association Rules on Horizontally Distributed Data
In this work, we propose two approaches of hiding predictive association rules where the data sets are horizontally distributed and owned by collaborative but non-trustin...
Shyue-Liang Wang, Ting-Zheng Lai, Tzung-Pei Hong, ...
SEMWEB
2010
Springer
13 years 10 months ago
Compact Representation of Large RDF Data Sets for Publishing and Exchange
Increasingly huge RDF data sets are being published on the Web. Currently, they use different syntaxes of RDF, contain high levels of redundancy and have a plain indivisi...
Javier D. Fernández, Miguel A. Martí...
NAACL
2010
13 years 10 months ago
The Effect of Ambiguity on the Automated Acquisition of WSD Examples
Several methods for automatically generating labeled examples that can be used as training data for WSD systems have been proposed, including a semisupervised approach based on re...
Mark Stevenson, Yikun Guo
ISAMI
2010
13 years 10 months ago
GRASP for Instance Selection in Medical Data Sets
Medical data sets consist of a huge amount of data organized in instances, where each one contains several attributes. The quality of the models obtained from a database s...
Alfonso Fernández, Abraham Duarte, Rosa Her...
IEAAIE
2010
Springer
13 years 10 months ago
Exploring the Performance of Resampling Strategies for the Class Imbalance Problem
The present paper studies the influence of two distinct factors on the performance of some resampling strategies for handling imbalanced data sets. In particular, we focus on the n...
Vicente García, José Salvador Sá...
ICDM
2010
IEEE
13 years 10 months ago
Anomaly Detection Using an Ensemble of Feature Models
We present a new approach to semi-supervised anomaly detection. Given a set of training examples believed to come from the same distribution or class, the task is to learn a model ...
Keith Noto, Carla E. Brodley, Donna K. Slonim
ICDM
2010
IEEE
13 years 10 months ago
Consequences of Variability in Classifier Performance Estimates
The prevailing approach to evaluating classifiers in the machine learning community involves comparing the performance of several algorithms over a series of usually unrelated data...
Troy Raeder, T. Ryan Hoens, Nitesh V. Chawla
ACL
2010
13 years 10 months ago
Cognitively Plausible Models of Human Language Processing
We pose the development of cognitively plausible models of human language processing as a challenge for computational linguistics. Existing models can only deal with isolated phen...
Frank Keller
KAIS
2010
13 years 11 months ago
Boosting support vector machines for imbalanced data sets
Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers t...
Benjamin X. Wang, Nathalie Japkowicz
INCDM
2010
Springer
13 years 11 months ago
Evaluating the Quality of Clustering Algorithms Using Cluster Path Lengths
Many real world systems can be modeled as networks or graphs. Clustering algorithms that help us to organize and understand these networks are usually referred to as graph based c...
Faraz Zaidi, Daniel Archambault, Guy Melanç...