data sets | Sciweavers

160

IEAAIE
2009
Springer

105views Artificial Intelligence» more IEAAIE 2009»

Hiding Predictive Association Rules on Horizontally Distributed Data

15 years 4 months ago

Abstract. In this work, we propose two approaches of hiding predictive association rules where the data sets are horizontally distributed and owned by collaborative but non-trustin...

Shyue-Liang Wang, Ting-Zheng Lai, Tzung-Pei Hong, ...

claim paper

Read More »

191

Voted

SEMWEB
2010
Springer

164views Internet Technology» more SEMWEB 2010»

Compact Representation of Large RDF Data Sets for Publishing and Exchange

15 years 4 months ago

Download iswc2010.semanticweb.org

Abstract. Increasingly huge RDF data sets are being published on the Web. Currently, they use different syntaxes of RDF, contain high levels of redundancy and have a plain indivisi...

Javier D. Fernández, Miguel A. Martí...

claim paper

Read More »

165

click to vote

NAACL
2010

149views Computational Linguistics» more NAACL 2010»

The Effect of Ambiguity on the Automated Acquisition of WSD Examples

15 years 4 months ago

Download www.aclweb.org

Several methods for automatically generating labeled examples that can be used as training data for WSD systems have been proposed, including a semisupervised approach based on re...

Mark Stevenson, Yikun Guo

claim paper

Read More »

176

click to vote

ISAMI
2010

104views Emerging Technology» more ISAMI 2010»

GRASP for Instance Selection in Medical Data Sets

15 years 4 months ago

Download www.escet.urjc.es

Abstract Medical data sets consist of a huge amount of data organized in instances, where each one contains several attributes. The quality of the models obtained from a database s...

Alfonso Fernández, Abraham Duarte, Rosa Her...

claim paper

Read More »

153

click to vote

IEAAIE
2010
Springer

125views Artificial Intelligence» more IEAAIE 2010»

Exploring the Performance of Resampling Strategies for the Class Imbalance Problem

15 years 4 months ago

Download marmota.dlsi.uji.es

The present paper studies the influence of two distinct factors on the performance of some resampling strategies for handling imbalanced data sets. In particular, we focus on the n...

Vicente García, José Salvador S&aacu...

claim paper

Read More »

212

Voted

ICDM
2010
IEEE

168views Data Mining» more ICDM 2010»

Anomaly Detection Using an Ensemble of Feature Models

15 years 4 months ago

Download bcb.cs.tufts.edu

We present a new approach to semi-supervised anomaly detection. Given a set of training examples believed to come from the same distribution or class, the task is to learn a model ...

Keith Noto, Carla E. Brodley, Donna K. Slonim

claim paper

Read More »

180

click to vote

ICDM
2010
IEEE

134views Data Mining» more ICDM 2010»

Consequences of Variability in Classifier Performance Estimates

15 years 4 months ago

Download www.nd.edu

The prevailing approach to evaluating classifiers in the machine learning community involves comparing the performance of several algorithms over a series of usually unrelated data...

Troy Raeder, T. Ryan Hoens, Nitesh V. Chawla

claim paper

Read More »

148

Voted

ACL
2010

144views Computational Linguistics» more ACL 2010»

Cognitively Plausible Models of Human Language Processing

15 years 4 months ago

Download www.aclweb.org

We pose the development of cognitively plausible models of human language processing as a challenge for computational linguistics. Existing models can only deal with isolated phen...

Frank Keller

claim paper

Read More »

204

click to vote

KAIS
2010

144views more KAIS 2010»

Boosting support vector machines for imbalanced data sets

15 years 5 months ago

Download www.site.uottawa.ca

Real world data mining applications must address the issue of learning from imbalanced data sets. The problem occurs when the number of instances in one class greatly outnumbers t...

Benjamin X. Wang, Nathalie Japkowicz

claim paper

Read More »

195

Voted

INCDM
2010
Springer

172views Data Mining» more INCDM 2010»

Evaluating the Quality of Clustering Algorithms Using Cluster Path Lengths

15 years 5 months ago

Download hal.archives-ouvertes.fr

Many real world systems can be modeled as networks or graphs. Clustering algorithms that help us to organize and understand these networks are usually referred to as, graph based c...

Faraz Zaidi, Daniel Archambault, Guy Melanç...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers