data sets | Sciweavers

184

Voted

SDM
2008
SIAM

177views Data Mining» more SDM 2008»

Roughly Balanced Bagging for Imbalanced Data

15 years 8 months ago

Imbalanced class problems appear in many real applications of classification learning. We propose a novel sampling method to improve bagging for data sets with skewed class distri...

Shohei Hido, Hisashi Kashima

claim paper

Read More »

188

click to vote

SDM
2008
SIAM

168views Data Mining» more SDM 2008»

Semi-Supervised Clustering via Matrix Factorization

15 years 8 months ago

Download users.cis.fiu.edu

The recent years have witnessed a surge of interests of semi-supervised clustering methods, which aim to cluster the data set under the guidance of some supervisory information. U...

Fei Wang, Tao Li, Changshui Zhang

claim paper

Read More »

203

click to vote

ICMLA
2008

106views Machine Learning» more ICMLA 2008»

A Weighted Distance Measure for Calculating the Similarity of Sparsely Distributed Trajectories

15 years 8 months ago

Download www.ee.oulu.fi

This article presents a method for the calculating similarity of two trajectories. The method is especially designed for a situation where the points of the trajectories are distr...

Pekka Siirtola, Perttu Laurinen, Juha Röning

claim paper

Read More »

162

click to vote

HIS
2008

130views Information Technology» more HIS 2008»

Artificial Data Sets Based on Knowledge Generators: Analysis of Learning Algorithms Efficiency

15 years 8 months ago

Download www.salle.url.edu

This paper proposes a methodology to generate artificial data sets to evaluate the behavior of machine learning techniques. The methodology relies in the definition of a domain an...

Joaquin Rios-Boutin, Albert Orriols-Puig, Josep Ma...

claim paper

Read More »

187

click to vote

GRAPHICSINTERFACE
2007

127views Computer Graphics» more GRAPHICSINTERFACE 2007»

Visualization and exploration of time-varying medical image data sets

15 years 8 months ago

Download www.cs.sfu.ca

In this work, we propose and compare several methods for the visualization and exploration of time-varying volumetric medical images based on the temporal characteristics of the d...

Zhe Fang, Torsten Möller, Ghassan Hamarneh, A...

claim paper

Read More »

187

click to vote

ESANN
2008

131views Neural Networks» more ESANN 2008»

Parallelizing single patch pass clustering

15 years 8 months ago

Download www.dice.ucl.ac.be

Clustering algorithms such as k-means, the self-organizing map (SOM), or Neural Gas (NG) constitute popular tools for automated information analysis. Since data sets are becoming l...

Nikolai Alex, Barbara Hammer

claim paper

Read More »

186

Voted

SDM
2010
SIAM

204views Data Mining» more SDM 2010»

Scalable Tensor Factorizations with Missing Data

15 years 8 months ago

Download csmr.ca.sandia.gov

The problem of missing data is ubiquitous in domains such as biomedical signal processing, network traffic analysis, bibliometrics, social network analysis, chemometrics, computer...

Evrim Acar, Daniel M. Dunlavy, Tamara G. Kolda, Mo...

claim paper

Read More »

192

Voted

EMNLP
2008

109views Natural Language Processing» more EMNLP 2008»

A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers

15 years 8 months ago

Download aclweb.org

There is growing interest in applying Bayesian techniques to NLP problems. There are a number of different estimators for Bayesian models, and it is useful to know what kinds of t...

Jianfeng Gao, Mark Johnson

claim paper

Read More »

163

click to vote

DMIN
2008

176views Data Mining» more DMIN 2008»

Multi-Class SVM for Large Data Sets Considering Models of Classes Distribution

15 years 8 months ago

Download www.ctrl.cinvestav.mx

Support Vector Machines (SVM) have gained profound interest amidst the researchers. One of the important issues concerning SVM is with its application to large data sets. It is rec...

Jair Cervantes, Xiaoou Li, Wen Yu

claim paper

Read More »

284

Voted

EMNLP
2007

198views Natural Language Processing» more EMNLP 2007»

The CoNLL 2007 Shared Task on Dependency Parsing

15 years 8 months ago

Download acl.ldc.upenn.edu

The Conference on Computational Natural Language Learning features a shared task, in which participants train and test their learning systems on the same data sets. In 2007, as in...

Joakim Nivre, Johan Hall, Sandra Kübler, Ryan...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers