Sciweavers

DMIN
2009
Are Decision Trees Always Greener on the Open (Source) Side of the Fence?
This short paper compares the performance of three popular decision tree algorithms: C4.5, C5.0, and WEKA's J48. These decision tree algorithms are all related in that C5.0 ...
Samuel Moore, Daniel D'Addario, James Kurinskas, G...
DMIN
2009
A Sparse Coding Based Similarity Measure
In high dimensional data sets not all dimensions contain an equal amount of information and most of the time global features are more important than local differences. This makes ...
Sebastian Klenk, Gunther Heidemann
DMIN
2009
Evaluating Algorithms for Concept Description
When performing concept description, models need to be evaluated both on accuracy and comprehensibility. A comprehensible concept description model should present the most importan...
Cecilia Sönströd, Ulf Johansson, Tuve L...
DMIN
2009
Improved k-NN Algorithm for Text Classification
Over the last twenty years, text classification has become one of the key techniques for organizing electronic information such as text and web documents. The k-Nearest Neighbor ...
Muhammed Miah
DMIN
2009
Fish or Shark - Data Mining Online Poker
In this paper, data mining techniques are used to analyze data gathered from online poker. The study focuses on short-handed Texas Hold'em, and the data sets used contain thou...
Ulf Johansson, Cecilia Sönströd
ADMA
2009
Springer
Quantitative Comparison of Similarity Measure and Entropy for Fuzzy Sets
Comparison and data analysis of the similarity measures and entropy for fuzzy sets are studied. The distance proportional value between the fuzzy set and the corresponding crisp se...
Hongmei Wang, Sanghyuk Lee, Jaehyung Kim
WKDD
2010
CPS
3D Scientific Data Mining in Ion Trajectories
In physics, structure of glass and ion trajectories are essentially based on statistical analysis of data acquired through experimental measurement and computer simulation [1, 2]. ...
J. M. Sharif, M. Mahadi Abdul Jamil, Md. Asri Ngad...
SISAP
2010
IEEE
Enlarging nodes to improve dynamic spatial approximation trees
Marcelo Barroso, Nora Reyes, Rodrigo Paredes
SISAP
2010
IEEE
Similarity matrix compression for efficient signature quadratic form distance computation
Determining similarities among multimedia objects is a fundamental task in many content-based retrieval, analysis, mining, and exploration applications. Among state-of-the-art sim...
Christian Beecks, Merih Seran Uysal, Thomas Seidl
SISAP
2010
IEEE
kNN based image classification relying on local feature similarity
In this paper, we propose a novel image classification approach, derived from the kNN classification strategy, that is particularly suited to be used when classifying images descr...
Giuseppe Amato, Fabrizio Falchi