Sciweavers

» A Case Study for Learning from Imbalanced Data Sets
CLUSTER
2003
IEEE
14 years 29 days ago
A Case Study of Parallel I/O for Biological Sequence Search on Linux Clusters
In this paper we analyze the I/O access patterns of a widely used biological sequence search tool and implement two variations that employ parallel I/O for data access based on PV...
Yifeng Zhu, Hong Jiang, Xiao Qin, David R. Swanson
HIS
2008
13 years 9 months ago
Artificial Data Sets Based on Knowledge Generators: Analysis of Learning Algorithms Efficiency
This paper proposes a methodology to generate artificial data sets to evaluate the behavior of machine learning techniques. The methodology relies on the definition of a domain an...
Joaquin Rios-Boutin, Albert Orriols-Puig, Josep Ma...
CNSM
2010
13 years 5 months ago
An investigation on the identification of VoIP traffic: Case study on Gtalk and Skype
The classification of encrypted traffic on the fly from network traces represents a particularly challenging application domain. Recent advances in machine learning provide the opp...
Riyad Alshammari, A. Nur Zincir-Heywood
EDM
2008
13 years 9 months ago
Mining the Student Assessment Data: Lessons Drawn from a Small Scale Case Study
In this paper we describe an educational data mining (EDM) case study based on the data collected during the online assessment of students who were able to immediately receive tail...
Mykola Pechenizkiy, Toon Calders, Ekaterina Vasily...
CORR
2008
Springer
13 years 7 months ago
Learning to rank with combinatorial Hodge theory
Abstract. We propose a number of techniques for learning a global ranking from data that may be incomplete and imbalanced -- characteristics that are almost universal to modern dat...
Xiaoye Jiang, Lek-Heng Lim, Yuan Yao, Yinyu Ye