Sciweavers

» Mining Several Data Bases with an Ensemble of Classifiers
SIGKDD
2010
13 years 3 months ago
On cross-validation and stacking: building seemingly predictive models on random data
A number of times when using cross-validation (CV) while trying to do classification/probability estimation we have observed surprisingly low AUC's on real data with very few...
Claudia Perlich, Grzegorz Swirszcz
COMAD
2008
13 years 10 months ago
CUM: An Efficient Framework for Mining Concept Units
The Web is the most important repository of different kinds of media, such as text, sound, video, and images. Web mining is the process of applying data mining techniques to automatica...
Santhi Thilagam
GRC
2008
IEEE
13 years 10 months ago
Fuzzy Entropy based Max-Relevancy and Min-Redundancy Feature Selection
Feature selection is an important problem for pattern classification systems. Mutual information is a good indicator of relevance between variables, and has been used as a measure...
Shuang An, Qinghua Hu, Daren Yu
WWW
2003
ACM
14 years 9 months ago
Mining newsgroups using networks arising from social behavior
Recent advances in information retrieval over hyperlinked corpora have convincingly demonstrated that links carry less noisy information than text. We investigate the feasibility of...
Rakesh Agrawal, Sridhar Rajagopalan, Ramakrishnan ...
SAC
2004
ACM
14 years 2 months ago
A new algorithm for gap constrained sequence mining
The sequence mining problem consists in finding frequent sequential patterns in a database of time-stamped events. Several application domains require limiting the maximum tempor...
Salvatore Orlando, Raffaele Perego, Claudio Silves...