Sciweavers

2497 search results - page 387 / 500
» A Partial-Repeatability Approach to Data Mining
WWW
2010
ACM
15 years 10 months ago
Large-scale bot detection for search engines
In this paper, we propose a semi-supervised learning approach for classifying program (bot) generated web search traffic from that of genuine human users. The work is motivated by...
Hongwen Kang, Kuansan Wang, David Soukal, Fritz Be...
SAC
2006
ACM
15 years 9 months ago
The impact of sample reduction on PCA-based feature extraction for supervised learning
“The curse of dimensionality” is pertinent to many learning algorithms, and it denotes the drastic raise of computational complexity and classification error in high dimension...
Mykola Pechenizkiy, Seppo Puuronen, Alexey Tsymbal
CINQ
2004
Springer
15 years 7 months ago
Inductive Querying for Discovering Subgroups and Clusters
We introduce the problem of cluster-grouping and show that it integrates several important data mining tasks, i.e. subgroup discovery, mining correlated patterns and aspects from c...
Albrecht Zimmermann, Luc De Raedt
AUSDM
2006
Springer
15 years 7 months ago
Integrated Scoring For Spelling Error Correction, Abbreviation Expansion and Case Restoration in Dirty Text
An increasing number of language and speech applications are gearing towards the use of texts from online sources as input. Despite such rise, not much work can be found in the as...
Wilson Wong, Wei Liu, Mohammed Bennamoun
SDM
2008
SIAM
15 years 5 months ago
Integration of Multiple Networks for Robust Label Propagation
Transductive inference on graphs such as label propagation algorithms is receiving a lot of attention. In this paper, we address a label propagation problem on multiple networks a...
Tsuyoshi Kato, Hisashi Kashima, Masashi Sugiyama