Sciweavers

KDD 2007, ACM (Data Mining)
Detecting anomalous records in categorical datasets
We consider the problem of detecting anomalies in high arity categorical datasets. In most applications, anomalies are defined as data points that are 'abnormal'. Quite ...
Kaustav Das, Jeff G. Schneider
KDD 2007, ACM (Data Mining)
Practical learning from one-sided feedback
In many data mining applications, online labeling feedback is only available for examples which were predicted to belong to the positive class. Such applications include spam filt...
D. Sculley
KDD 2007, ACM (Data Mining)
Joint optimization of wrapper generation and template detection
Many websites have large collections of pages generated dynamically from an underlying structured source like a database. The data of a category are typically encoded into similar...
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu
KDD 2007, ACM (Data Mining)
Exploiting underrepresented query aspects for automatic query expansion
Users attempt to express their search goals through web search queries. When a search goal has multiple components or aspects, documents that represent all the aspects are likely ...
Daniel Crabtree, Peter Andreae, Xiaoying Gao
KDD 2006, ACM (Data Mining)
On privacy preservation against adversarial data mining
Privacy-preserving data processing has recently become an important topic because advances in hardware technology have led to widespread proliferation of demographic and...
Charu C. Aggarwal, Jian Pei, Bo Zhang