Sciweavers

743 search results - page 94 / 149
» Performance Measurements for Privacy Preserving Data Mining
Sort
View
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 11 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
KDD
2009
ACM
248views Data Mining» more  KDD 2009»
15 years 9 months ago
PSkip: estimating relevance ranking quality from web search clickthrough data
1 In this article, we report our efforts in mining the information encoded as clickthrough data in the server logs to evaluate and monitor the relevance ranking quality of a commer...
Kuansan Wang, Toby Walker, Zijian Zheng
OOPSLA
2007
Springer
15 years 10 months ago
Statistically rigorous java performance evaluation
Java performance is far from being trivial to benchmark because it is affected by various factors such as the Java application, its input, the virtual machine, the garbage collect...
Andy Georges, Dries Buytaert, Lieven Eeckhout
KDD
2001
ACM
142views Data Mining» more  KDD 2001»
16 years 5 months ago
TreeDT: gene mapping by tree disequilibrium test
We introduce and evaluate TreeDT, a novel gene mapping method which is based on discovering and assessing tree-like patterns in genetic marker data. Gene mapping aims at discoveri...
Petteri Sevon, Hannu Toivonen, Vesa Ollikainen
KDD
2005
ACM
130views Data Mining» more  KDD 2005»
16 years 5 months ago
Regression error characteristic surfaces
This paper presents a generalization of Regression Error Characteristic (REC) curves. REC curves describe the cumulative distribution function of the prediction error of models an...
Luís Torgo