Sciweavers

» On the Uniform Distribution of Strings
JACM
2012
13 years 8 months ago
Continuous sampling from distributed streams
A fundamental problem in data management is to draw and maintain a sample of a large data set, for approximate query answering, selectivity estimation, and query planning. With la...
Graham Cormode, S. Muthukrishnan, Ke Yi, Qin Zhang
KDD
2006
ACM
16 years 6 months ago
K-means clustering versus validation measures: a data distribution perspective
K-means is a widely used partitional clustering method. While there are considerable research efforts to characterize the key features of K-means clustering, further investigation...
Hui Xiong, Junjie Wu, Jian Chen
WWW
2009
ACM
16 years 25 days ago
Threshold selection for web-page classification with highly skewed class distribution
We propose a novel cost-efficient approach to threshold selection for binary web-page classification problems with imbalanced class distributions. In many binary-classification ta...
Xiaofeng He, Lei Duan, Yiping Zhou, Byron Dom
ICN
2009
Springer
16 years 18 days ago
Distributed Information Object Resolution
The established host-centric networking paradigm is challenged due to handicaps related with disconnected operation, mobility, and broken locator/identifier semantics. This paper...
Kostas Pentikousis
ICPP
2008
IEEE
16 years 14 days ago
Taming Single-Thread Program Performance on Many Distributed On-Chip L2 Caches
This paper presents a two-part study on managing distributed NUCA (Non-Uniform Cache Architecture) L2 caches in a future manycore processor to obtain high single-thread program per...
Lei Jin, Sangyeun Cho