Sciweavers

1091 search results - page 39 / 219
» Approximation in Databases
Sort
View
KDD
2012
ACM
235views Data Mining» more  KDD 2012»
11 years 11 months ago
A near-linear time approximation algorithm for angle-based outlier detection in high-dimensional data
Outlier mining in d-dimensional point sets is a fundamental and well studied data mining task due to its variety of applications. Most such applications arise in high-dimensional ...
Ninh Pham, Rasmus Pagh
WWW
2005
ACM
14 years 9 months ago
Modeling the author bias between two on-line computer science citation databases
We examine the difference and similarities between two online computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manually while the C...
Vaclav Petricek, Ingemar J. Cox, Hui Han, Isaac G....
ICDE
2008
IEEE
152views Database» more  ICDE 2008»
14 years 10 months ago
Efficient Merging and Filtering Algorithms for Approximate String Searches
We study the following problem: how to efficiently find in a collection of strings those similar to a given query string? Various similarity functions can be used, such as edit dis...
Chen Li, Jiaheng Lu, Yiming Lu
VLDB
2007
ACM
179views Database» more  VLDB 2007»
14 years 8 months ago
Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data
Market analysis is a representative data analysis process with many applications. In such an analysis, critical numerical measures, such as profit and sales, fluctuate over time a...
Xiaolei Li, Jiawei Han
PODS
2006
ACM
134views Database» more  PODS 2006»
14 years 8 months ago
Approximate quantiles and the order of the stream
Recently, there has been an increased focus on modeling uncertainty by distributions. Suppose we wish to compute a function of a stream whose elements are samples drawn independen...
Sudipto Guha, Andrew McGregor