Sciweavers

161 search results - page 22 / 33
» Workshop on massive datasets
KDD
2004
ACM
170 views, Data Mining
14 years 22 days ago
Estimating the size of the telephone universe: a Bayesian Mark-recapture approach
Mark-recapture models have for many years been used to estimate the unknown sizes of animal and bird populations. In this article we adapt a finite mixture mark-recapture model i...
David Poole
SIGMOD
2010
ACM
321 views, Database
14 years 5 days ago
HadoopDB in action: building real world applications
HadoopDB is a hybrid of MapReduce and DBMS technologies, designed to meet the growing demand of analyzing massive datasets on very large clusters of machines. Our previous work ha...
Azza Abouzied, Kamil Bajda-Pawlikowski, Jiewen Hua...
MM
2009
ACM
233 views, Multimedia
14 years 1 day ago
Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression
Photo community sites such as Flickr and Picasa Web Album host a massive amount of personal photos with millions of new photos uploaded every month. These photos constitute an ove...
Liangliang Cao, Jie Yu, Jiebo Luo, Thomas S. Huang
ICDE
2010
IEEE
290 views, Database
13 years 11 months ago
The Model-Summary Problem and a Solution for Trees
Modern science is collecting massive amounts of data from sensors, instruments, and through computer simulation. It is widely believed that analysis of this data will hold the key ...
Biswanath Panda, Mirek Riedewald, Daniel Fink
SDM
2010
SIAM
115 views, Data Mining
13 years 8 months ago
Radius Plots for Mining Tera-byte Scale Graphs: Algorithms, Patterns, and Observations
Given large, multi-million node graphs (e.g., FaceBook, web-crawls, etc.), how do they evolve over time? How are they connected? What are the central nodes and the outliers of the...
U. Kang, Charalampos E. Tsourakakis, Ana Paula App...