Sciweavers

38 search results - page 7 / 8
» Multiple Query Optimization for Data Analysis Applications o...
Sort
View
CIKM
2011
Springer
12 years 8 months ago
Emerging topic detection using dictionary learning
Streaming user-generated content in the form of blogs, microblogs, forums, and multimedia sharing sites, provides a rich source of data from which invaluable information and insig...
Shiva Prasad Kasiviswanathan, Prem Melville, Arind...
CVPR
2003
IEEE
14 years 10 months ago
Generalized Principal Component Analysis (GPCA)
This paper presents an algebro-geometric solution to the problem of segmenting an unknown number of subspaces of unknown and varying dimensions from sample data points. We represen...
René Vidal, Shankar Sastry, Yi Ma
ICDE
2002
IEEE
102views Database» more  ICDE 2002»
14 years 9 months ago
Cost Models for Overlapping and Multi-Version B-Trees
Overlapping and multi-version techniques are two popular frameworks that transform an ephemeral index into a multiple logical-tree structure in order to support versioning databas...
Yufei Tao, Dimitris Papadias, Jun Zhang
WWW
2007
ACM
14 years 9 months ago
Efficient search in large textual collections with redundancy
Current web search engines focus on searching only the most recent snapshot of the web. In some cases, however, it would be desirable to search over collections that include many ...
Jiangong Zhang, Torsten Suel
SIGMOD
2011
ACM
210views Database» more  SIGMOD 2011»
12 years 11 months ago
A platform for scalable one-pass analytics using MapReduce
Today’s one-pass analytics applications tend to be data-intensive in nature and require the ability to process high volumes of data efficiently. MapReduce is a popular programm...
Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGreg...