Sciweavers

SDM
2007
SIAM
171views Data Mining» more  SDM 2007»
13 years 10 months ago
A Better Alternative to Piecewise Linear Time Series Segmentation
Time series are difficult to monitor, summarize and predict. Segmentation organizes time series into few intervals having uniform characteristics (flatness, linearity, modality,...
Daniel Lemire
SDM
2007
SIAM
138views Data Mining» more  SDM 2007»
13 years 10 months ago
Fast Newton-type Methods for the Least Squares Nonnegative Matrix Approximation Problem
Dongmin Kim, Suvrit Sra, Inderjit S. Dhillon
SDM
2007
SIAM
182views Data Mining» more  SDM 2007»
13 years 10 months ago
Distance Preserving Dimension Reduction for Manifold Learning
Manifold learning is an effective methodology for extracting nonlinear structures from high-dimensional data with many applications in image analysis, computer vision, text data a...
Hyunsoo Kim, Haesun Park, Hongyuan Zha
SDM
2007
SIAM
190views Data Mining» more  SDM 2007»
13 years 10 months ago
AC-Framework for Privacy-Preserving Collaboration
The secure multi-party computation (SMC) model provides means for balancing the use and confidentiality of distributed data. Increasing security concerns have led to a surge in w...
Wei Jiang, Chris Clifton
SDM
2007
SIAM
106views Data Mining» more  SDM 2007»
13 years 10 months ago
Approximating Representations for Large Numerical Databases
The paper introduces a notion of support for realvalued functions. It is shown how to approximate supports of a large class of functions based on supports of so called polynomial ...
Szymon Jaroszewicz, Marcin Korzen
SDM
2007
SIAM
133views Data Mining» more  SDM 2007»
13 years 10 months ago
Change-Point Detection using Krylov Subspace Learning
We propose an efficient algorithm for principal component analysis (PCA) that is applicable when only the inner product with a given vector is needed. We show that Krylov subspace...
Tsuyoshi Idé, Koji Tsuda
SDM
2007
SIAM
98views Data Mining» more  SDM 2007»
13 years 10 months ago
Lattice based Clustering of Temporal Gene-Expression Matrices
Individuals show different cell classes when they are in the different stages of a disease, have different disease subtypes, or have different response to a treatment or envir...
Yang Huang, Martin Farach-Colton
SDM
2007
SIAM
103views Data Mining» more  SDM 2007»
13 years 10 months ago
A System for Keyword Search on Textual Streams
An increasing amount of data is produced in the form of text streams − these can be RSS news feeds, TV closed captions, emails, etc. We study the problem of answering keyword qu...
Vagelis Hristidis, Oscar Valdivia, Michail Vlachos...
SDM
2007
SIAM
204views Data Mining» more  SDM 2007»
13 years 10 months ago
Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach
k-anonymity is a popular measure of privacy for data publishing: It measures the risk of identity-disclosure of individuals whose personal information are released in the form of ...
Bijit Hore, Ravi Chandra Jammalamadaka, Sharad Meh...