Sciweavers

ECCV
2010
Springer
14 years 6 months ago
Word Spotting in the Wild
We present a method for spotting words in the wild, i.e., in real images taken in unconstrained environments. Text found in the wild has a surprising range of difficulty. At one en...
VLDB
2004
ACM
14 years 6 months ago
Database Challenges in the Integration of Biomedical Data Sets
The clinical and basic science research domains present exciting and difficult data integration issues. Solving these problems is crucial as current research efforts in the field ...
Rakesh Nagarajan, Mushtaq Ahmed, Aditya Phatak
VLDB
2004
ACM
14 years 6 months ago
High-Dimensional OLAP: A Minimal Cubing Approach
Data cube has been playing an essential role in fast OLAP (online analytical processing) in many multi-dimensional data warehouses. However, there exist data sets in applications ...
Xiaolei Li, Jiawei Han, Hector Gonzalez
VLDB
2004
ACM
14 years 6 months ago
Approximate NN queries on Streams with Guaranteed Error/performance Bounds
In data stream applications, data arrive continuously and can only be scanned once as the query processor has very limited memory (relative to the size of the stream) to work with...
Nick Koudas, Beng Chin Ooi, Kian-Lee Tan, Rui Zhan...
VLDB
2004
ACM
14 years 6 months ago
Structures, Semantics and Statistics
At a fundamental level, the key challenge in data integration is to reconcile the semantics of disparate data sets, each expressed with a different database structure. I argue th...
Alon Y. Halevy
MICAI
2004
Springer
14 years 6 months ago
An Improved ICP Algorithm Based on the Sensor Projection for Automatic 3D Registration
Three-dimensional (3D) registration is the process of aligning the range data sets from different views in a common coordinate system. In order to generate a complete 3D model, we nee...
Sang-Hoon Kim, Yong Ho Hwang, Hyun-Ki Hong, Min-Hy...
KELSI
2004
Springer
14 years 6 months ago
Improving Rule Induction Precision for Automated Annotation by Balancing Skewed Data Sets
There is an overwhelming increase in submissions to genomic databases, posing a problem for database maintenance, especially regarding annotation of fields left blank during submi...
Gustavo E. A. P. A. Batista, Maria Carolina Monard...
ICA
2004
Springer
14 years 6 months ago
Second-Order Blind Source Separation Based on Multi-dimensional Autocovariances
SOBI is a blind source separation algorithm based on time decorrelation. It uses multiple time autocovariance matrices, and performs joint diagonalization thus being more robust th...
Fabian J. Theis, Anke Meyer-Bäse, Elmar Wolfg...
GFKL
2004
Springer
14 years 6 months ago
Density Estimation and Visualization for Data Containing Clusters of Unknown Structure
A method for measuring the density of data sets that contain an unknown number of clusters of unknown sizes is proposed. This method, called Pareto Density Estimation (PD...
Alfred Ultsch
DILS
2004
Springer
14 years 6 months ago
Heterogeneous Data Integration with the Consensus Clustering Formalism
Meaningfully integrating massive multi-experimental genomic data sets is becoming critical for the understanding of gene function. We have recently proposed methodologies for integ...
Vladimir Filkov, Steven Skiena