Automated Data Discovery in Similarity Score Queries

14 years 6 months ago

Download www.cse.ohio-state.edu

A vast amount of information is being stored in scientiﬁc databases on the web. The dynamic nature of the scientiﬁc data, the cost of providing an up-to-date snapshot of the whole database, and proprietary considerations compel the database owners to hide the original data behind search interfaces. The information is often provided to researchers through similarity-search query interfaces, which limits a proper and focused analysis of the data. In this study, we present systematic methods of data discovery through similarity-score queries in such “uncooperative” databases. The methods are generalized to multidimensional data, and to L-p norm distance functions. The accuracy and performance of our methods are demonstrated on synthetic and real-life datasets. The methods developed in this study enable the scientists to obtain the data within the range of their research interests, overcoming the limitations of the similarity-search interface. The results of this study also present...

Fatih Altiparmak, Ali Saman Tosun, Hakan Ferhatosm

Real-time Traffic

DASFAA 2008 | Database | Original Data | Scientiﬁc Data | Similarity-search Query Interfaces |

claim paper

Post Info
More Details (n/a)

Added	29 May 2010
Updated	29 May 2010
Type	Conference
Year	2008
Where	DASFAA
Authors	Fatih Altiparmak, Ali Saman Tosun, Hakan Ferhatosmanoglu, Ahmet Sacan

Comments (0)

Sciweavers

Automated Data Discovery in Similarity Score Queries

DASFAA 2008 | Database | Original Data | Scientiﬁc Data | Similarity-search Query Interfaces |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers