In data stream processing systems, Quality of Service (or QoS) requirements, as specified by users, are extremely important. Unlike in a database management system (DBMS), a query...
We propose an efficient sampling based outlier detection method for large high-dimensional data. Our method consists of two phases. In the first phase, we combine a "sampling...
Timothy de Vries, Sanjay Chawla, Pei Sun, Gia Vinh...
Distance measure plays an important role in clustering data points. Choosing the right distance measure for a given dataset is a non-trivial problem. In this paper, we study vario...
Ankita Vimal, Satyanarayana R. Valluri, Kamalakar ...
Today, information integration has assumed a completely different, complex connotation than what it used to be. The advent of the Internet, the proliferation of information source...
In this paper, we address the problem of query formulation in the context of multi-domain integration of heterogeneous data on the Web. We argue that effectively tackling this pro...
Answering aggregate queries like sum, count, min, max over regions containing moving objects is often needed for virtual world applications, real-time monitoring systems, etc. Sin...
Huge amount of information is present in the World Wide Web and a large amount is being added to it frequently. A query-specific summary of multiple documents is very helpful to t...
Repositories like arXiv1 and knowledge bases like CiteSeer2 are increasingly becoming central to academicians and researchers. However, current systems provide too little semantic...
Orion is a state-of-the-art uncertain database management system that extends the relational model to include probabilistic uncertain data as first call data types. This demonstra...
Sarvjeet Singh, Chris Mayfield, Sagar Mittal, Suni...