Sciweavers

PVLDB
2010
Interesting-Phrase Mining for Ad-Hoc Text Analytics
Large text corpora with news, customer mail and reports, or Web 2.0 contributions offer a great potential for enhancing business-intelligence applications. We propose a framework ...
Srikanta J. Bedathur, Klaus Berberich, Jens Dittri...
PVLDB
2010
Dremel: Interactive Analysis of Web-Scale Datasets
Dremel is a scalable, interactive ad-hoc query system for analysis of read-only nested data. By combining multi-level execution trees and columnar data layout, it is capable of ru...
Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geo...
PVLDB
2010
High-Performance Dynamic Pattern Matching over Disordered Streams
Current pattern-detection proposals for streaming data recognize the need to move beyond a simple regular-expression model over strictly ordered input. We continue in this directi...
Badrish Chandramouli, Jonathan Goldstein, David Ma...
PVLDB
2010
Advanced Processing for Ontological Queries
Ontology-based data access is a powerful form of extending database technology, where a classical extensional database (EDB) is enhanced by an ontology that generates new intensio...
Andrea Calì, Georg Gottlob, Andreas Pieris
PVLDB
2010
Explore or Exploit? Effective Strategies for Disambiguating Large Databases
Data ambiguity is inherent in applications such as data integration, location-based services, and sensor monitoring. In many situations, it is possible to “clean”, or remove, ...
Reynold Cheng, Eric Lo, Xuan Yang, Ming-Hay Luk, X...
PVLDB
2010
Querying Probabilistic Information Extraction
Recently, there has been increasing interest in extending relational query processing to include data obtained from unstructured sources. A common approach is to use stand-alone I...
Daisy Zhe Wang, Michael J. Franklin, Minos N. Garo...
PVLDB
2010
Big Data and Cloud Computing: New Wine or just New Bottles?
Divyakant Agrawal, Sudipto Das, Amr El Abbadi
PVLDB
2010
Entity Resolution with Evolving Rules
Entity resolution (ER) identifies database records that refer to the same real world entity. In practice, ER is not a one-time process, but is constantly improved as the data, sc...
Steven Whang, Hector Garcia-Molina
PVLDB
2010
Shortest Path Computation on Air Indexes
Shortest path computation is one of the most common queries in location-based services that involve transportation networks. Motivated by scalability challenges faced in the mobil...
Georgios Kellaris, Kyriakos Mouratidis
PVLDB
2010
Database-support for Continuous Prediction Queries over Streaming Data
Prediction is emerging as an essential ingredient for real-time monitoring, planning and decision support applications such as intrusion detection, e-commerce pricing and automate...
Mert Akdere, Ugur Çetintemel, Eli Upfal