real data sets | Sciweavers


WG
2010
Springer

Theoretical Computer Science

Generalized Graph Clustering: Recognizing (p, q)-Cluster Graphs

15 years 5 months ago

Cluster Editing is a classical graph-theoretic approach to tackle the problem of data set clustering: it consists of modifying a similarity graph into a disjoint union of cliques,...

Pinar Heggernes, Daniel Lokshtanov, Jesper Nederlo...


PVLDB
2010


Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux

15 years 5 months ago


We present Data Auditor, a tool for exploring data quality and data semantics. Given a rule or an integrity constraint and a target relation, Data Auditor computes pattern tableau...

Lukasz Golab, Howard J. Karloff, Flip Korn, Divesh...


GEOINFORMATICA
2002


On the Generation of Time-Evolving Regional Data

15 years 6 months ago


Benchmarking of spatio-temporal databases is an issue of growing importance. In case large real data sets are not available, benchmarking requires the generation of arti...

Theodoros Tzouramanis, Michael Vassilakopoulos, Ya...


JIIS
2006


Holes in joins

15 years 6 months ago


A join of two relations in real databases is usually much smaller than their Cartesian product. This means that most of the combinations of tuples in the cross product of the respe...

Jarek Gryz, Dongming Liang


CIKM
2000
Springer

Information Technology

Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets

15 years 11 months ago


With the proliferation of multimedia data, there is an increasing need to support the indexing and searching of high-dimensional data. Recently, a vector approximation based techniqu...

Hakan Ferhatosmanoglu, Ertem Tuncel, Divyakant Agr...


ICDM
2003
IEEE

Data Mining

Analyzing High-Dimensional Data by Subspace Validity

15 years 12 months ago


We propose a novel method that makes it possible to analyze high-dimensional data with arbitrarily shaped projected clusters and high noise levels. At the core of our method l...

Amihood Amir, Reuven Kashi, Nathan S. Netanyahu, D...


SIGMOD
2009
ACM

Database

Ranking distributed probabilistic data

16 years 6 months ago


Ranking queries are essential tools to process large amounts of probabilistic data that encode exponentially many possible deterministic instances. In many applications where unce...

Feifei Li, Ke Yi, Jeffrey Jestes


VLDB
2009
ACM

Database

Anytime measures for top-k algorithms on exact and fuzzy data sets

16 years 6 months ago


Top-k queries on large multi-attribute data sets are fundamental operations in information retrieval and ranking applications. In this article, we initiate research on the anytime ...

Benjamin Arai, Gautam Das, Dimitrios Gunopulos, Ni...


KDD
2003
ACM

Data Mining

Classifying large data sets using SVMs with hierarchical clusters

16 years 7 months ago


Support vector machines (SVMs) have been promising methods for classification and regression analysis because of their solid mathematical foundations which convey several salient ...

Hwanjo Yu, Jiong Yang, Jiawei Han


KDD
2003
ACM

Data Mining

Fragments of order

16 years 7 months ago


High-dimensional collections of 0-1 data occur in many applications. The attributes in such data sets are typically considered to be unordered. However, in many cases there is a n...

Aristides Gionis, Teija Kujala, Heikki Mannila
