Sciweavers

17688 search results - page 99 / 3538
» Data Set Balancing
Sort
View
ICDE
2003
IEEE
146views Database» more  ICDE 2003»
16 years 5 months ago
Similarity Search in Sets and Categorical Data Using the Signature Tree
Data mining applications analyze large collections of set data and high dimensional categorical data. Search on these data types is not restricted to the classic problems of minin...
Nikos Mamoulis, David W. Cheung, Wang Lian
VL
1996
IEEE
157views Visual Languages» more  VL 1996»
15 years 8 months ago
Visualizing Program Executions on Large Data Sets
Understanding and interpreting a large data source is an important but challenging operation in many technical disciplines. Computer visualization has become a valuable tool to he...
John T. Stasko, Jeyakumar Muthukumarasamy
ICDT
2001
ACM
124views Database» more  ICDT 2001»
15 years 8 months ago
Mining for Empty Rectangles in Large Data Sets
Abstract. Many data mining approaches focus on the discovery of similar (and frequent) data values in large data sets. We present an alternative, but complementary approach in whic...
Jeff Edmonds, Jarek Gryz, Dongming Liang, Ren&eacu...
144
Voted
PVLDB
2010
126views more  PVLDB 2010»
15 years 2 months ago
Set Similarity Join on Probabilistic Data
Set similarity join has played an important role in many real-world applications such as data cleaning, near duplication detection, data integration, and so on. In these applicati...
Xiang Lian, Lei Chen 0002
166
Voted
JMLR
2006
135views more  JMLR 2006»
15 years 4 months ago
Statistical Comparisons of Classifiers over Multiple Data Sets
While methods for comparing two learning algorithms on a single data set have been scrutinized for quite some time already, the issue of statistical tests for comparisons of more ...
Janez Demsar