Sciweavers

88 search results - page 11 / 18
» On Using Extended Statistical Queries to Avoid Membership Qu...
ICANN
2009
Springer
14 years 8 days ago
Statistical Instance-Based Ensemble Pruning for Multi-class Problems
Recent research has shown that the provisional count of votes of an ensemble of classifiers can be used to estimate the probability that the final ensemble prediction coincides w...
Gonzalo Martínez-Muñoz, Daniel Hern...
ALENEX
2001
89 views, Algorithms
13 years 9 months ago
Estimating Resemblance of MIDI Documents
Search engines often employ techniques for determining syntactic similarity of Web pages. Such a tool allows them to avoid returning multiple copies of essentially the sa...
Michael Mitzenmacher, Sean Owen
DEXAW
2009
IEEE
129 views, Database
14 years 2 months ago
RDFStats - An Extensible RDF Statistics Generator and Library
In this paper RDFStats is introduced, which is a generator for statistics of RDF sources like SPARQL endpoints and RDF documents. RDFStats does not only provide a statistics gen...
Andreas Langegger, Wolfram Wöß
ICDE
2003
IEEE
160 views, Database
14 years 9 months ago
SWAT: Hierarchical Stream Summarization in Large Networks
The problem of statistics and aggregate maintenance over data streams has gained popularity in recent years, especially in telecommunications network monitoring, trend-related anal...
Ahmet Bulut, Ambuj K. Singh
ICDE
2010
IEEE
408 views, Database
14 years 2 months ago
Hive - a petabyte scale data warehouse using Hadoop
The size of data sets being collected and analyzed in the industry for business intelligence is growing rapidly, making traditional warehousing solutions prohibitively expensiv...
Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zhen...