Sciweavers

571 search results - page 3 / 115
» Testing homogeneity of a large data set by bootstrapping
Sort
View
CSDA
2008
94views more  CSDA 2008»
13 years 7 months ago
Robust model selection using fast and robust bootstrap
Robust model selection procedures control the undue influence that outliers can have on the selection criteria by using both robust point estimators and a bounded loss function wh...
Matias Salibian-Barrera, Stefan Van Aelst
ICDM
2008
IEEE
137views Data Mining» more  ICDM 2008»
14 years 1 months ago
Iterative Set Expansion of Named Entities Using the Web
Set expansion refers to expanding a partial set of “seed” objects into a more complete set. One system that does set expansion is SEAL (Set Expander for Any Language), which e...
Richard C. Wang, William W. Cohen
KDD
2001
ACM
216views Data Mining» more  KDD 2001»
14 years 7 months ago
The distributed boosting algorithm
In this paper, we propose a general framework for distributed boosting intended for efficient integrating specialized classifiers learned over very large and distributed homogeneo...
Aleksandar Lazarevic, Zoran Obradovic
VLDB
2004
ACM
121views Database» more  VLDB 2004»
14 years 21 days ago
An Automatic Data Grabber for Large Web Sites
We demonstrate a system to automatically grab data from data intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
CLEF
2010
Springer
13 years 8 months ago
Bootstrapping Websites for Classification of Organization Names on Twitter
There has been a growing interest in monitoring the social media presence of companies for improved marketing. Many public APIs are available for tapping into the data, and there a...
Paul Kalmar