Sciweavers

» The Inefficiency of Batch Training for Large Training Sets
FLAIRS
2006
13 years 10 months ago
Corpus Based Unsupervised Labeling of Documents
Text categorization involves mapping of documents to a fixed set of labels. A similar but equally important problem is that of assigning labels to large corpora. With a deluge of ...
Delip Rao, Deepak P, Deepak Khemani
ESANN
2003
13 years 10 months ago
Semi-automatic acquisition and labelling of image data using SOMs
Application of neural networks for real world object recognition suffers from the need to acquire large quantities of labelled image data. We propose a solution that acq...
Gunther Heidemann, Axel Saalbach, Helge Ritter
AAAI
1998
13 years 10 months ago
Boosting in the Limit: Maximizing the Margin of Learned Ensembles
The "minimum margin" of an ensemble classifier on a given training set is, roughly speaking, the smallest vote it gives to any correct training label. Recent work has sh...
Adam J. Grove, Dale Schuurmans
ECIR
2008
Springer
13 years 10 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
CLEF
2010
Springer
13 years 10 months ago
Bootstrapping Websites for Classification of Organization Names on Twitter
There has been a growing interest in monitoring the social media presence of companies for improved marketing. Many public APIs are available for tapping into the data, and there a...
Paul Kalmar