Search Sciweavers | Sciweavers

542 search results - page 32 / 109

» Learning author-topic models from text corpora

ML
2000
ACM

124 views, Machine Learning

Text Classification from Labeled and Unlabeled Documents using EM

15 years 3 months ago

Download www.kamalnigam.com

This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. ...

Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...

COLING
2010

126 views, Computational Linguistics

Broad Coverage Multilingual Deep Sentence Generation with a Stochastic Multi-Level Realizer

14 years 10 months ago

Download www.aclweb.org

Most of the known stochastic sentence generators use syntactically annotated corpora, performing the projection to the surface in one stage. However, in full-fledged text generati...

Bernd Bohnet, Leo Wanner, Simon Mille, Alicia Burg...

INTERSPEECH
2010

105 views, Signal Processing

Learning a language model from continuous speech

14 years 10 months ago

Download www.phontron.com

This paper presents a new approach to language model construction, learning a language model not from text, but directly from continuous speech. A phoneme lattice is created using...

Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsu...

CVPR
2011
IEEE

342 views, Computer Vision

Enforcing Similarity Constraints with Integer Programming for Better Scene Text Recognition

14 years 11 months ago

Download vis-www.cs.umass.edu

The recognition of text in everyday scenes is made difficult by viewing conditions, unusual fonts, and lack of linguistic context. Most methods integrate a priori appearance info...

David Smith, Jacqueline Feild, Eric Learned-Miller

LREC
2008

108 views, Education

A Lightweight and Efficient Tool for Cleaning Web Pages

15 years 4 months ago

Download www.lrec-conf.org

Originally conceived as a "naive" baseline experiment using traditional n-gram language models as classifiers, the NCLEANER system has turned out to be a fast and lightw...

Stefan Evert
