Search Sciweavers | Sciweavers

182 search results - page 29 / 37

» Probabilistic Document Length Priors for Language Models

133

click to vote

SIGIR
2010
ACM

172views Information Technology» more SIGIR 2010»

Positional relevance model for pseudo-relevance feedback

15 years 7 months ago

Download sifaka.cs.uiuc.edu

Pseudo-relevance feedback is an eﬀective technique for improving retrieval results. Traditional feedback algorithms use a whole feedback document as a unit to extract words for ...

Yuanhua Lv, ChengXiang Zhai

claim paper

Read More »

149

click to vote

ICDAR
2007
IEEE

193views Document Analysis» more ICDAR 2007»

Fast Lexicon-Based Scene Text Recognition with Sparse Belief Propagation

15 years 10 months ago

Download www.cs.umass.edu

Using a lexicon can often improve character recognition under challenging conditions, such as poor image quality or unusual fonts. We propose a ﬂexible probabilistic model for c...

Jerod J. Weinman, Erik G. Learned-Miller, Allen R....

claim paper

Read More »

169

click to vote

WWW
2008
ACM

201views Internet Technology» more WWW 2008»

Learning deterministic regular expressions for the inference of schemas from XML data

16 years 4 months ago

Download www2008.org

Inferring an appropriate DTD or XML Schema Definition (XSD) for a given collection of XML documents essentially reduces to learning deterministic regular expressions from sets of ...

Geert Jan Bex, Wouter Gelade, Frank Neven, Stijn V...

claim paper

Read More »

149

click to vote

CIKM
2011
Springer

211views Information Technology» more CIKM 2011»

S3K: seeking statement-supporting top-K witnesses

14 years 3 months ago

Download www.mpi-inf.mpg.de

Traditional information retrieval techniques based on keyword search help to identify a ranked set of relevant documents, which often contains many documents in the top ranks that...

Steffen Metzger, Shady Elbassuoni, Katja Hose, Ral...

claim paper

Read More »

107

click to vote

ICML
2006
IEEE

180views Machine Learning» more ICML 2006»

Topic modeling: beyond bag-of-words

16 years 4 months ago

Download people.ee.duke.edu

Some models of textual corpora employ text generation methods involving n-gram statistics, while others use latent topic variables inferred using the "bag-of-words" assu...

Hanna M. Wallach

claim paper

Read More »

« Prev « First page 29 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers