Search Sciweavers | Sciweavers

415 search results - page 12 / 83

» Finding nuggets in documents: A machine learning approach

click to vote

CIKM
2005
Springer

125views Information Technology» more CIKM 2005»

Learning to summarise XML documents using content and structure

14 years 3 months ago

Download eprints.pascal-network.org

Documents formatted in eXtensible Markup Language (XML) are becoming increasingly available in collections of various document types. In this paper, we present an approach for the...

Massih-Reza Amini, Anastasios Tombros, Nicolas Usu...

claim paper

Read More »

click to vote

EMNLP
2007

118views Natural Language Processing» more EMNLP 2007»

Learning to Find English to Chinese Transliterations on the Web

13 years 11 months ago

Download www.aclweb.org

We present a method for learning to find English to Chinese transliterations on the Web. In our approach, proper nouns are expanded into new queries aimed at maximizing the probab...

Jian-Cheng Wu, Jason S. Chang

claim paper

Read More »

click to vote

ICML
2006
IEEE

158views Machine Learning» more ICML 2006»

Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution

14 years 11 months ago

Download cseweb.ucsd.edu

The Dirichlet compound multinomial (DCM) distribution, also called the multivariate Polya distribution, is a model for text documents that takes into account burstiness: the fact ...

Charles Elkan

claim paper

Read More »

click to vote

ECIR
2006
Springer

134views Information Technology» more ECIR 2006»

Automatic Document Organization in a P2P Environment

13 years 11 months ago

Download ir.shef.ac.uk

Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...

Stefan Siersdorfer, Sergej Sizov

claim paper

Read More »

click to vote

ERCIMDL
2010
Springer

180views Education» more ERCIMDL 2010»

SciPlore Xtract: Extracting Titles from Scientific PDF Documents by Analyzing Style Information (Font Size)

13 years 7 months ago

Download www.sciplore.org

Extracting titles from a PDFs full text is an important task in information retrieval to identify PDFs. Existing approaches apply complicated and expensive (in terms of calculating...

Jöran Beel, Bela Gipp, Ammar Shaker, Nick Fri...

claim paper

Read More »

« Prev « First page 12 / 83 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers