Search Sciweavers | Sciweavers

88 search results - page 2 / 18

» Process Model for Composing High-quality Text Corpora

123

click to vote

ACL
2009

160views Computational Linguistics» more ACL 2009»

Active Learning for Multilingual Statistical Machine Translation

15 years 10 days ago

Download aclweb.org

Statistical machine translation (SMT) models require bilingual corpora for training, and these corpora are often multilingual with parallel text in multiple languages simultaneous...

Gholamreza Haffari, Anoop Sarkar

claim paper

Read More »

129

Voted

EACL
2006
ACL Anthology

91views Natural Language Processing» more EACL 2006»

Multilingual Term Extraction from Domain-specific Corpora Using Morphological Structure

15 years 3 months ago

Download acl.ldc.upenn.edu

Morphologically complex terms composed from Greek or Latin elements are frequent in scientific and technical texts. Word forming units are thus relevant cues for the identificatio...

Delphine Bernhard

claim paper

Read More »

125

click to vote

KDD
2010
ACM

233views Data Mining» more KDD 2010»

Evolutionary hierarchical dirichlet processes for multiple correlated time-varying corpora

15 years 6 months ago

Download research.microsoft.com

Mining cluster evolution from multiple correlated time-varying text corpora is important in exploratory text analytics. In this paper, we propose an approach called evolutionary h...

Jianwen Zhang, Yangqiu Song, Changshui Zhang, Shix...

claim paper

Read More »

122

click to vote

TOIS
2010

128views more TOIS 2010»

Learning author-topic models from text corpora

15 years 26 days ago

Download www.ics.uci.edu

We propose a new unsupervised learning technique for extracting information about authors and topics from large text collections. We model documents as if they were generated by a...

Michal Rosen-Zvi, Chaitanya Chemudugunta, Thomas L...

claim paper

Read More »

101

Voted

SIGIR
2006
ACM

139views Information Technology» more SIGIR 2006»

Improving the estimation of relevance models using large external corpora

15 years 8 months ago

Download ciir.cs.umass.edu

Information retrieval algorithms leverage various collection statistics to improve performance. Because these statistics are often computed on a relatively small evaluation corpus...

Fernando Diaz, Donald Metzler

claim paper

Read More »

« Prev « First page 2 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers