Sciweavers

80 search results - page 5 / 16
» A Statistical Study of the WPT-03 Corpus
Sort
View
EMNLP
2009
13 years 6 months ago
Collocation Extraction Using Monolingual Word Alignment Method
Statistical bilingual word alignment has been well studied in the context of machine translation. This paper adapts the bilingual word alignment algorithm to monolingual scenario ...
Zhan-yi Liu, Haifeng Wang, Hua Wu, Sheng Li
BMCBI
2006
131views more  BMCBI 2006»
13 years 8 months ago
Statistical modeling of biomedical corpora: mining the Caenorhabditis Genetic Center Bibliography for genes related to life span
Background: The statistical modeling of biomedical corpora could yield integrated, coarse-to-fine views of biological phenomena that complement discoveries made from analysis of m...
David M. Blei, K. Franks, Michael I. Jordan, I. Sa...
LREC
2010
233views Education» more  LREC 2010»
13 years 10 months ago
The Development of a Morphosyntactic Tagset for Afrikaans and its Use with Statistical Tagging
In this paper, we present a morphosyntactic tagset for Afrikaans based on the guidelines developed by the Expert Advisory Group on Language Engineering Standards (EAGLES). We comp...
Boris Haselbach, Ulrich Heid
INFORMATICALT
2006
116views more  INFORMATICALT 2006»
13 years 8 months ago
Cache-based Statistical Language Models of English and Highly Inflected Lithuanian
This paper investigates a variety of statistical cache-based language models built upon three corpora: English, Lithuanian, and Lithuanian base forms. The impact of the cache size,...
Airenas Vaiciunas, Gailius Raskinis
MIE
2008
123views Healthcare» more  MIE 2008»
13 years 10 months ago
Searching Related Resources in a Quality Controlled Health Gateway: a Feasibility Study
Objective: The neighbors of a document are those documents in a corpus that are most similar to it. The objective of this paper is to develop and evaluate the related resources alg...
Tayeb Merabti, Suzanne Pereira, Catherine Letord, ...