Sciweavers

542 search results - page 15 / 109
» Learning author-topic models from text corpora
Sort
View
ICDAR
2011
IEEE
12 years 7 months ago
Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments
- Large-scale digitisation has led to a number of new possibilities with regard to adaptive and learning based methods in the field of Document Image Analysis and OCR. For ground t...
C. Clausner, Stefan Pletschacher, Apostolos Antona...
ICDE
2012
IEEE
205views Database» more  ICDE 2012»
11 years 10 months ago
Optimizing Statistical Information Extraction Programs over Evolving Text
—Statistical information extraction (IE) programs are increasingly used to build real-world IE systems such as Alibaba, CiteSeer, Kylin, and YAGO. Current statistical IE approach...
Fei Chen, Xixuan Feng, Christopher Re, Min Wang
COLING
2002
13 years 7 months ago
The Computation of Word Associations: Comparing Syntagmatic and Paradigmatic Approaches
It is shown that basic language processes such as the production of free word associations and the generation of synonyms can be simulated using statistical models that analyze th...
Reinhard Rapp
CSL
2007
Springer
13 years 7 months ago
Automatic phonetic transcription of large speech corpora
This study is aimed at investigating whether automatic phonetic transcription procedures can approximate manual transcriptions typically delivered with contemporary large speech c...
Christophe Van Bael, Lou Boves, Henk van den Heuve...
ICMLA
2007
13 years 9 months ago
Semi-Supervised Active Learning for Modeling Medical Concepts from Free Text
We apply a new active learning formulation to the problem of learning medical concepts from unstructured text. The new formulation is based on maximizing the mutual information th...
Rómer Rosales, Praveen Krishnamurthy, R. Bh...