Sciweavers

542 search results - page 4 / 109
» Learning author-topic models from text corpora
Sort
View
EMNLP
2011
12 years 7 months ago
Inducing Sentence Structure from Parallel Corpora for Reordering
When translating among languages that differ substantially in word order, machine translation (MT) systems benefit from syntactic preordering—an approach that uses features fro...
John DeNero, Jakob Uszkoreit
ACL
1993
13 years 9 months ago
Automatic Acquisition of a Large Subcategorization Dictionary from Corpora
This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora. It is shown that statistical filtering of the results of a ...
Christopher D. Manning
ICML
2005
IEEE
14 years 8 months ago
Hierarchical Dirichlet model for document classification
The proliferation of text documents on the web as well as within institutions necessitates their convenient organization to enable efficient retrieval of information. Although tex...
Sriharsha Veeramachaneni, Diego Sona, Paolo Avesan...
SOFSEM
2000
Springer
13 years 11 months ago
Towards High Speed Grammar Induction on Large Text Corpora
Abstract. In this paper we describe an e cient and scalable implementation for grammar induction based on the EMILE approach ( 2], 3], 4], 5], 6]). The current EMILE 4.1 implementa...
Pieter W. Adriaans, Marten Trautwein, Marco Vervoo...
ACL
2010
13 years 5 months ago
Learning Common Grammar from Multilingual Corpus
We propose a corpus-based probabilistic framework to extract hidden common syntax across languages from non-parallel multilingual corpora in an unsupervised fashion. For this purp...
Tomoharu Iwata, Daichi Mochihashi, Hiroshi Sawada