Sciweavers

78 search results - page 4 / 16
» Learning Common Grammar from Multilingual Corpus
Sort
View
LREC
2008
88views Education» more  LREC 2008»
13 years 9 months ago
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization
Tokenization is one of the initial steps done for almost any text processing task. It is not particularly recognized as a challenging task for English monolingual systems but it r...
Oana Frunza
JMLR
2010
192views more  JMLR 2010»
13 years 2 months ago
Inducing Tree-Substitution Grammars
Inducing a grammar from text has proven to be a notoriously challenging learning task despite decades of research. The primary reason for its difficulty is that in order to induce...
Trevor Cohn, Phil Blunsom, Sharon Goldwater
ACL
1998
13 years 9 months ago
Automatic Acquisition of Language Model based on Head-Dependent Relation between Words
Language modeling is to associate a sequence of words with a priori probability, which is a key part of many natural language applications such as speech recognition and statistic...
Seungmi Lee, Key-Sun Choi
LREC
2008
120views Education» more  LREC 2008»
13 years 9 months ago
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
We introduce the corpus of United States Congressional bills from 1947 to 1998 for use by language research communities. The U.S. Policy Agenda Legislation Corpus Volume 1 (USPALC...
Stephen Purpura, John Wilkerson, Dustin Hillard
BIRTHDAY
2009
Springer
14 years 2 months ago
Formal Grammars of Early Language
We propose to model the development of language by a series of formal grammars, accounting for the linguistic capacity of children at the very early stages of mastering language. T...
Shuly Wintner, Alon Lavie, Brian MacWhinney