
12 years 6 days ago
Movie-DiC: a Movie Dialogue Corpus for Research and Development
This paper describes Movie-DiC, a Movie Dialogue Corpus recently collected for research and development purposes. The collected dataset comprises 132,229 dialogues containing a tot...
Rafael E. Banchs
12 years 6 days ago
Pattern Learning for Relation Extraction with a Hierarchical Topic Model
We describe the use of a hierarchical topic model for automatically identifying syntactic and lexical patterns that explicitly state ontological relations. We leverage distant sup...
Enrique Alfonseca, Katja Filippova, Jean-Yves Delo...
12 years 6 days ago
Spectral Learning of Latent-Variable PCFGs
We introduce a spectral learning algorithm for latent-variable PCFGs (Petrov et al., 2006). Under a separability (singular value) condition, we prove that the method provides cons...
Shay B. Cohen, Karl Stratos, Michael Collins, Dean...
12 years 6 days ago
Incremental Joint Approach to Word Segmentation, POS Tagging, and Dependency Parsing in Chinese
We propose the first joint model for word segmentation, POS tagging, and dependency parsing for Chinese. Based on an extension of the incremental joint model for POS tagging and ...
Jun Hatori, Takuya Matsuzaki, Yusuke Miyao, Jun-ic...
12 years 6 days ago
Learning Syntactic Verb Frames using Graphical Models
We present a novel approach for building verb subcategorization lexicons using a simple graphical model. In contrast to previous methods, we show how the model can be trained with...
Thomas Lippincott, Anna Korhonen, Diarmuid Ó...
12 years 6 days ago
Temporally Anchored Relation Extraction
Although much work on relation extraction has aimed at obtaining static facts, many of the target relations are actually fluents, as their validity is naturally anchored to a cer...
Guillermo Garrido, Anselmo Peñas, Bernardo ...
12 years 6 days ago
Word Epoch Disambiguation: Finding How Words Change Over Time
In this paper we introduce the novel task of “word epoch disambiguation,” defined as the problem of identifying changes in word usage over time. Through experiments run using...
Rada Mihalcea, Vivi Nastase
12 years 6 days ago
Using Search-Logs to Improve Query Tagging
Syntactic analysis of search queries is important for a variety of information-retrieval tasks; however, the lack of annotated data makes training query analysis models difficult...
Kuzman Ganchev, Keith Hall, Ryan T. McDonald, Slav...
12 years 6 days ago
Learning to "Read Between the Lines" using Bayesian Logic Programs
Most information extraction (IE) systems identify facts that are explicitly stated in text. However, in natural language, some facts are implicit, and identifying them requires ...
Sindhu Raghavan, Raymond J. Mooney, Hyeonseo Ku