Sciweavers

1683 search results - page 165 / 337
» Natural Language Processing (almost) from Scratch
TAL
2010
Springer
15 years 2 months ago
Summarization as Feature Selection for Document Categorization on Small Datasets
Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...
Emmanuel Anguiano-Hernández, Luis Villaseñ...
EMNLP
2010
15 years 2 months ago
Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid
Discovering and summarizing opinions from online reviews is an important and challenging task. A commonly-adopted framework generates structured review summaries with aspects and ...
Wayne Xin Zhao, Jing Jiang, Hongfei Yan, Xiaoming ...
EMNLP
2010
15 years 2 months ago
Predicting the Semantic Compositionality of Prefix Verbs
In many applications, replacing a complex word form by its stem can reduce sparsity, revealing connections in the data that would not otherwise be apparent. In this paper, we focu...
Shane Bergsma, Aditya Bhargava, Hua He, Grzegorz K...
EMNLP
2010
15 years 2 months ago
Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models
We describe a new scalable algorithm for semi-supervised training of conditional random fields (CRF) and its application to part-of-speech (POS) tagging. The algorithm uses a simil...
Amarnag Subramanya, Slav Petrov, Fernando Pereira
EMNLP
2010
15 years 2 months ago
Latent-Descriptor Clustering for Unsupervised POS Induction
We present a novel approach to distributional-only, fully unsupervised, POS tagging, based on an adaptation of the EM algorithm for the estimation of a Gaussian mixture. In this ap...
Michael Lamar, Yariv Maron, Elie Bienenstock