We propose a new method for detecting errors in “gold-standard” part-ofspeech annotation. The approach locates errors with high precision based on n-grams occurring in the cor...
We show that categories induced by unsupervised word clustering can surpass the performance of gold part-of-speech tags in dependency grammar induction. Unlike classic clustering ...
Valentin I. Spitkovsky, Hiyan Alshawi, Angel X. Ch...
Arabic Language understanding (ALU) computing is considered an AI-hard task. In this paper, we propose an Agent model for ALU problem. This agent is detailed in this paper. An ALU...
This paper presents the Part Of Speech tagger and Chunker for Tamil using Machine learning techniques. Part Of Speech tagging and chunking are the fundamental processing steps for...
V. Dhanalakshmi, P. Padmavathy, M. Anand Kumar, K....
We address the problem of part-of-speech tagging for English data from the popular microblogging service Twitter. We develop a tagset, annotate data, develop features, and report ...
Kevin Gimpel, Nathan Schneider, Brendan O'Connor, ...