Pre-training of Hidden-Unit CRFs

In this paper, we apply the concept of pre-training to hidden-unit conditional random fields (HUCRFs) to enable learning on unlabeled data. We present a simple yet effective pre-training technique that learns to associate words with their clusters, which are obtained in an unsupervised manner. The learned parameters are then used to initialize the supervised learning process. We also propose a word clustering technique based on canonical correlation analysis (CCA) that is sensitive to multiple word senses, to further improve the accuracy within the proposed framework. We report consistent gains over standard conditional random fields (CRFs) and HUCRFs without pre-training in semantic tagging, named entity recognition (NER), and part-of-speech (POS) tagging tasks, which could indicate the task-independent nature of the proposed technique.
Type Conference
Year 2015
Where ACL
Authors Young-Bum Kim, Karl Stratos, Ruhi Sarikaya