Learning the Semantic Correlation: An Alternative Way to Gain from Unlabeled Text

15 years 8 months ago

Download www.cs.cmu.edu

In this paper, we address the question of what kind of knowledge is generally transferable from unlabeled text. We suggest and analyze the semantic correlation of words as a generally transferable structure of the language and propose a new method to learn this structure using an appropriately chosen latent variable model. This semantic correlation contains structural information of the language space and can be used to control the joint shrinkage of model parameters for any specific task in the same space through regularization. In an empirical study, we construct 190 different text classification tasks from a real-world benchmark, and the unlabeled documents are a mixture from all these tasks. We test the ability of various algorithms to use the mixed unlabeled text to enhance all classification tasks. Empirical results show that the proposed approach is a reliable and scalable method for semi-supervised learning, regardless of the source of unlabeled data, the specific task to be e...

Yi Zhang 0010, Jeff Schneider, Artur Dubrawski

Real-time Traffic

Classification Tasks | Information Technology | NIPS 2008 | Semantic Correlation | Unlabeled Text |

claim paper

» Mining common topics from multiple asynchronous text streams

» Unsupervised discovery of visual object class hierarchies

» Annotating Relationships Between Multiple MixedMedia Digital Objects by Extending Annotea

» Intelligent Search in a Collection of Video Lectures

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	NIPS
Authors	Yi Zhang 0010, Jeff Schneider, Artur Dubrawski

Comments (0)

Sciweavers

Learning the Semantic Correlation: An Alternative Way to Gain from Unlabeled Text

Classification Tasks | Information Technology | NIPS 2008 | Semantic Correlation | Unlabeled Text |

Explore & Download

Productivity Tools

Sciweavers