Sciweavers

AAAI
2008

Semi-Supervised Learning for Blog Classification

14 years 1 months ago
Semi-Supervised Learning for Blog Classification
Blog classification (e.g., identifying bloggers' gender or age) is one of the most interesting current problems in blog analysis. Although this problem is usually solved by applying supervised learning techniques, the large labeled dataset required for training is not always available. In contrast, unlabeled blogs can easily be collected from the web. Therefore, a semi-supervised learning method for blog classification, effectively using unlabeled data, is proposed. In this method, entries from the same blog are assumed to have the same characteristics. With this assumption, the proposed method captures the characteristics of each blog, such as writing style and topic, and uses these characteristics to improve the classification accuracy.
Daisuke Ikeda, Hiroya Takamura, Manabu Okumura
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2008
Where AAAI
Authors Daisuke Ikeda, Hiroya Takamura, Manabu Okumura
Comments (0)