Collective Latent Dirichlet Allocation



In this paper, we propose a new variant of Latent Dirichlet Allocation (LDA), Collective LDA (C-LDA), for modeling multiple corpora. C-LDA combines multiple corpora during learning so that it can transfer knowledge from one corpus to another; at the same time, it keeps a discriminative node representing the corpus ID to constrain the topics learned in each corpus. Compared with LDA applied locally to the target corpus, C-LDA yields refined topic-word distributions; compared with LDA applied globally and straightforwardly to the combined corpus, C-LDA keeps each topic specific to one corpus. Experiments on several benchmark document data sets demonstrate that these advantages give C-LDA improved performance.
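The abstract does not spell out the generative process, so the following is only a minimal sketch of one possible reading of the constraint that each topic belongs to a single corpus while all corpora share one vocabulary. The function names (`topic_ranges`, `generate_document`), the symmetric Dirichlet prior `alpha`, and the partitioning of topic indices by corpus ID are all assumptions made for illustration; they are not taken from the paper and should not be read as the authors' model.

```python
import random

def topic_ranges(topics_per_corpus):
    """Map each corpus ID to the global topic indices it owns (hypothetical scheme)."""
    owned, start = {}, 0
    for corpus_id, k in enumerate(topics_per_corpus):
        owned[corpus_id] = list(range(start, start + k))
        start += k
    return owned

def generate_document(corpus_id, owned, topic_word, n_words, alpha, rng):
    """Sample (topic, word) pairs for one document of the given corpus.

    Assumption: the corpus ID restricts the document to the topics owned by
    its corpus, while every topic is a distribution over the shared vocabulary.
    """
    topics = owned[corpus_id]
    # Symmetric Dirichlet draw over the corpus's own topics, via normalized gammas.
    weights = [rng.gammavariate(alpha, 1.0) for _ in topics]
    total = sum(weights)
    theta = [w / total for w in weights]

    vocab = range(len(topic_word[0]))
    doc = []
    for _ in range(n_words):
        z = rng.choices(topics, weights=theta)[0]        # topic limited by corpus ID
        w = rng.choices(vocab, weights=topic_word[z])[0]  # word from shared vocabulary
        doc.append((z, w))
    return doc

if __name__ == "__main__":
    rng = random.Random(0)
    owned = topic_ranges([2, 3])                    # corpus 0 owns 2 topics, corpus 1 owns 3
    topic_word = [[0.25] * 4 for _ in range(5)]     # 5 topics over a 4-word vocabulary
    doc = generate_document(1, owned, topic_word, 10, 1.0, rng)
    print(sorted({z for z, _ in doc}))
```

In this sketch, a document from corpus 1 can only ever be assigned topics 2, 3, or 4, which mirrors the "each topic only for one corpus" property, while the shared vocabulary is what would let word statistics from one corpus inform another during learning.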

Zhiyong Shen, Jun Sun, Yi-Dong Shen


Corpus Id | Data Mining | ICDM 2008 | Multiple Corpora | Multiple Corpora Modeling


Post Info

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICDM
Authors	Zhiyong Shen, Jun Sun, Yi-Dong Shen
