Sciweavers

ACL
2008

Text Segmentation with LDA-Based Fisher Kernel

14 years 1 months ago
Text Segmentation with LDA-Based Fisher Kernel
In this paper we propose a domainindependent text segmentation method, which consists of three components. Latent Dirichlet allocation (LDA) is employed to compute words semantic distribution, and we measure semantic similarity by the Fisher kernel. Finally global best segmentation is achieved by dynamic programming. Experiments on Chinese data sets with the technique show it can be effective. Introducing latent semantic information, our algorithm is robust on irregular-sized segments.
Qi Sun, Runxin Li, Dingsheng Luo, Xihong Wu
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where ACL
Authors Qi Sun, Runxin Li, Dingsheng Luo, Xihong Wu
Comments (0)