Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

208

Voted

DEXAW
2008
IEEE

373views Database» more DEXAW 2008»

Topic Detection by Clustering Keywords

16 years 1 months ago

Topic Detection by Clustering Keywords

Download www.uni-weimar.de

We consider topic detection without any prior knowledge of category structure or possible categories. Keywords are extracted and clustered based on different similarity measures using the induced k-bisecting clustering algorithm. Evaluation on Wikipedia articles shows that clusters of keywords correlate strongly with the Wikipedia categories of the articles. In addition, we ﬁnd that a distance measure based on the Jensen-Shannon divergence of probability distributions outperforms the cosine similarity. In particular, a newly proposed term distribution taking co-occurrence of terms into account gives best results.

Christian Wartena, Rogier Brussee

Real-time Traffic

Cosine Similarity | Database | DEXAW 2008 | Distribution Taking Co-occurrence | K-bisecting Clustering Algorithm |

claim paper

Related Content

» Nearduplicate keyframe retrieval with visual keywords and semantic context

» Conversation clusters grouping conversation topics through humancomputer dialog

» Topic Tracking Based on Keywords Dependency Profile

» Automatic online news issue construction in web environment

» Crosslanguage linking of news stories on the web using interlingual topic modelling

» Clustering of Search Engine Keywords Using Access Logs

» An investigation of linguistic features and clustering algorithms for topical document clu...

» ClusteringBased Searching and Navigation in an Online News Source

» Exploring supervised and unsupervised methods to detect topics in biomedical text

Post Info
More Details (n/a)

Added	29 May 2010
Updated	29 May 2010
Type	Conference
Year	2008
Where	DEXAW
Authors	Christian Wartena, Rogier Brussee

Comments (0)