The Performance Analysis of a Chi-square Similarity Measure for Topic Related Clustering of Noisy Transcripts

This paper presents a novel Chi-square similarity measure and assesses its performance against well-known similarity measures such as Cosine, Dice, and Jaccard. The Chi-square similarity measure is designed to withstand the imperfections of transcribed spoken documents. It differs from the other measures in that, in addition to searching for co-occurring words in documents, it also matches the informative closeness of the common words. We assume that co-occurring words employed to convey the same information should have compatible significance in the documents being matched, and we test this assumption with the Chi-square method. Experimental results on an archive of transcribed news broadcasts demonstrate the high efficacy of the proposed methodology.
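As a rough illustration of the measures named in the abstract, the sketch below implements the standard set-based Jaccard and Dice coefficients and the term-frequency Cosine similarity. The abstract does not give the formula of the proposed Chi-square similarity measure, so the last function is only an assumption made for illustration: a plain Pearson chi-square statistic computed over the words two documents share, where a smaller value suggests that the common words carry comparable weight in both documents. All function names and the whitespace tokenizer are hypothetical and not taken from the paper.

```python
import math
from collections import Counter


def tokenize(text):
    """Naive whitespace tokenizer (an assumption; the paper's preprocessing is not given)."""
    return text.lower().split()


def jaccard(a, b):
    """Jaccard coefficient: |A ∩ B| / |A ∪ B| over the sets of distinct words."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def dice(a, b):
    """Dice coefficient: 2 |A ∩ B| / (|A| + |B|) over the sets of distinct words."""
    sa, sb = set(a), set(b)
    total = len(sa) + len(sb)
    return 2 * len(sa & sb) / total if total else 0.0


def cosine(a, b):
    """Cosine similarity between raw term-frequency vectors."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[w] * cb[w] for w in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def chi_square_on_common(a, b):
    """Pearson chi-square statistic on a 2 x k table of shared-word counts.

    NOT the paper's measure; an illustrative stand-in. Rows are the two
    documents, columns are the words they have in common. Returns None
    when the documents share no words.
    """
    ca, cb = Counter(a), Counter(b)
    common = sorted(ca.keys() & cb.keys())
    if not common:
        return None
    rows = [[ca[w] for w in common], [cb[w] for w in common]]
    row_totals = [sum(r) for r in rows]
    col_totals = [rows[0][j] + rows[1][j] for j in range(len(common))]
    grand_total = sum(row_totals)
    stat = 0.0
    for i in range(2):
        for j in range(len(common)):
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (rows[i][j] - expected) ** 2 / expected
    return stat


if __name__ == "__main__":
    d1 = tokenize("the market fell as oil prices rose")
    d2 = tokenize("oil prices rose while the market rallied")
    print("jaccard:", jaccard(d1, d2))
    print("dice:", dice(d1, d2))
    print("cosine:", cosine(d1, d2))
    print("chi-square on common words:", chi_square_on_common(d1, d2))
```

Jaccard and Dice look only at which words co-occur, while Cosine also weighs how often they occur. The chi-square stand-in compares only the frequency profile of the shared words, which is one plausible reading of "compatible significance" but should not be taken as the authors' definition.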

Oktay Ibrahimov, Ishwar K. Sethi, Nevenka Dimitrova

Chi-Square Method | Chi-square Similarity Measure | Computer Vision | ICPR 2002 | Well-known Similarity Measures

Added	09 Nov 2009
Updated	09 Nov 2009
Type	Conference
Year	2002
Where	ICPR
Authors	Oktay Ibrahimov, Ishwar K. Sethi, Nevenka Dimitrova
