Counting Lumps in Word Space: Density as a Measure of Corpus Homogeneity

This paper introduces a measure of corpus homogeneity that indicates the amount of topical dispersion in a corpus. The measure is based on the density of neighborhoods in semantic word spaces. We evaluate the measure by comparing the results for five different corpora. Our initial results indicate that the proposed density measure can indeed identify differences in topical dispersion.
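
The abstract does not spell out how neighborhood density is computed, so the following is only a rough sketch of one possible reading, not the authors' method. It assumes a simple co-occurrence word space, cosine similarity, and a fixed similarity threshold; the function names and the `window` and `threshold` parameters are hypothetical choices made for illustration.

```python
import math
from collections import Counter, defaultdict


def build_word_space(sentences, window=2):
    """Map each word to a sparse co-occurrence vector (assumed representation)."""
    space = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    space[word][tokens[j]] += 1
    return space


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mean_neighborhood_density(space, threshold=0.5):
    """Average count of other words whose similarity exceeds the threshold.

    This is an assumed stand-in for a density measure: a higher value means
    words tend to have more close neighbors in the space.
    """
    words = list(space)
    if not words:
        return 0.0
    counts = []
    for w in words:
        n = sum(1 for o in words
                if o != w and cosine(space[w], space[o]) > threshold)
        counts.append(n)
    return sum(counts) / len(counts)


if __name__ == "__main__":
    toy_corpus = [["a", "b"], ["c", "b"]]
    toy_space = build_word_space(toy_corpus, window=1)
    print(mean_neighborhood_density(toy_space, threshold=0.5))
```

Under these assumptions, one would build a separate word space per corpus and compare the resulting averages; whether higher or lower density corresponds to a more homogeneous corpus is a question the paper itself addresses and is not settled by this sketch.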

Magnus Sahlgren, Jussi Karlgren

Corpus Homogeneity | Density Measure | SPIRE 2005 | Topical Dispersion

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	SPIRE
Authors	Magnus Sahlgren, Jussi Karlgren
