
SPIRE
2005
Springer

Counting Lumps in Word Space: Density as a Measure of Corpus Homogeneity

This paper introduces a measure of corpus homogeneity that indicates the amount of topical dispersion in a corpus. The measure is based on the density of neighborhoods in semantic word spaces. We evaluate the measure by comparing the results for five different corpora. Our initial results indicate that the proposed density measure can indeed identify differences in topical dispersion.
Magnus Sahlgren, Jussi Karlgren
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where SPIRE