We present the results of a community detection analysis of the Wikipedia graph. Distinct communities in Wikipedia contain semantically closely related articles. The central topic of a community can be identified using PageRank. Extracted communities can be organized hierarchically, similar to the manually created Wikipedia category structure.

Categories and Subject Descriptors
I.2.4 [Knowledge Representation]: Semantic Networks; H.3.3 [Information Search and Retrieval]: Clustering

General Terms
Experimentation

Keywords
Graph analysis, community detection, Wikipedia
Dmitry Lizorkin, Olena Medelyan, Maria P. Grineva
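
The PageRank step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the damping factor of 0.85, the fixed iteration count, and the toy link graph are all assumptions made for illustration. A community is assumed to be given as a dictionary mapping each article to the articles it links to within the same community, and the article with the highest score is taken as the central topic.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {node: [linked nodes]}.

    Every link target is assumed to also appear as a key of `graph`.
    """
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Rank held by articles with no outgoing links is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not graph[v])
        new_rank = {v: (1.0 - damping) / n + damping * dangling / n
                    for v in nodes}
        # Each article passes an equal share of its rank along every link.
        for v in nodes:
            for target in graph[v]:
                new_rank[target] += damping * rank[v] / len(graph[v])
        rank = new_rank
    return rank


def central_topic(community):
    """Return the article with the highest PageRank inside one community."""
    scores = pagerank(community)
    return max(scores, key=scores.get)


# Hypothetical toy community (not data from the paper).
community = {
    "Graph theory": ["Vertex (graph theory)"],
    "Vertex (graph theory)": ["Graph theory"],
    "Edge (graph theory)": ["Graph theory", "Vertex (graph theory)"],
    "Planar graph": ["Graph theory"],
    "Bipartite graph": ["Graph theory", "Vertex (graph theory)"],
}
print(central_topic(community))
```

In this toy graph most articles link to "Graph theory", so it receives the highest score and is reported as the central topic. Restricting the computation to links inside the community is a design choice of the sketch; the abstract does not say whether the scores are computed per community or over the whole graph.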