In this paper, we describe experiments into the application of term weighting techniques from text retrieval to support the automatic identification of significant locations from a large location log, which we consider to be important for supporting many location-based social network applications. We identify the fact that the distribution of locations follows a similar shaped distribution to that of terms in a language and in so doing motivate our use of term weighting techniques. Using this information we then show that these proven techniques can be used to automatically identify social visits and “pass through” locations, as well as standard home and work locations. We also suggest that it is possible to classify whether an extended segment of personal location data may be a tourist trip, business trip or a typical working (at home) period of time. Keywords : Location; Power-law distribution; GPS; important locations; text retrieval.
Zhengwei Qiu, Cathal Gurrin, Aiden R. Doherty, Ala