Extracting Spatial Knowledge from the Web

15 years 12 months ago

Download mccurley.org

The content of the world-wide web is pervaded by information of a geographical or spatial nature, particularly such location information as addresses, postal codes, and telephone numbers. We present a system for extracting spatial knowledge from collections of web pages gathered by web-crawling programs. For each page determined to contain location information, we apply geocoding techniques to compute geographic coordinates, such as latitude-longitude pairs. Next, we augment the location information with keyword descriptors extracted from the web page contents. We then apply spatial data mining techniques on the augmented location information to derive spatial knowledge. The techniques make use of so-called shared neighbor information to produce clusters of web pages organized around a common set of concepts. KEYWORDS Web Data Mining, Geographic Information System (GIS), Information Extraction, Geocoding, Geoparsing, Crawl, Clustering, Labeling, Keyword Extraction, Dimension Reduction...

Yasuhiko Morimoto, Masaki Aono, Michael E. Houle,

Real-time Traffic

Internet Technology | Location Information | SAINT 2003 | Spatial Knowledge | Web Pages |

claim paper

» Extended Link Analysis for Extracting Spatial Information Hubs

» Landmark Extraction A Web Mining Approach

» A Knowledge Base for the maintenance of knowledge extracted from web data

» Extracting spatial association rules from spatial transactions

» WebSets extracting sets of entities from the web using unsupervised information extraction

» Learning to Extract Symbolic Knowledge from the World Wide Web

» Incorporating sitelevel knowledge to extract structured data from web forums

» Cultural Heritage Knowledge Extraction from Web Documents

Post Info
More Details (n/a)

Added	05 Jul 2010
Updated	05 Jul 2010
Type	Conference
Year	2003
Where	SAINT
Authors	Yasuhiko Morimoto, Masaki Aono, Michael E. Houle, Kevin S. McCurley

Comments (0)

Sciweavers

Extracting Spatial Knowledge from the Web

Internet Technology | Location Information | SAINT 2003 | Spatial Knowledge | Web Pages |

Explore & Download

Productivity Tools

Sciweavers