Sciweavers

2677 search results - page 57 / 536
» Extracting Structured Data from Web Pages
Sort
View
WEBDB
2005
Springer
102views Database» more  WEBDB 2005»
14 years 2 months ago
Design and Implementation of a Geographic Search Engine
In this paper, we describe the design and initial implementation of a geographic search engine prototype for Germany, based on a large crawl of the de domain. Geographic search en...
Alexander Markowetz, Yen-Yu Chen, Torsten Suel, Xi...
NIPS
2003
13 years 10 months ago
Ranking on Data Manifolds
The Google search engine has enjoyed huge success with its web page ranking algorithm, which exploits global, rather than local, hyperlink structure of the web using random walks....
Dengyong Zhou, Jason Weston, Arthur Gretton, Olivi...
GEOINFO
2003
13 years 10 months ago
The Web as a Data Source for Spatial Databases
With the phenomenal growth of the WWW, rich data sources on many different subjects have become available online. Some of these sources store daily facts that often involve textual...
Karla A. V. Borges, Alberto H. F. Laender, Claudia...
DOCENG
2009
ACM
14 years 3 months ago
Web document text and images extraction using DOM analysis and natural language processing
: © Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing Parag Mulendra Joshi, Sam Liu HP Laboratories HPL-2009-187 Web page text extraction,...
Parag Mulendra Joshi, Sam Liu
DEXA
2006
Springer
197views Database» more  DEXA 2006»
13 years 10 months ago
Cleaning Web Pages for Effective Web Content Mining
Classifying and mining noise-free web pages will improve on accuracy of search results as well as search speed, and may benefit webpage organization applications (e.g., keyword-bas...
Jing Li, Christie I. Ezeife