Sciweavers

910 search results - page 103 / 182
» Testbed for information extraction from deep web
WEBI
2001
Springer
14 years 18 days ago
World Wide Web - A Multilingual Language Resource
This paper argues that the World Wide Web could be regarded not only as an information resource but also as a dynamic, multilingual, least controlled, easy to access and ...
Fang Li, Huanye Sheng, Wilhelm Weisweber
ICMLA
2004
13 years 9 months ago
LASSO: a learning architecture for semantic web ontologies
Expressing web page content in a way that computers can understand is the key to a semantic web. Generating ontological information from the web automatically using machine learni...
Christopher N. Hammack, Stephen D. Scott
WWW
2009
ACM
14 years 8 months ago
Sitemaps: above and beyond the crawl of duty
Comprehensive coverage of the public web is crucial to web search engines. Search engines use crawlers to retrieve pages and then discover new ones by extracting the pages' o...
Uri Schonfeld, Narayanan Shivakumar
PVLDB
2010
13 years 6 months ago
SXPath - Extending XPath towards Spatial Querying on Web Documents
Querying data from presentation formats like HTML, for purposes such as information extraction, requires the consideration of tree structures as well as the consideration of spati...
Ermelinda Oro, Massimo Ruffolo, Steffen Staab
WWW
2006
ACM
14 years 8 months ago
CWS: a comparative web search system
In this paper, we define and study a novel search problem: Comparative Web Search (CWS). The task of CWS is to seek relevant and comparative information from the Web to help users...
Jian-Tao Sun, Xuanhui Wang, Dou Shen, Hua-Jun Zeng...