Sciweavers

910 search results - page 41 / 182
» Testbed for information extraction from deep web
Sort
View
VLDB
2011
ACM
251views Database» more  VLDB 2011»
13 years 2 months ago
Harvesting relational tables from lists on the web
A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...
Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy
ICSOC
2009
Springer
14 years 2 months ago
Web Service Search on Large Scale
The Web is nowadays moving from a Web of data to a Web of services. In this paper we present our approach for Web Service discovery on Web scale, targeted to support flexible and ...
Nathalie Steinmetz, Holger Lausen, Manuel Brunner
WWW
2005
ACM
14 years 8 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
WWW
2009
ACM
14 years 8 months ago
Estimating web site readability using content extraction
Nowadays, information is primarily searched on the WWW. From a user perspective, the readability is an important criterion for measuring the accessibility and thereby the quality ...
Thomas Gottron, Ludger Martin
LREC
2008
133views Education» more  LREC 2008»
13 years 9 months ago
Automatic Identification of Temporal Information in Tourism Web Pages
This paper presents our work on the detection of temporal information in web pages. The pages examined within the scope of this study were taken from the tourism sector and the te...
Stéphanie Weiser, Philippe Laublet, Jean-Lu...