Sciweavers

» Exploiting content redundancy for web information extraction
WWW
2011
ACM
13 years 2 months ago
Web information extraction using Markov logic networks
In this paper, we consider the problem of extracting structured data from web pages taking into account both the content of individual attributes as well as the structure of pages...
Sandeepkumar Satpal, Sahely Bhadra, Sundararajan S...
ICCSA
2005
Springer
14 years 1 months ago
Semantic Web Enabled Information Systems: Personalized Views on Web Data
In this paper, a methodology and a framework for personalized views on data available on the World Wide Web are proposed. We describe its two main ingredients, Web data ex...
Robert Baumgartner, Christian Enzi, Nicola Henze, ...
SAINT
2003
IEEE
14 years 25 days ago
Extracting Spatial Knowledge from the Web
The content of the world-wide web is pervaded by information of a geographical or spatial nature, particularly such location information as addresses, postal codes, and telephone ...
Yasuhiko Morimoto, Masaki Aono, Michael E. Houle, ...
ICDE
2004
IEEE
14 years 9 months ago
Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web
In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...
James Caverlee, Ling Liu, David Buttler
ICANN
2005
Springer
14 years 1 months ago
Content-Based Retrieval of Web Pages and Other Hierarchical Objects with Self-organizing Maps
We propose a content-based information retrieval (CBIR) method that models known relationships between multimedia objects as a hierarchical tree-structure incorporating additional ...
Mats Sjöberg, Jorma Laaksonen