Sciweavers

» Effective Web data extraction with standard XML technologies
WWW
2004
ACM
14 years 8 months ago
Testbed for information extraction from deep web
Search results generated by searchable databases are served dynamically and far larger than the static documents on the Web. These results pages have been referred to as the Deep ...
Yasuhiro Yamada, Nick Craswell, Tetsuya Nakatoh, S...
EON
2008
13 years 9 months ago
Data and Process Mediation Support for B2B Integration
In this paper we present how Semantic Web Service technology can be used to overcome process and data heterogeneity in a B2B integration scenario. While one partner uses s...
Maciej Zaremba, Maximilian Herold, Raluca Zaharia,...
WWW
2004
ACM
14 years 8 months ago
Automatic web news extraction using tree edit distance
The Web poses itself as the largest data repository ever available in the history of humankind. Major efforts have been made in order to provide efficient access to relevant infor...
Davi de Castro Reis, Paulo Braz Golgher, Altigran ...
SEMWEB
2001
Springer
13 years 12 months ago
On the Integration of Topic Maps and RDF data
Topic Maps and RDF are two independently developed paradigms and standards for the representation, interchange, and exploitation of model-based data on the web. Each para...
Martin S. Lacher, Stefan Decker
WWW
2010
ACM
13 years 7 months ago
Exploiting content redundancy for web information extraction
We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...
Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...