Sciweavers

1541 search results - page 40 / 309
» Extracting Web Data Using Instance-Based Learning
GI
2001
Springer
14 years 1 month ago
Transaction Synchronization for XML Data in Client-Server Web Applications
Whenever database-centered client-server web applications have to be used by multiple web clients on different platforms, then recently XML has been considered as an important da...
Stefan Böttcher, Adelhard Türling
SEMWEB
2005
Springer
14 years 2 months ago
Rapid Benchmarking for Semantic Web Knowledge Base Systems
We present a method for rapid development of benchmarks for Semantic Web knowledge base systems. At the core, we have a synthetic data generation approach for OWL that is...
Sui-Yu Wang, Yuanbo Guo, Abir Qasem, Jeff Heflin
AIRWEB
2007
Springer
14 years 3 months ago
Extracting Link Spam using Biased Random Walks from Spam Seed Sets
Link spam deliberately manipulates hyperlinks between web pages in order to unduly boost the search engine ranking of one or more target pages. Link based ranking algorithms such ...
Baoning Wu, Kumar Chellapilla
VLDB
2004
ACM
14 years 2 months ago
An Automatic Data Grabber for Large Web Sites
We demonstrate a system to automatically grab data from data intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
IJCAI
2003
13 years 10 months ago
Trainability: Developing a responsive learning system
In this paper, we describe the lessons we learned in developing AgentBuilder, a commercial system for rapidly creating agents that extract information from web sites. AgentBuilder...
Steven Minton, Sorinel I. Ticrea, Jennifer Beach