Sciweavers

395 search results - page 19 / 79
» An Automatic Data Grabber for Large Web Sites
Sort
View
WIDM
2003
ACM
14 years 2 months ago
Schema-guided wrapper maintenance for web-data extraction
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. There are two main issues relevant t...
Xiaofeng Meng, Dongdong Hu, Chen Li
NIPS
2003
13 years 10 months ago
Fast Algorithms for Large-State-Space HMMs with Applications to Web Usage Analysis
In applying Hidden Markov Models to the analysis of massive data streams, it is often necessary to use an artificially reduced set of states; this is due in large part to the fac...
Pedro F. Felzenszwalb, Daniel P. Huttenlocher, Jon...
AIRWEB
2006
Springer
14 years 15 days ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
ICWE
2007
Springer
14 years 2 months ago
Integrating Databases, Search Engines and Web Applications: A Model-Driven Approach
This paper addresses conceptual modeling and automatic code generation for search engine integration with data intensive Web applications. We have analyzed the similarities (and di...
Alessandro Bozzon, Tereza Iofciu, Wolfgang Nejdl, ...
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
14 years 2 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...