Sciweavers

4234 search results - page 21 / 847
» A Method for Web Information Extraction
Sort
View
IJCAI
2003
13 years 9 months ago
Integrating Information to Bootstrap Information Extraction from Web Sites
In this paper we propose a methodology to learn to extract domain-specific information from large repositories (e.g. the Web) with minimum user intervention. Learning is seeded b...
Fabio Ciravegna, Alexiei Dingli, David Guthrie, Yo...
ICDAR
2007
IEEE
13 years 11 months ago
WEB Image Classification Based on the Fusion of Image and Text Classifiers
This paper presents a novel method for the classification of images that combines information extracted from the images and contextual information. The main hypothesis is that con...
Pedro R. Kalva, Fabrício Enembreck, Alessan...
CIKM
2010
Springer
13 years 6 months ago
Automatic metadata extraction from multilingual enterprise content
Enterprises provide professionally authored content about their products/services in different languages for use in web sites and customer care. For customer care, personalization...
Melike Sah, Vincent Wade
WWW
2006
ACM
14 years 8 months ago
Robust web content extraction
We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...
Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...
WWW
2010
ACM
14 years 2 months ago
Entity relation discovery from web tables and links
The World-Wide Web consists not only of a huge number of unstructured texts, but also a vast amount of valuable structured data. Web tables [2] are a typical type of structured in...
Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han,...