Sciweavers

1042 search results - page 51 / 209
» Logic-based Web Information Extraction
Sort
View
WWW
2010
ACM
14 years 1 months ago
Web-scale knowledge extraction from semi-structured tables
A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...
Eric Crestan, Patrick Pantel
WWW
2008
ACM
14 years 9 months ago
Web page sectioning using regex-based template
This work aims to provide a novel, site-specific web page segmentation and section importance detection algorithm, which leverages structural, content, and visual information. The...
Rupesh R. Mehta, Amit Madaan
CIKM
2008
Springer
13 years 10 months ago
Coreex: content extraction from online news articles
We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...
Jyotika Prasad, Andreas Paepcke
AC
2006
Springer
13 years 8 months ago
Web Testing for Reliability Improvement
In this chapter, we characterize problems for web applications, examine existing testing techniques that are potentially applicable to the web environment, and introduce a strateg...
Jeff Tian, Li Ma
ICDAR
2007
IEEE
14 years 17 days ago
WEB Image Classification Based on the Fusion of Image and Text Classifiers
This paper presents a novel method for the classification of images that combines information extracted from the images and contextual information. The main hypothesis is that con...
Pedro R. Kalva, Fabrício Enembreck, Alessan...