Sciweavers

» On the Automatic Extraction of Data from the Hidden Web
AAAI
2008
13 years 11 months ago
Automatic Extraction of Data Points and Text Blocks from 2-Dimensional Plots in Digital Documents
Two-dimensional (2-D) plots in digital documents on the web are an important source of information that is largely under-utilized. In this paper, we outline how data and text can ...
Saurabh Kataria, William Browuer, Prasenjit Mitra,...
ICDE
2007
IEEE
14 years 10 months ago
Organizing Hidden-Web Databases by Clustering Visible Web Documents
In this paper we address the problem of organizing hidden-Web databases. Given a heterogeneous set of Web forms that serve as entry points to hidden-Web databases, our goal is to ...
Luciano Barbosa, Juliana Freire, Altigran Soares d...
ICIP
2004
IEEE
14 years 10 months ago
Automatically Learning Structural Units in Educational Videos with the Hierarchical Hidden Markov Models
In this paper we present a coherent approach using the hierarchical HMM with shared structures to extract the structural units that form the building blocks of an education/traini...
Dinh Q. Phung, Svetha Venkatesh, Hung Hai Bui
ICDE
2006
IEEE
14 years 10 months ago
Automatic Sales Lead Generation from Web Data
Speed to market is critical to companies that are driven by sales in a competitive market. The earlier a potential customer can be approached in the decision-making process of a p...
Ganesh Ramakrishnan, Sachindra Joshi, Sumit Negi, ...
AAAI
2006
13 years 10 months ago
Automatic Wrapper Generation Using Tree Matching and Partial Tree Alignment
This paper is concerned with the problem of structured data extraction from Web pages. The objective of the research is to automatically segment data records in a page, extract da...
Yanhong Zhai, Bing Liu