Sciweavers

» On the Automatic Extraction of Data from the Hidden Web
COOPIS
1998
IEEE
14 years 8 days ago
Wrapper Generation for Web Accessible Data Sources
There is an increase in the number of data sources that can be queried across the WWW. Such sources typically support HTML forms-based interfaces and search engines query collecti...
Jean-Robert Gruser, Louiqa Raschid, Maria-Esther V...
MMS
2006
13 years 8 months ago
A probabilistic semantic model for image annotation and multi-modal image retrieval
This paper addresses the automatic image annotation problem and its application to multi-modal image retrieval. The contribution of our work is three-fold. (1) We propose a probabilis...
Ruofei Zhang, Zhongfei (Mark) Zhang, Mingjing Li, ...
KDD
2002
ACM
14 years 9 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
JCDL
2006
ACM
14 years 2 months ago
Automatic extraction of table metadata from digital documents
Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and high...
Ying Liu, Prasenjit Mitra, C. Lee Giles, Kun Bai
DMKD
2000
ACM
14 years 1 month ago
Combining Strategies for Extracting Relations from Text Collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Eleazar Eskin, Luis Gravano