Sciweavers

PVLDB
2008
13 years 10 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
DEXA
1998
Springer
14 years 3 months ago
Semantic Based Schema Analysis
Semantic similarity between schema elements is greatly influenced by the context in which the elements are defined and compared. This paper emphasizes the role of context in e...
Nayyer Masood, Barry Eaglestone
BTW
2007
Springer
14 years 5 months ago
Instance Matching with COMA++
Schema matching is the process of identifying semantic correspondences between schemas. COMA++ is a matching prototype which uses several characteristics of schemas to determine ...
Daniel Engmann, Sabine Maßmann
SIGMOD
2007
ACM
14 years 11 months ago
Indexing dataspaces
Dataspaces are collections of heterogeneous and partially unstructured data. Unlike data-integration systems that also offer uniform access to heterogeneous data sources, dataspac...
Xin Dong, Alon Y. Halevy