Sciweavers

152 search results - page 17 / 31
» Redundancy-Driven Web Data Extraction and Integration
Sort
View
IV
2008
IEEE
133views Visualization» more  IV 2008»
14 years 1 months ago
Mining Scholarly Semantic Networks from the Web
With the increased usage of the Web and its availability of data, various scholarly information is now available on the Web. Extraction, aggregation, and visualization of such inf...
Mizuki Oka, Yutaka Matsuo
VLDB
2007
ACM
134views Database» more  VLDB 2007»
14 years 1 months ago
Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach
Structured community portals extract and integrate information from raw Web pages to present a unified view of entities and relationships in the community. In this paper we argue...
Pedro DeRose, Warren Shen, Fei Chen 0002, AnHai Do...
VLDB
2004
ACM
126views Database» more  VLDB 2004»
14 years 25 days ago
Instance-based Schema Matching for Web Databases by Domain-specific Query Probing
In a Web database that dynamically provides information in response to user queries, two distinct schemas, interface schema (the schema users can query) and result schema (the sch...
Jiying Wang, Ji-Rong Wen, Frederick H. Lochovsky, ...
WIRI
2005
IEEE
14 years 1 months ago
A Fast Linkage Detection Scheme for Multi-Source Information Integration
Record linkage refers to techniques for identifying records associated with the same real-world entities. Record linkage is not only crucial in integrating multi-source databases ...
Akiko N. Aizawa, Keizo Oyama
EDBT
2009
ACM
123views Database» more  EDBT 2009»
14 years 2 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...