Sciweavers

265 search results - page 10 / 53
» Learning Logic Wrappers for Information Extraction from the ...
Sort
View
SIGMOD
2006
ACM
107views Database» more  SIGMOD 2006»
14 years 8 months ago
Documentum ECI self-repairing wrappers: performance analysis
Documentum Enterprise Content Integration (ECI) services is a content integration middleware that provides one-query access to the Intranet and Internet content resources. The ECI...
Boris Chidlovskii, Bruno Roustant, Marc Brette
WWW
2009
ACM
14 years 9 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
RIAO
1997
13 years 10 months ago
Towards Sophisticated Wrapping of Web-based information Repositories
Access to on-line information via the Web is exploding. Index and retrieval engines already start to integrate a huge variety of heterogeneous repositories. However, the heterogen...
Boris Chidlovskii, Uwe M. Borghoff, Pierre-Yves Ch...
SIGMOD
2002
ACM
188views Database» more  SIGMOD 2002»
14 years 8 months ago
COMMIX: towards effective web information extraction, integration and query answering
As WWW becomes more and more popular and powerful, how to search information on the web in database way becomes an important research topic. COMMIX, which is developed in the DB g...
Tengjiao Wang, Shiwei Tang, Dongqing Yang, Jun Gao...
WWW
2005
ACM
14 years 9 months ago
Fully automatic wrapper generation for search engines
When a query is submitted to a search engine, the search engine returns a dynamically generated result page containing the result records, each of which usually consists of a link...
Hongkun Zhao, Weiyi Meng, Zonghuan Wu, Vijay Ragha...