Sciweavers

21 search results - page 3 / 5
» Unsupervised information extraction from unstructured, ungra...
Sort
View
WWW
2010
ACM
14 years 1 months ago
Entity relation discovery from web tables and links
The World-Wide Web consists not only of a huge number of unstructured texts, but also a vast amount of valuable structured data. Web tables [2] are a typical type of structured in...
Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han,...
IDEAL
2010
Springer
13 years 4 months ago
An Efficient Approach to Clustering Real-Estate Listings
World Wide Web (WWW) is a vast source of information, the problem of information overload is more acute than ever. Due to noise in WWW, it is becoming hard to find usable informati...
Maciej Grzenda, Deepak Thukral
WEBDB
2005
Springer
100views Database» more  WEBDB 2005»
14 years 1 days ago
Malleable Schemas: A Preliminary Report
Large-scale information integration, and in particular, search on the World Wide Web, is pushing the limits on the combination of structured data and unstructured data. By its ver...
Xin Dong, Alon Y. Halevy
WSE
2002
IEEE
13 years 11 months ago
Dynamic Model Extraction and Statistical Analysis of Web Applications
The World Wide Web, initially intended as a way to publish static hypertexts on the Internet, is moving toward complex applications. Static Web sites are being gradually replaced ...
Paolo Tonella, Filippo Ricca
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
13 years 10 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant