Sciweavers

684 search results - page 18 / 137
» Extracting semantic structure of web documents using content...
Sort
View
MM
2004
ACM
195views Multimedia» more  MM 2004»
14 years 2 months ago
Hierarchical clustering of WWW image search results using visual, textual and link information
We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics. Organizing the resu...
Deng Cai, Xiaofei He, Zhiwei Li, Wei-Ying Ma, Ji-R...
VLDB
2001
ACM
83views Database» more  VLDB 2001»
14 years 1 months ago
Visual Web Information Extraction with Lixto
We present new techniques for supervised wrapper generation and automated web information extraction, and a system called Lixto implementing these techniques. Our system can gener...
Robert Baumgartner, Sergio Flesca, Georg Gottlob
CN
1999
115views more  CN 1999»
13 years 8 months ago
XML-GL: A Graphical Language for Querying and Restructuring XML Documents
The growing acceptance of XML as a standard for semi-structured documents on the Web opens up challenging opportunities for Web query languages. In this paper we introduce XML-GL,...
Stefano Ceri, Sara Comai, Ernesto Damiani, Piero F...
ESWS
2007
Springer
14 years 2 months ago
What Have Innsbruck and Leipzig in Common? Extracting Semantics from Wiki Content
Wikis are established means for the collaborative authoring, versioning and publishing of textual articles. The Wikipedia project, for example, succeeded in creating the by far lar...
Sören Auer, Jens Lehmann
WWW
2009
ACM
14 years 1 months ago
Extracting data records from the web using tag path clustering
Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...
Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...