Sciweavers

211 search results - page 20 / 43
» Effective Web data extraction with standard XML technologies
Sort
View
WWW
2007
ACM
14 years 8 months ago
Extraction and classification of dense communities in the web
The World Wide Web (WWW) is rapidly becoming important for society as a medium for sharing data, information and services, and there is a growing interest in tools for understandi...
Yon Dourisboure, Filippo Geraci, Marco Pellegrini
VLDB
2002
ACM
91views Database» more  VLDB 2002»
13 years 7 months ago
DTD-Directed Publishing with Attribute Translation Grammars
We present a framework for publishing relational data in XML with respect to a fixed DTD. In data exchange on the Web, XML views of relational data are typically required to confo...
Michael Benedikt, Chee Yong Chan, Wenfei Fan, Raje...
WWW
2010
ACM
14 years 2 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
ICWS
2009
IEEE
14 years 4 months ago
An Approach to Composing Web Services with Context Heterogeneity
The potential benefits of Web services composition heavily rely on semantic interoperability, i.e., the ability to exchange data meaningfully amongst Web services. Context heterog...
Xitong Li, Stuart E. Madnick, Hongwei Zhu 0002, Yu...
WWW
2007
ACM
14 years 8 months ago
Mapping-driven XML transformation
Clio is an existing schema-mapping tool that provides user-friendly means to manage and facilitate the complex task of transformation and integration of heterogeneous data such as...
Haifeng Jiang, Howard Ho, Lucian Popa, Wook-Shin H...