FLAIRS
2004

Towards a Universal Web Wrapper

Download www.cl.cam.ac.uk

The wealth of information on the World Wide Web has created much interest in systems that integrate information from multiple sites. We describe a universal wrapper machine that can learn to extract information from the web given only a set of general rules describing the data domain. It cleanly separates site-independent and site-specific knowledge from the execution implementation. Site-independent knowledge is expressed in user-supplied domain rules, while site-specific knowledge is expressed in automatically generated context-free grammars that describe site structures. The two are combined by using the domain rules to semantically interpret the parse trees generated by the grammars. The resulting declarative wrapper specifications are easily understood by humans and can be executed to perform information extraction. Once extracted, tuples can be queried by external agents using a high-level agent communication language.
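
The division of labour described in the abstract can be illustrated with a toy sketch. This is not the authors' system: the grammar here is written by hand rather than learned, and the rule names, the page layout, the fallback label "title", and every function name are assumptions made up for illustration. It only shows the general idea of a site-specific grammar producing a parse tree that site-independent domain rules then interpret into tuples.

```python
import re

# Hypothetical domain rules (site-independent): each field name maps to a
# pattern that recognises values of that field. Illustrative only.
DOMAIN_RULES = {
    "price": re.compile(r"^\$\d+(\.\d{2})?$"),
    "year": re.compile(r"^(19|20)\d{2}$"),
}

# Hypothetical site grammar (site-specific), hand-written for this sketch:
#   page   -> record*
#   record -> "<li>" field ("|" field)* "</li>"
#   field  -> TEXT


def tokenize(html):
    """Split the page into tag, separator, and text tokens."""
    tokens = re.findall(r"</?li>|\||[^<|]+", html)
    return [t.strip() for t in tokens if t.strip()]


def parse_record(tokens, pos):
    """Parse one record starting at pos; return (subtree, next position)."""
    if pos >= len(tokens) or tokens[pos] != "<li>":
        raise ValueError("expected <li> at token %d" % pos)
    pos += 1
    fields = []
    while pos < len(tokens) and tokens[pos] != "</li>":
        if tokens[pos] != "|":
            fields.append(("field", tokens[pos]))
        pos += 1
    if pos >= len(tokens):
        raise ValueError("unterminated record")
    return ("record", fields), pos + 1


def parse_page(tokens):
    """Recursive-descent parse of page -> record*; returns a parse tree."""
    records, pos = [], 0
    while pos < len(tokens):
        record, pos = parse_record(tokens, pos)
        records.append(record)
    return ("page", records)


def interpret(tree):
    """Apply the domain rules to each field node to build one tuple per record."""
    tuples = []
    for _, fields in tree[1]:
        row = {}
        for _, text in fields:
            # Assumption: a field matching no rule is treated as the title.
            label = next(
                (name for name, pat in DOMAIN_RULES.items() if pat.match(text)),
                "title",
            )
            row[label] = text
        tuples.append(row)
    return tuples


def extract(html):
    """Run the whole pipeline: tokens -> parse tree -> tuples."""
    return interpret(parse_page(tokenize(html)))


if __name__ == "__main__":
    page = "<li>Used Volvo | 1998 | $2500.00</li><li>Old Saab | 2001 | $1800</li>"
    for row in extract(page):
        print(row)
```

In this sketch, supporting a second site would mean swapping only the grammar (the tokenizer and parser), while `DOMAIN_RULES` and `interpret` stay unchanged, which mirrors the separation the abstract describes.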

Theodore W. Hong, Keith L. Clark

Artificial Intelligence | Automatically-generated Context-free Grammars | Domain Rules | FLAIRS 2004 | Site-specific Knowledge

Related Content

» RoadRunner: Towards Automatic Data Extraction from Large Web Sites

» COMMIX: towards effective web information extraction, integration and query answering

» Constructing XML-Speaking Wrappers for WEB Applications: Towards an Interoperating WEB

» Towards Knowledge Acquisition from Semi-Structured Content

» Towards Sophisticated Wrapping of Web-based Information Repositories

» Towards Evolving Web Sites into Grid Services Environment

» Pollock: automatic generation of virtual web services from web sites

» Web image mining towards universal age estimator

» Students' Acceptance of Web 2.0 Technologies in Higher Education: Findings from a Survey in a...

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	FLAIRS
Authors	Theodore W. Hong, Keith L. Clark
