Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

186

JUCS
2008

123views more JUCS 2008»

Exploring Information Extraction Resilience

15 years 6 months ago

Exploring Information Extraction Resilience

Download www.jucs.org

: There are many challenges developers face when attempting to reliably extract data from the Web. One of these challenges is the resilience of the extraction system to changes in the web pages information is being extracted from. This article compares the resilience of information extraction systems that use position based extraction with an ontology based extraction system and a system that combines position based extraction with ontology based extraction. The findings demonstrate the advantages of using a system that combines multiple extraction techniques, especially in environments where web sites change frequently and where data collection is conducted over an extended period of time. Key Words: Information extraction, semi-structured data, ontologies Category: H3.3, H3.4, H5.4

Dawn G. Gregg

Real-time Traffic

Information Extraction | Information Extraction Systems | JUCS 2008 | Ontology Based Extraction |

claim paper

Related Content

» ROXXI Reviving witness dOcuments to eXplore eXtracted Information

» An Exploration of the KolmogorovSmirnov Test as Competitor to Mutual Information Analysis

» Japanese Information Extraction with Automatically Extracted Patterns

» Extracting and Exploring the GeoTemporal Semantics of Textual Resources

» Extraction of social context and application to personal multimedia exploration

» LeakageResilient Cryptography

» Multiview Bootstrapping for Relation Extraction by Exploring Web Features and Linguistic F...

» An Intelligent Multilingual Information Browsing and Retrieval System Using Information Ex...

» SEXTANT Exploring Unexplored Contexts for Semantic Extraction from Syntactic Analysis

Post Info
More Details (n/a)

Added	13 Dec 2010
Updated	13 Dec 2010
Type	Journal
Year	2008
Where	JUCS
Authors	Dawn G. Gregg

Comments (0)