Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

175

Voted

CIKM
2009
Springer

115views Information Technology» more CIKM 2009»

Data extraction from the web using wild card queries

15 years 11 months ago

Data extraction from the web using wild card queries

Download webdocs.cs.ualberta.ca

This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task over a text collection is expressed as a query that combines text fragments with wild cards, and the query result is a set of facts in the form of unary, binary and general n-ary tuples. Despite being both simple and declarative, the framework can be applied to a wide range of extraction tasks. We report some of our work on expanding queries and ranking the the results. We also report some of our experiments and evaluations of the proposed querying framework. Categories and Subject Descriptors H.3.3 [Information Systems]: Information Search and Retrieval; H.5.2 [Information Systems]: User Interfaces General Terms Algorithms,Experimentation,Measurement Keywords DeWild, Data Extraction, Web Search, Ranking

Davood Rafiei, Haobin Li

Real-time Traffic

CIKM 2009 | Extraction Tasks | General N-ary Tuples | Language Text Sources |

claim paper

Related Content

» Data Extraction from Web Database Query Result Pages via Tagsets and Integer Sequences

» Domainindependent entity extraction from web search query logs

» From RESTful Services to RDF Connecting the Web and the Semantic Web

» On the Automatic Extraction of Data from the Hidden Web

» Extracting ObjectOriented Database Schemas from XML DTDs Using Inheritance

» Optimizing content freshness of relations extracted from the web using keyword search

» Automatic wrapper maintenance for semistructured web sources using results from previous q...

» Querying Web Data The WebQA Approach

» Creating Relational Data from Unstructured and Ungrammatical Data Sources

Post Info
More Details (n/a)

Added	24 Jul 2010
Updated	24 Jul 2010
Type	Conference
Year	2009
Where	CIKM
Authors	Davood Rafiei, Haobin Li

Comments (0)