Using ILP to Construct Features for Information Extraction from Semi-structured Text

Download www2.chi.unsw.edu.au

Machine-generated documents containing semi-structured text are rapidly forming the bulk of data being stored in an organisation. Given a feature-based representation of such data, methods like SVMs are able to construct good models for information extraction (IE). But how are the feature-definitions to be obtained in the first place? (We are referring here to the representation problem: selecting good features from the ones defined comes later.) So far, features have been defined manually or by using special-purpose programs: neither approach scaling well to handle the heterogeneity of the data or new domain-specific information. We suggest that Inductive Logic Programming (ILP) could assist in this. Specifically, we demonstrate the use of ILP to define features for seven IE tasks using two disparate sources of information. Our findings are as follows: (1) the ILP system is able to identify efficiently large numbers of good features. Typically, the time taken to identify the f...
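To make the idea of a feature definition concrete, here is a minimal sketch in Python. It is not the ILP system from the paper: the background predicates (`is_number`, `prev_is`, `is_capitalised`), the feature names, and the sample line are all hypothetical, and the clause bodies are written by hand, whereas an ILP system would search for them. It only illustrates how a clause-like definition (a conjunction of background predicates) becomes one Boolean column of a feature vector that a learner such as an SVM could consume.

```python
import re
from typing import Callable, Dict, List, Optional

Token = Dict[str, Optional[str]]
Predicate = Callable[[Token], bool]


def tokenize(line: str) -> List[Token]:
    """Split a line on whitespace; each token records its left neighbour."""
    words = line.split()
    return [
        {"text": w, "prev": words[i - 1] if i > 0 else None}
        for i, w in enumerate(words)
    ]


# Hypothetical background predicates (assumptions, not from the paper).
def is_number(tok: Token) -> bool:
    return re.fullmatch(r"\d+(\.\d+)?", tok["text"]) is not None


def is_capitalised(tok: Token) -> bool:
    return tok["text"][0].isupper()


def prev_is(word: str) -> Predicate:
    return lambda tok: tok["prev"] == word


# Each feature is a conjunction of predicates, mimicking a clause body.
# Hand-written here; an ILP system would construct such bodies by search.
FEATURES: Dict[str, List[Predicate]] = {
    "f1_number": [is_number],
    "f2_number_after_total": [is_number, prev_is("Total:")],
    "f3_capitalised": [is_capitalised],
}


def feature_vector(tok: Token) -> List[int]:
    """Evaluate every feature definition on one token (1 = body holds)."""
    return [int(all(p(tok) for p in body)) for body in FEATURES.values()]


tokens = tokenize("Invoice Total: 42.50 USD")
vectors = {tok["text"]: feature_vector(tok) for tok in tokens}
```

Each row of `vectors` is the kind of fixed-length Boolean representation the abstract calls feature-based; choosing which columns to keep is the separate selection step it mentions.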

Ganesh Ramakrishnan, Sachindra Joshi, Sreeram Balakrishnan, Ashwin Srinivasan

Artificial Intelligence | Feature-based Representation | IE Tasks | ILP 2007 | Information Extraction |

» Constructing Reference Sets from Unstructured, Ungrammatical Text

» Open Information Extraction Using Wikipedia

» Coupling information retrieval and information extraction: A new text technology for gather...

» Learning Ensembles of First-Order Clauses for Recall-Precision Curves: A Case Study in Biomed...

» Video text recognition using feature compensation as category-dependent feature extraction

» Multiview Bootstrapping for Relation Extraction by Exploring Web Features and Linguistic F...

» Using Text Mining to Infer Semantic Attributes for Retail Data Mining

» CoreEx: content extraction from online news articles

Post Info

Added	08 Jun 2010
Updated	08 Jun 2010
Type	Conference
Year	2007
Where	ILP
Authors	Ganesh Ramakrishnan, Sachindra Joshi, Sreeram Balakrishnan, Ashwin Srinivasan

Sciweavers
