Sciweavers

DMKD
2000
ACM

Combining Strategies for Extracting Relations from Text Collections

Download www.cs.columbia.edu

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use for answering precise queries or for running data mining tasks. Our Snowball system extracts these relations from document collections starting with only a handful of user-provided example tuples. Based on these tuples, Snowball generates patterns that are used, in turn, to find more tuples. In this paper we introduce a new pattern and tuple generation scheme for Snowball, with different strengths and weaknesses than those of our original system. We also show preliminary results on how we can combine the two versions of Snowball to extract tuples more accurately.
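The loop the abstract describes (seed tuples yield patterns, and patterns yield more tuples) can be illustrated with a minimal Python sketch. This is not the Snowball implementation: the pattern representation (the literal text between two entities), the crude capitalized-word entity matcher `ENTITY`, the function names, and the sample sentences are all assumptions made for illustration, and the sketch omits any pattern or tuple confidence scoring.

```python
import re

# Hypothetical entity matcher: one or more capitalized words.
# A crude stand-in for a real named-entity tagger (assumption).
ENTITY = r"[A-Z][a-zA-Z]*(?: [A-Z][a-zA-Z]*)*"


def learn_patterns(sentences, tuples):
    """Collect the text that appears between the two members of a known tuple."""
    patterns = set()
    for sentence in sentences:
        for left, right in tuples:
            match = re.search(re.escape(left) + r"(.+?)" + re.escape(right), sentence)
            if match:
                patterns.add(match.group(1))
    return patterns


def apply_patterns(sentences, patterns):
    """Find new entity pairs that are joined by a learned middle context."""
    found = set()
    for sentence in sentences:
        for middle in patterns:
            regex = rf"({ENTITY}){re.escape(middle)}({ENTITY})"
            for m in re.finditer(regex, sentence):
                found.add((m.group(1), m.group(2)))
    return found


def bootstrap(sentences, seeds, rounds=2):
    """Alternate between learning patterns and extracting tuples."""
    tuples = set(seeds)
    for _ in range(rounds):
        patterns = learn_patterns(sentences, tuples)
        tuples |= apply_patterns(sentences, patterns)
    return tuples


# Illustrative data only; not taken from the paper's experiments.
sentences = [
    "Microsoft is headquartered in Redmond.",
    "Boeing is headquartered in Seattle.",
    "Intel, based in Santa Clara, makes chips.",
]
seeds = {("Microsoft", "Redmond")}
result = bootstrap(sentences, seeds)
print(sorted(result))
```

Starting from the single seed pair, the sketch learns the context " is headquartered in " and uses it to pick up the Boeing sentence, while the Intel sentence is missed because no seed exposes its phrasing. Exact-string patterns like these are brittle, which is one reason a real system would need a more flexible pattern representation and some way to score what it extracts.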

Eugene Agichtein, Eleazar Eskin, Luis Gravano

Related Content

» Snowball extracting relations from large plaintext collections

» RelExt A Tool for Relation Extraction from Text in Ontology Extension

» Combining Statistical Techniques and Lexicosyntactic Patterns for Semantic Relations Extra...

» Combining relations for information extraction from free text

» Predicting accuracy of extracting information from unstructured text collections

» Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web

» Automatic Collection of Related Terms from the Web

» Mining relational data from text From strictly supervised to weakly supervised learning

» Automatic Acquisition of Script Knowledge from a Text Collection

Post Info

Added	01 Aug 2010
Updated	01 Aug 2010
Type	Conference
Year	2000
Where	DMKD
Authors	Eugene Agichtein, Eleazar Eskin, Luis Gravano