Avatar Information Extraction System

The AVATAR Information Extraction System (IES) at the IBM Almaden Research Center enables high-precision, rule-based information extraction from text documents. Drawing from our experience, we propose the use of probabilistic database techniques as the formal underpinnings of information extraction systems so as to maintain high precision while increasing recall. This involves building a framework where rule-based annotators can be mapped to queries in a database system. We use examples from AVATAR IES to describe the challenges in achieving this goal. Finally, we show that deriving precision estimates in such a database system presents a significant challenge for probabilistic database systems.
Added 11 Dec 2010
Updated 11 Dec 2010
Type Journal
Year 2006
Where DEBU
Authors T. S. Jayram, Rajasekar Krishnamurthy, Sriram Raghavan, Shivakumar Vaithyanathan, Huaiyu Zhu