An Integrated Term-Based Corpus Query System

In this paper we describe the X-TRACT workbench, which enables efficient term-based querying against a domain-specific literature corpus. Its main aim is to aid domain specialists in locating and extracting new knowledge from scientific literature corpora. Before querying, a corpus is automatically terminologically analysed by the ATRACT system, which performs terminology recognition based on the C/NC-value method enhanced by incorporation of term variation handling. The results of terminology processing are annotated in XML, and the produced XML documents are stored in an XML-native database. All corpus retrieval operations are performed against this database using an XML query language. We illustrate the way in which the X-TRACT workbench can be utilised for knowledge discovery, literature mining and conceptual information extraction.
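
The idea of querying term annotations stored as XML can be illustrated with a minimal sketch. The element and attribute names (`corpus`, `doc`, `s`, `term`, `norm`) and the sample sentences are assumptions made for illustration only; they are not the schema produced by ATRACT. Python's standard-library `xml.etree.ElementTree`, with its limited XPath support, stands in here for the XML-native database and XML query language mentioned in the abstract.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation format: the tag and attribute names below are
# assumptions for illustration, not the actual ATRACT/X-TRACT schema.
ANNOTATED = """
<corpus>
  <doc id="d1">
    <s>The <term norm="retinoic acid receptor">retinoic acid receptors</term>
       bind <term norm="dna">DNA</term>.</s>
  </doc>
  <doc id="d2">
    <s>Activation of the <term norm="retinoic acid receptor">RAR</term>
       was observed.</s>
  </doc>
</corpus>
"""


def docs_mentioning(root, norm):
    """Return ids of documents containing a term whose normalised form is `norm`."""
    hits = []
    for doc in root.findall("doc"):
        # ElementTree supports the [@attr='value'] XPath predicate.
        if doc.find(f".//term[@norm='{norm}']") is not None:
            hits.append(doc.get("id"))
    return hits


def term_variants(root, norm):
    """Collect the distinct surface forms annotated with the given normalised form."""
    return sorted({t.text for t in root.iterfind(f".//term[@norm='{norm}']")})


root = ET.fromstring(ANNOTATED)
print(docs_mentioning(root, "retinoic acid receptor"))
print(term_variants(root, "retinoic acid receptor"))
```

Linking each surface form to a normalised form through an attribute is one simple way to let a single query retrieve all variants of a term; the actual system may represent term variation differently.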

Kostas Manios, Goran Nenadic, Irena Spasic, Sophia Ananiadou

Domain-specific Literature Corpus | EACL 2003 | Natural Language Processing | Scientific Literature Corpora | X-TRACT Workbench |

» The Personal Reader Personalizing and Enriching Learning Resources Using Semantic Web Tech...

» Optimizing scoring functions and indexes for proximity search in type-annotated corpora

» A Simple Bayesian Framework for Content-Based Image Retrieval

» Integrating Semantic Knowledge into Text Similarity and Information Retrieval

» Integrated Term Weighting Visualization and User Interface Development for Bioinformation ...

» WikiAnalytics Ad-hoc Querying of Highly Heterogeneous Structured Data

» Content-Based Queries on the CasImage Database Within the IRMA Framework

» Iterative translation disambiguation for cross-language information retrieval

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	EACL
Authors	Kostas Manios, Goran Nenadic, Irena Spasic, Sophia Ananiadou
