Sciweavers

CLEF
2003
Springer

Pruning Texts with NLP and Expanding Queries with an Ontology: TagSearch

14 years 4 months ago
Pruning Texts with NLP and Expanding Queries with an Ontology: TagSearch
: The basic line of our action is first to use natural language processing to prune the texts and the query, and secondly to use an ontology to expand the queries. Last year The system described here is based on the one used last year for CLEF-2002. But most components have been improved and some new steps have been added. The system whose name is TagSearch is based on three main components: - A chunker, named TagChunker1 . - Lucene, a good OpenSource search engine written by Doug Cuting and his friends2 . - An ontology, named TagDico3 . The first two components were used last year. The use of an ontology is new. Objectives Our main objectives was to find the right documents in deducing implicit information and in avoiding noise. The task is divided in two steps: index the texts and search in the index. Main ideas for indexing The idea is that instead of indexing characters strings, the texts are parsed and only the results of the parsing are indexed4 . So we are able to prune the wron...
Gil Francopoulo
Added 06 Jul 2010
Updated 06 Jul 2010
Type Conference
Year 2003
Where CLEF
Authors Gil Francopoulo
Comments (0)