Sciweavers

GECCO
2007
Springer

Evolving Lucene search queries for text classification

14 years 6 months ago
Evolving Lucene search queries for text classification
We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to construct Lucene search queries. Genetic programs acquire fitness by producing queries that are effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks. Categories and Subject Descriptors D.3.3 [Programming Languages]: General Terms Algorithms. Keywords text classification, Genetic Programming, Apache Lucene.
Laurence Hirsch, Robin Hirsch, Masoud Saeedi
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where GECCO
Authors Laurence Hirsch, Robin Hirsch, Masoud Saeedi
Comments (0)