Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

206

FLAIRS
2004

175views Artificial Intelligence» more FLAIRS 2004»

Automatic Generation of Background Text to Aid Classification

15 years 9 months ago

Automatic Generation of Background Text to Aid Classification

Download www.cs.csi.cuny.edu

We illustrate that Web searches can often be utilized to generate background text for use with text classification. This is the case because there are frequently many pages on the World Wide Web that are relevant to particular text classification tasks. We show that an automatic method of creation of a secondary corpus of unlabeled but related documents can help decrease error rates in text categorization problems. Furthermore, if the test corpus is known, this related set of information can be tailored to match the particular categorization problem in a transductive approach. Our system uses WHIRL, a tool that combines database functionalities with techniques from the information retrieval literature. When there is a limited number of training examples, or the process of obtaining training examples is expensive or difficult, this method can be especially useful.

Sarah Zelikovitz, Robert Hafner

Real-time Traffic

Artificial Intelligence | Categorization Problem | FLAIRS 2004 | Particular Text Classification | Text Classification |

claim paper

Related Content

» KID an algorithm for fast and efficient text mining used to automatically generate a data...

» Integrating Background Knowledge Into Text Classification

» Text Classification using the Concept of Association Rule of Data Mining

» A Vector Space Model for Subjectivity Classification in Urdu aided by CoTraining

» AutoFACT An Automatic Functional Annotation and Classification Tool

» Automatic Classification of Text Databases Through Query Probing

» Generating Concept Hierarchies from Text for Intelligence Analysis

» Automatically Generating Annotator Rationales to Improve Sentiment Classification

» Automatic Generation of Classification Theorems for Finite Algebras

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	FLAIRS
Authors	Sarah Zelikovitz, Robert Hafner

Comments (0)