Sciweavers

423 search results - page 68 / 85
» Text Classification by Labeling Words
Sort
View
CLEF
2010
Springer
13 years 8 months ago
ZOT! to Wikipedia Vandalism - Lab Report for PAN at CLEF 2010
Abstract This vandalism detector uses features primarily derived from a wordpreserving differencing of the text for each Wikipedia article from before and after the edit, along wit...
James White, Rebecca Maessen
ICASSP
2011
IEEE
12 years 11 months ago
A hierarchical generative model for Generic Audio Document Categorization
In this paper, we call the pattern classification problem that consists in assigning a category label to a long audio signal based on its semantic content as Generic Audio Documen...
Zhi Zeng, Shuwu Zhang
ICDAR
2009
IEEE
14 years 2 months ago
Classifying Foreground Pixels in Document Images
We present a system that classifies pixels in a document image according to marking type such as machine print, handwriting, and noise. A segmenter module first splits an input ...
Prateek Sarkar, Eric Saund, Jing Lin
COMAD
2009
13 years 8 months ago
Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text docume...
Biplav Srivastava, Yuan-Chi Chang
GECCO
2005
Springer
139views Optimization» more  GECCO 2005»
14 years 1 months ago
Use of a genetic algorithm in brill's transformation-based part-of-speech tagger
The tagging problem in natural language processing is to find a way to label every word in a text as a particular part of speech, e.g., proper noun. An effective way of solving th...
Garnett Carl Wilson, Malcolm I. Heywood