Sciweavers

18 search results - page 3 / 4
» Improving kNN Text Categorization by Removing Outliers from ...
Sort
View
CLEF
2010
Springer
13 years 8 months ago
ZOT! to Wikipedia Vandalism - Lab Report for PAN at CLEF 2010
Abstract This vandalism detector uses features primarily derived from a wordpreserving differencing of the text for each Wikipedia article from before and after the edit, along wit...
James White, Rebecca Maessen
TKDE
2008
111views more  TKDE 2008»
13 years 7 months ago
Text Clustering with Feature Selection by Using Statistical Data
Abstract-- Feature selection is an important method for improving the efficiency and accuracy of text categorization algorithms by removing redundant and irrelevant terms from the ...
Yanjun Li, Congnan Luo, Soon M. Chung
IJCAI
2003
13 years 8 months ago
Coherent Keyphrase Extraction via Web Mining
Keyphrases are useful for a variety of purposes, including summarizing, indexing, labeling, categorizing, clustering, highlighting, browsing, and searching. The task of automatic ...
Peter D. Turney
CVPR
2011
IEEE
13 years 3 months ago
Enforcing Similarity Constraints with Integer Programming for Better Scene Text Recognition
The recognition of text in everyday scenes is made difficult by viewing conditions, unusual fonts, and lack of linguistic context. Most methods integrate a priori appearance info...
David Smith, Jacqueline Feild, Eric Learned-Miller
ANLP
2000
137views more  ANLP 2000»
13 years 8 months ago
Using Corpus-derived Name Lists for Named Entity Recognition
This paper describes experiments to establish the performance of a named entity recognition system which builds categorized lists of names from manually annotated training data. N...
Mark Stevenson, Robert J. Gaizauskas