Sciweavers

345 search results - page 34 / 69
» On a new model for automatic text categorization based on Ve...
Sort
View
EVOW
2008
Springer
13 years 10 months ago
Evolving an Automatic Defect Classification Tool
Automatic Defect Classification (ADC) is a well-developed technology for inspection and measurement of defects on patterned wafers in the semiconductors industry. The poor training...
Assaf Glazer, Moshe Sipper
ICDM
2008
IEEE
136views Data Mining» more  ICDM 2008»
14 years 2 months ago
Document-Word Co-regularization for Semi-supervised Sentiment Analysis
The goal of sentiment prediction is to automatically identify whether a given piece of text expresses positive or negative opinion towards a topic of interest. One can pose sentim...
Vikas Sindhwani, Prem Melville
GFKL
2005
Springer
142views Data Mining» more  GFKL 2005»
14 years 2 months ago
Near Similarity Search and Plagiarism Analysis
Abstract. Existing methods to text plagiarism analysis mainly base on “chunking”, a process of grouping a text into meaningful units each of which gets encoded by an integer nu...
Benno Stein, Sven Meyer zu Eissen
HICSS
2006
IEEE
163views Biometrics» more  HICSS 2006»
14 years 2 months ago
Learning Ranking vs. Modeling Relevance
The classical (ad hoc) document retrieval problem has been traditionally approached through ranking according to heuristically developed functions (such as tf.idf or bm25) or gene...
Dmitri Roussinov, Weiguo Fan
TKDE
2008
175views more  TKDE 2008»
13 years 8 months ago
Efficient Phrase-Based Document Similarity for Clustering
Phrase has been considered as a more informative feature term for improving the effectiveness of document clustering. In this paper, we propose a phrase-based document similarity t...
Hung Chim, Xiaotie Deng