Sciweavers

» Marking Text Documents
IJCNN
2007
IEEE
14 years 2 months ago
Text Representations for Text Categorization: A Case Study in Biomedical Domain
In the vector space model (VSM), textual documents are represented as vectors in the term space. Therefore, there are two issues in this representation, i.e. (1) what should a term...
Man Lan, Chew Lim Tan, Jian Su, Hwee-Boon Low
CIKM
2000
Springer
14 years 14 days ago
Scalable association-based text classification
The Naïve Bayes (NB) classifier has long been considered a core methodology in text classification, mainly due to its simplicity and computational efficiency. There is an increasing n...
Dimitris Meretakis, Dimitris Fragoudis, Hongjun Lu...
SIGIR
2008
ACM
13 years 8 months ago
Local text reuse detection
Text reuse occurs in many different types of documents and for many different reasons. One form of reuse, duplicate or near-duplicate documents, has been a focus of researchers be...
Jangwon Seo, W. Bruce Croft
SIGCSE
2006
ACM
14 years 2 months ago
TextMOLE: text mining operations library and environment
The paper describes the first version of the TextMOLE (Text Mining Operations Library and Environment) system for textual data mining. Currently TextMOLE acts as an advanced inde...
Daniel B. Waegel, April Kontostathis
CVPR
2009
IEEE
13 years 11 months ago
Robust unsupervised segmentation of degraded document images with topic models
Segmentation of document images remains a challenging vision problem. Although document images have a structured layout, capturing enough of it for segmentation can be difficult....
Timothy J. Burns, Jason J. Corso