Sciweavers

808 search results - page 76 / 162
» Keyword-based document clustering
Sort
View
ICPR
2008
IEEE
14 years 2 months ago
Stop word detection in compressed textual images: An experiment on indic script documents
Stop word detection is attempted in this work in the context of retrieval of document images in the compressed domain. Algorithms are presented to identify text lines and words an...
Utpal Garain, Amit Kumar Das
JSAI
2001
Springer
14 years 9 days ago
A Document as a Small World
The small world topology is known widespread in biological, social and man-made systems. This paper shows that the small world structure also exists in documents, such as papers. A...
Yutaka Matsuo, Yukio Ohsawa, Mitsuru Ishizuka
ICASSP
2011
IEEE
12 years 11 months ago
Using latent topic features to improve binary classification of spoken documents
In many topic identification applications, supervised training labels are indirectly related to the semantic content of the documents being classified. For example, many topical...
Jonathan Wintrode
ICDAR
2003
IEEE
14 years 1 months ago
Unsupervised Feature Selection Using Multi-Objective Genetic Algorithms for Handwritten Word Recognition
In this paper a methodology for feature selection in unsupervised learning is proposed. It makes use of a multiobjective genetic algorithm where the minimization of the number of ...
Marisa E. Morita, Robert Sabourin, Flávio B...
CIS
2004
Springer
14 years 1 months ago
A Method of Acquiring Ontology Information from Web Documents
Abstract. Ontology plays an important role on the Semantic Web. In this paper, we propose a method, AOIWD, of acquiring ontology information from Web documents. The AOIWD method em...
Lixin Han, Guihai Chen, Li Xie