Sciweavers

240 search results - page 9 / 48
» Temporally-aware algorithms for document classification
Sort
View
ICDAR
2003
IEEE
14 years 20 days ago
Classification of Web Documents Using a Graph Model
In this paper we describe work relating to classification of web documents using a graph-based model instead of the traditional vector-based model for document representation. We ...
Adam Schenker, Mark Last, Horst Bunke, Abraham Kan...
SIGIR
2002
ACM
13 years 7 months ago
Automatic classification in product catalogs
In this paper, we present the AutoCat system for product classification. AutoCat uses a vector space model, modified to consider product attributes unavailable in traditional docu...
Ben Wolin
SIGIR
2002
ACM
13 years 7 months ago
Unsupervised document classification using sequential information maximization
We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...
Noam Slonim, Nir Friedman, Naftali Tishby
DOCENG
2010
ACM
13 years 8 months ago
Glyph extraction from historic document images
This paper is about the reproduction of ancient texts with vectorised fonts. While for OCR only recognition rates count, a reproduction process does not necessarily require the re...
Lothar Meyer-Lerbs, Arne Schuldt, Björn Gottf...
ECIR
2006
Springer
13 years 8 months ago
Phrase Clustering Without Document Context
Abstract. We applied different clustering algorithms to the task of clustering multi-word terms in order to reflect a humanly built ontology. Clustering was done without the usual ...
Eric SanJuan, Fidelia Ibekwe-Sanjuan