Sciweavers

10500 search results - page 92 / 2100
» Documentation for
Sort
View
ISICT
2003
13 years 9 months ago
Digital document life cycle development
Project MEMORIAL [3] is aimed at developing a new technology for creating Web based information systems using interactive electronic documents extracted from their paper originals...
Henryk Krawczyk, Bogdan Wiszniewski
ICML
2008
IEEE
14 years 9 months ago
Semi-supervised learning of compact document representations with deep networks
Finding good representations of text documents is crucial in information retrieval and classification systems. Today the most popular document representation is based on a vector ...
Marc'Aurelio Ranzato, Martin Szummer
PAKDD
2009
ACM
127views Data Mining» more  PAKDD 2009»
14 years 3 months ago
Clustering Documents Using a Wikipedia-Based Concept Representation
Abstract. This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document clustering. We first create a concept-based document representation b...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
ICDAR
2009
IEEE
14 years 3 months ago
Segmentation-free Word Spotting in Historical Printed Documents
In this paper, a new efficient word spotting methodology is presented that can be applied to historical printed documents without requiring any previous block or word segmentation...
Basilios Gatos, Ioannis Pratikakis
ICEIS
2009
IEEE
14 years 2 months ago
A Natural and Multi-layered Approach to Detect Changes in Tree-Based Textual Documents
Several efficient and very powerful algorithms exist for detecting changes in tree-based textual documents, such as those encoded in XML. An important aspect is still underestimat...
Angelo Di Iorio, Michele Schirinzi, Fabio Vitali, ...