Sciweavers

240 search results - page 16 / 48
ICDAR
2009
IEEE
14 years 2 months ago
The GERMANA Database
A new handwritten text database, GERMANA, is presented to facilitate empirical comparison of different approaches to text line extraction and off-line handwriting recognition. G...
Daniel Pérez, Lionel Tarazón, Nicol...
CIKM
2009
Springer
14 years 2 months ago
The impact of document structure on keyphrase extraction
Keyphrases are short phrases that reflect the main topic of a document. Because manually annotating documents with keyphrases is a time-consuming process, several automatic appro...
Katja Hofmann, Manos Tsagkias, Edgar Meij, Maarten...
AWIC
2003
Springer
13 years 11 months ago
A Natural Language Interface for Information Retrieval on Semantic Web Documents
Abstract. We present a dialogue system that enables the access in natural language to a web information retrieval system. We use a Web Semantic Language to model the knowledge conv...
Paulo Quaresma, Irene Pimenta Rodrigues
ICPR
2008
IEEE
14 years 8 months ago
A discriminative semi-Markov model for robust scene text recognition
We present a semi-Markov model for recognizing scene text that integrates character and word segmentation with recognition. Using wavelet features, it requires only approximate lo...
Allen R. Hanson, Erik G. Learned-Miller, Jerod J. ...
KDD
2008
ACM
14 years 8 months ago
Structured entity identification and document categorization: two tasks with one joint model
Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...
Indrajit Bhattacharya, Shantanu Godbole, Sachindra...