Sciweavers

240 search results - page 16 / 48
» Modelling the Retrieval of Structured Documents Containing T...
ICDAR
2009
IEEE
15 years 10 months ago
The GERMANA Database
A new handwritten text database, GERMANA, is presented to facilitate empirical comparison of different approaches to text line extraction and off-line handwriting recognition. G...
Daniel Pérez, Lionel Tarazón, Nicol...
CIKM
2009
ACM
15 years 10 months ago
The impact of document structure on keyphrase extraction
Keyphrases are short phrases that reflect the main topic of a document. Because manually annotating documents with keyphrases is a time-consuming process, several automatic appro...
Katja Hofmann, Manos Tsagkias, Edgar Meij, Maarten...
AWIC
2003
Springer
15 years 7 months ago
A Natural Language Interface for Information Retrieval on Semantic Web Documents
We present a dialogue system that enables the access in natural language to a web information retrieval system. We use a Web Semantic Language to model the knowledge conv...
Paulo Quaresma, Irene Pimenta Rodrigues
ICPR
2008
IEEE
16 years 4 months ago
A discriminative semi-Markov model for robust scene text recognition
We present a semi-Markov model for recognizing scene text that integrates character and word segmentation with recognition. Using wavelet features, it requires only approximate lo...
Allen R. Hanson, Erik G. Learned-Miller, Jerod J. ...
KDD
2008
ACM
16 years 3 months ago
Structured entity identification and document categorization: two tasks with one joint model
Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...
Indrajit Bhattacharya, Shantanu Godbole, Sachindra...