Sciweavers

1362 search results - page 29 / 273
» A Statistical Learning Approach To Document Image Analysis
Sort
View
PAA
2006
13 years 7 months ago
Automatic name extraction from degraded document images
The problem addressed in this paper is the automatic extraction of names from a document image. Our approach relies on the combination of two complementary analyses. First, the ima...
Laurence Likforman-Sulem, Pascal Vaillant, Aliette...
ICDAR
2003
IEEE
14 years 21 days ago
Extraction, layout analysis and classification of diagrams in PDF documents
Diagrams are a critical part of virtually all scientific and technical documents. Analyzing diagrams will be important for building comprehensive document retrieval systems. This ...
Robert P. Futrelle, Mingyan Shao, Chris Cieslik, A...
SIGIR
2009
ACM
14 years 1 months ago
Identifying the original contribution of a document via language modeling
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
CVPR
2009
IEEE
15 years 2 months ago
Contextual Restoration of Severely Degraded Document Images
We propose an approach to restore severely degraded document images using a probabilistic context model. Un- like traditional approaches that use previously learned prior models...
Jyotirmoy Banerjee, Anoop M. Namboodiri, C. V. Jaw...
ECAI
2006
Springer
13 years 11 months ago
SUMMaR: Combining Linguistics and Statistics for Text Summarization
Abstract. We describe a text summarization system that moves beyond standard approaches by using a hybrid approach of linguistic and statistical analysis and by employing text-sort...
Manfred Stede, Heike Bieler, Stefanie Dipper, Arth...