Sciweavers

257 search results - page 13 / 52
» Text extraction from graphical document images using sparse ...
Sort
View
CBMS
2007
IEEE
14 years 2 months ago
Auto-Extraction, Representation and Integration of a Diabetes Ontology Using Bayesian Networks
This paper describes how high level biological knowledge obtained from ontologies such as the Gene Ontology (GO) can be integrated with low level information extracted from a Baye...
Kenneth McGarry, Sheila Garfield, Stefan Wermter
CVPR
2009
IEEE
13 years 11 months ago
Robust unsupervised segmentation of degraded document images with topic models
Segmentation of document images remains a challenging vision problem. Although document images have a structured layout, capturing enough of it for segmentation can be difficult....
Timothy J. Burns, Jason J. Corso
SIGIR
2003
ACM
14 years 28 days ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann
CASCON
2006
150views Education» more  CASCON 2006»
13 years 9 months ago
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein
TCSV
2002
292views more  TCSV 2002»
13 years 7 months ago
Document image segmentation using wavelet scale-space features
In this paper, an efficient and computationally fast method for segmenting text and graphics part of document images based on textural cues is presented. We assume that the graphic...
Mausumi Acharyya, Malay K. Kundu