Sciweavers

315 search results - page 31 / 63
» Text classification from positive and unlabeled documents
ANLP
1994
13 years 10 months ago
Modeling Content Identification from Document Images
A new technique to locate content-representing words for a given document image using representation of character shapes is described. A character shape code representation define...
Takehiro Nakayama
PAMI
2007
13 years 8 months ago
Restoring 2D Content from Distorted Documents
This paper presents a framework to restore the 2D content printed on documents in the presence of geometric distortion and nonuniform illumination. Compared with text-based docu...
Michael S. Brown, Mingxuan Sun, Ruigang Yang, Lin ...
ICDAR
2007
IEEE
14 years 15 days ago
Iterated Document Content Classification
We report an improved methodology for training classifiers for document image content extraction, that is, the location and segmentation of regions containing handwriting, machine...
Chang An, Henry S. Baird, Pingping Xiu
ACL
1998
13 years 10 months ago
Information Classification and Navigation Based on 5W1H of the Target Information
This paper proposes a method by which 5W1H (who, when, where, what, why, how, and predicate) information is used to classify and navigate Japanese-language texts. 5W1H information,...
Takahiro Ikeda, Akitoshi Okumura, Kazunori Muraki
ESWS
2006
Springer
14 years 9 days ago
Automatic Extraction of Hierarchical Relations from Text
Automatic extraction of semantic relationships between entity instances in an ontology is useful for attaching richer semantic metadata to documents. In this paper we pro...
Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cu...