Sciweavers

WWW
2010
ACM
14 years 2 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
COLING
2008
13 years 9 months ago
A Grammar Checking System for Punjabi
This article describes a grammar checking system developed to detect various grammatical errors in Punjabi texts. The system utilizes a full-form lexicon fo...
Mandeep Singh Gill, Gurpreet Singh Lehal
DRR
2010
13 years 10 months ago
General text line extraction approach based on locally orientation estimation
This paper presents a novel approach for the multi-oriented text line extraction from historical handwritten Arabic documents. Because of the multi-orientation of lines and their ...
Nazih Ouwayed, Abdel Belaïd, François ...
ISIWI
2000
13 years 9 months ago
Automatic Document Classification - A thorough Evaluation of various Methods
(Automatic) document classification is generally defined as content-based assignment of one or more predefined categories to documents. Usually, machine learning, statistical patt...
Christoph Goller, J. Löning, T. Will, W. Wolf...
IJDAR
2008
13 years 7 months ago
Matching word images for content-based retrieval from printed document images
As large quantities of document images are being archived by digital libraries, there is a need for efficient search strategies to make them available as per users informatio...
Million Meshesha, C. V. Jawahar