Sciweavers

222 search results - page 14 / 45
» Ancient document analysis based on text line extraction
SIGIR
2003
ACM
14 years 1 month ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann
COLING
2010
13 years 2 months ago
Text Summarization of Turkish Texts using Latent Semantic Analysis
Text summarization solves the problem of extracting important information from huge amount of text data. There are various methods in the literature that aim to find out well-form...
Makbule Ozsoy, Ilyas Cicekli, Ferda Nur Alpaslan
DAS
2010
Springer
13 years 5 months ago
Information extraction by finding repeated structure
Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...
Evgeniy Bart, Prateek Sarkar
ICDAR
2003
IEEE
14 years 1 month ago
Document page similarity based on layout visual saliency: Application to query by example and document classification
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Véronique Eglin, Stéphane Bres
ICDAR
2009
IEEE
14 years 2 months ago
A Gradient Difference Based Technique for Video Text Detection
Text detection in video images has received increasing attention, particularly in scene text detection in video images, as it plays a vital role in video indexing and information ...
Palaiahnakote Shivakumara, Trung Quy Phan, Chew Li...