Sciweavers

849 search results - page 56 / 170
» Modeling Content Identification from Document Images
SAC
2008
ACM
13 years 8 months ago
Author identification using writer-dependent and writer-independent strategies
In this work we discuss author identification for documents written in Portuguese. Two different approaches were compared. The first is the writer-independent model which reduces ...
Daniel Pavelec, Edson J. R. Justino, Leonardo Vida...
LREC
2010
13 years 10 months ago
Automatic Identification of Arabic Dialects
In this work, automatic recognition of Arabic dialects is proposed. An acoustic survey of the proportion of vocalic intervals and the standard deviation of consonantal intervals i...
Mohamed Belgacem, Georges Antoniadis, Laurent Besa...
ICDAR
2011
IEEE
12 years 8 months ago
Localization of Digit Strings in Farsi/Arabic Document Images Using Structural Features and Syntactical Analysis
This paper presents a new method for localization of digit strings with a specific syntax in Farsi/Arabic document images. First, some features are extracted from all connected...
Ali Abedi, Karim Faez
ICML
2010
IEEE
13 years 10 months ago
A Language-based Approach to Measuring Scholarly Impact
Identifying the most influential documents in a corpus is an important problem in many fields, from information science and historiography to text summarization and news aggregati...
Sean Gerrish, David M. Blei
CIKM
2010
ACM
13 years 7 months ago
Decomposing background topics from keywords by principal component pursuit
Low-dimensional topic models have been proven very useful for modeling a large corpus of documents that share a relatively small number of topics. Dimensionality reduction tools s...
Kerui Min, Zhengdong Zhang, John Wright, Yi Ma