Search Sciweavers | Sciweavers

203

NLDB
2004
Springer

145views Natural Language Processing» more NLDB 2004»

A Flexible Workbench for Document Analysis and Text Mining

16 years 12 days ago

Abstract: Document analysis and text mining techniques are used to preprocess documents in information retrieval systems, to extract concepts in ontology construction processes, an...

Jon Atle Gulla, Terje Brasethvik, Harald Kaada

claim paper

Read More »

203

click to vote

ICDAR
2011
IEEE

235views Document Analysis» more ICDAR 2011»

Localization of Digit Strings in Farsi/Arabic Document Images Using Structural Features and Syntactical Analysis

14 years 6 months ago

Download www.icdar2011.org

—This paper presents a new method for localization of digit strings with a specific syntax in Farsi/ Arabic document images. First, some features are extracted from all connected...

Ali Abedi, Karim Faez

claim paper

Read More »

201

click to vote

ICDAR
2009
IEEE

161views Document Analysis» more ICDAR 2009»

Learning Rich Hidden Markov Models in Document Analysis: Table Location

16 years 1 months ago

Download homepages.inf.ed.ac.uk

Hidden Markov Models (HMM) are probabilistic graphical models for interdependent classification. In this paper we experiment with different ways of combining the components of an ...

Ana Costa e Silva

claim paper

Read More »

172

click to vote

CICLING
2001
Springer

140views Natural Language Processing» more CICLING 2001»

Automatic Keyword Extraction Using Domain Knowledge

15 years 11 months ago

Download people.dsv.su.se

Documents can be assigned keywords by frequency analysis of the terms found in the document text, which arguably is the primary source of knowledge about the document itself. By in...

Anette Hulth, Jussi Karlgren, Anna Jonsson, Henrik...

claim paper

Read More »

196

click to vote

ICDAR
2003
IEEE

282views Document Analysis» more ICDAR 2003»

Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis

16 years 10 days ago

Download research.microsoft.com

Neural networks are a powerful technology for classification of visual inputs arising from documents. However, there is a confusing plethora of different neural network methods th...

Patrice Simard, David Steinkraus, John C. Platt

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers