Search Sciweavers | Sciweavers

466 search results - page 18 / 94

» Scalable Feature Extraction from Noisy Documents

321

Voted

DAS
2006
Springer

266views Document Analysis» more DAS 2006»

Script Identification from Indian Documents

15 years 11 months ago

Download cvit.iiit.ac.in

Abstract. Automatic identification of a script in a given document image facilitates many important applications such as automatic archiving of multilingual documents, searching on...

Gopal Datt Joshi, Saurabh Garg, Jayanthi Sivaswamy

claim paper

Read More »

228

click to vote

TAL
2010
Springer

127views Natural Language Processing» more TAL 2010»

Summarization as Feature Selection for Document Categorization on Small Datasets

15 years 5 months ago

Download users.dsic.upv.es

Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...

Emmanuel Anguiano-Hernández, Luis Villase&n...

claim paper

Read More »

239

click to vote

WWW
2009
ACM

189views Internet Technology» more WWW 2009»

Extracting data records from the web using tag path clustering

16 years 4 days ago

Download www2009.org

Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the ﬁrst step of this object extraction process, identiﬁes...

Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...

claim paper

Read More »

191

click to vote

CIKM
2009
Springer

163views Information Technology» more CIKM 2009»

The impact of document structure on keyphrase extraction

16 years 2 months ago

Download ilps.science.uva.nl

Keyphrases are short phrases that reﬂect the main topic of a document. Because manually annotating documents with keyphrases is a time-consuming process, several automatic appro...

Katja Hofmann, Manos Tsagkias, Edgar Meij, Maarten...

claim paper

Read More »

199

click to vote

ICDAR
2003
IEEE

105views Document Analysis» more ICDAR 2003»

Extraction, layout analysis and classification of diagrams in PDF documents

16 years 23 days ago

Download www.ccs.neu.edu

Diagrams are a critical part of virtually all scientific and technical documents. Analyzing diagrams will be important for building comprehensive document retrieval systems. This ...

Robert P. Futrelle, Mingyan Shao, Chris Cieslik, A...

claim paper

Read More »

« Prev « First page 18 / 94 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers