Search Sciweavers | Sciweavers

99 search results - page 9 / 20

» Generalized inverse document frequency

188

click to vote

ICDAR
2009
IEEE

168views Document Analysis» more ICDAR 2009»

Scalable Feature Extraction from Noisy Documents

16 years 1 months ago

Download www.cvc.uab.es

We cope with the metadata recognition in layoutoriented documents. We address the problem as a classiﬁcation task and propose a method for automatic extraction of relevant featu...

Loïc Lecerf, Boris Chidlovskii

claim paper

Read More »

187

click to vote

KCAP
2005
ACM

97views Information Technology» more KCAP 2005»

Extracting significant words from corpora for ontology extraction

16 years 18 days ago

Download www.aktors.org

This paper reports a technique for Knowledge Extraction using Natural Language Processing for the purposes of semi-automatic Ontology learning. Determination of significant words ...

Dileep G. Damle, Victoria S. Uren

claim paper

Read More »

190

click to vote

ICDAR
2011
IEEE

177views Document Analysis» more ICDAR 2011»

Chinese Keyword Spotting Using Knowledge-Based Clustering

14 years 6 months ago

Download www.icdar2011.org

—Content-based document image retrieval is a new and promising research area. Without OCR, document indexing directly based on image content is more general and convenient. Howev...

Yong Xia, Kuanquan Wang, Mingwei Li

claim paper

Read More »

229

click to vote

FLAIRS
2007

243views Artificial Intelligence» more FLAIRS 2007»

Indexing Documents by Discourse and Semantic Contents from Automatic Annotations of Texts

15 years 9 months ago

Download www.aaai.org

The basic aim of the model proposed here is to automatically build semantic metatext structure for texts that would allow us to search and extract discourse and semantic informati...

Brahim Djioua, Jean-Pierre Desclés

claim paper

Read More »

211

click to vote

SPIRE
2010
Springer

178views Information Technology» more SPIRE 2010»

Dual-Sorted Inverted Lists

15 years 5 months ago

Download www.dcc.uchile.cl

Several IR tasks rely, to achieve high eﬃciency, on a single pervasive data structure called the inverted index. This is a mapping from the terms in a text collection to the docu...

Gonzalo Navarro, Simon J. Puglisi

claim paper

Read More »

« Prev « First page 9 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers