Sciweavers

» Scalable Feature Extraction from Noisy Documents
LREC
2008
13 years 11 months ago
Automatic Extraction of Textual Elements from News Web Pages
In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...
Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany
ICPR
2008
IEEE
14 years 4 months ago
A robust technique for text extraction in mixed-type binary documents
A crucial preprocessing stage in applications such as OCR is text extraction from mixed-type documents. The present work, in contrast to most until now, successfully faces the pro...
Charalambos Strouthopoulos, Athanasios Nikolaidis
ESWS
2006
Springer
14 years 1 month ago
Automatic Extraction of Hierarchical Relations from Text
Automatic extraction of semantic relationships between entity instances in an ontology is useful for attaching richer semantic metadata to documents. In this paper we pro...
Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cu...
IRCDL
2010
13 years 8 months ago
A New Domain Independent Keyphrase Extraction System
In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams...
Nirmala Pudota, Antonina Dattolo, Andrea Baruzzo, ...
ACL
2012
12 years 6 days ago
Labeling Documents with Timestamps: Learning from their Time Expressions
Temporal reasoners for document understanding typically assume that a document’s creation date is known. Algorithms to ground relative time expressions and order events often re...
Nathanael Chambers