Sciweavers

72 search results - page 6 / 15
» Automatic Selection of Table Areas in Documents for Informat...
Sort
View
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
14 years 7 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
RULEML
2004
Springer
14 years 22 days ago
Rule Learning for Feature Values Extraction from HTML Product Information Sheets
The Web is now a huge information repository with a rich semantic structure that, however, is primarily addressed to human understanding rather than automated processing by a compu...
Costin Badica, Amelia Badica
DEXAW
2010
IEEE
201views Database» more  DEXAW 2010»
13 years 6 months ago
Keyword Extraction Using Word Co-occurrence
—A common strategy to assign keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for a word to be selected as...
Christian Wartena, Rogier Brussee, Wout Slakhorst
ICDAR
2009
IEEE
14 years 2 months ago
Seal Detection and Recognition: An Approach for Document Indexing
Reliable indexing of documents having seal instances can be achieved by recognizing seal information. This paper presents a novel approach for detecting and classifying such multi...
Partha Pratim Roy, Umapada Pal, Josep Lladó...
BMCBI
2010
98views more  BMCBI 2010»
13 years 7 months ago
Resolving anaphoras for the extraction of drug-drug interactions in pharmacological documents
Background: Drug-drug interactions are frequently reported in the increasing amount of biomedical literature. Information Extraction (IE) techniques have been devised as a useful ...
Isabel Segura-Bedmar, Mario Crespo, César d...