Sciweavers

543 search results - page 26 / 109
» Exploiting content redundancy for web information extraction
Sort
View
I3
2007
13 years 10 months ago
Identity: How to name it, How to find it
The main objective of this work is to exploit the relationship between the information findability problem and a subject-based organization of information. Identification of a sub...
Christo Dichev, Darina Dicheva, Jan Fischer
WWW
2010
ACM
14 years 3 months ago
Towards comment-based cross-media retrieval
This paper investigates whether Web comments can be exploited for cross-media retrieval. Comparing Web items such as texts, images, videos, music, products, or personal profiles ...
Martin Potthast, Benno Stein, Steffen Becker
ELPUB
2006
ACM
14 years 2 months ago
Pushing the Quality Level in Networked News Business: Semantic-Based Content Retrieval and Composition in International News Pub
Electronic publishing exploits numerous possibilities to present or exchange information and to communicate via most current media like the Internet. By utilizing modern Web techn...
Markus W. Schranz
RULEML
2004
Springer
14 years 2 months ago
Rule Learning for Feature Values Extraction from HTML Product Information Sheets
The Web is now a huge information repository with a rich semantic structure that, however, is primarily addressed to human understanding rather than automated processing by a compu...
Costin Badica, Amelia Badica
ACL
2012
11 years 11 months ago
ACCURAT Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora
The lack of parallel corpora and linguistic resources for many languages and domains is one of the major obstacles for the further advancement of automated translation. A possible...
Marcis Pinnis, Radu Ion, Dan Stefanescu, Fangzhong...