Sciweavers

1002 search results - page 24 / 201
» Unsupervised Relation Extraction From Web Documents
Sort
View
EMNLP
2007
13 years 9 months ago
Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents
Recognizing polarity requires a list of polar words and phrases. For the purpose of building such lexicon automatically, a lot of studies have investigated (semi-) unsupervised me...
Nobuhiro Kaji, Masaru Kitsuregawa
PRICAI
2000
Springer
13 years 11 months ago
Text Retrieval from Document Images based on N-Gram Algorithm
In this paper, we propose a method of text retrieval from document images using a similarity measure based on an N-Gram algorithm. We directly extract image features instead of us...
Chew Lim Tan, Sam Yuan Sung, Zhaohui Yu, Yi Xu
ICWE
2003
Springer
14 years 1 months ago
The Cooperative Web: A Step towards Web Intelligence
The Web is mainly processed by humans. The role of the machines is just to transmit and display the contents of the documents, barely being able to do something else. Nowadays ther...
Daniel Gayo-Avello, Darío Álvarez Gu...
WWW
2011
ACM
13 years 2 months ago
HyLiEn: a hybrid approach to general list extraction on the web
We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...
Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...
SAMT
2007
Springer
108views Multimedia» more  SAMT 2007»
14 years 1 months ago
Document Layout Substructure Discovery
Abstract. In this paper we present a system, DoLSuD, for the automatic discovery of relevant substructures in a document layout. DoLSuD, Document Layout Substructure Discovery, ext...
Claudio Andreatta