Sciweavers

1042 search results - page 173 / 209
» Logic-based Web Information Extraction
Sort
View
WWW
2008
ACM
14 years 9 months ago
Generating diverse and representative image search results for landmarks
Can we leverage the community-contributed collections of rich media on the web to automatically generate representative and diverse views of the world's landmarks? We use a c...
Lyndon S. Kennedy, Mor Naaman
WS
2010
ACM
13 years 7 months ago
Structured literature image finder: Parsing text and figures in biomedical literature
The SLIF project combines text-mining and image processing to extract structured information from biomedical literature. SLIF extracts images and their captions from published pap...
Amr Ahmed, Andrew Arnold, Luís Pedro Coelho...
ACL
2010
13 years 7 months ago
Demonstration of a Prototype for a Conversational Companion for Reminiscing about Images
This paper describes an initial prototype demonstrator of a Companion, designed as a platform for novel approaches to the following: 1) The use of Information Extraction (IE) tech...
Yorick Wilks, Roberta Catizone, Alexiei Dingli, We...
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
14 years 3 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
JOT
2008
142views more  JOT 2008»
13 years 9 months ago
Mining Edgar Tender Offers
This paper describes how use the HTMLEditorKit to perform web data mining on EDGAR (Electronic Data-Gathering, Analysis, and Retrieval system). EDGAR is the SEC's (U.S. Secur...
Douglas Lyon