Sciweavers

267 search results - page 41 / 54
» Automatic Wrappers for Large Scale Web Extraction
Sort
View
AIME
2009
Springer
14 years 2 months ago
CORAAL - Towards Deep Exploitation of Textual Resources in Life Sciences
Abstract. Prominent biomedical literature search tools like ScienceDirect, PubMed Central or MEDLINE allow for efficient retrieval of resources based on key words. Due to vast amou...
Vít Novácek, Tudor Groza, Siegfried ...
WWW
2005
ACM
14 years 8 months ago
A search engine for natural language applications
Many modern natural language-processing applications utilize search engines to locate large numbers of Web documents or to compute statistics over the Web corpus. Yet Web search e...
Michael J. Cafarella, Oren Etzioni
CIKM
2005
Springer
14 years 1 months ago
Retrieving answers from frequently asked questions pages on the web
We address the task of answering natural language questions by using the large number of Frequently Asked Questions (FAQ) pages available on the web. The task involves three steps...
Valentin Jijkoun, Maarten de Rijke
VLDB
2002
ACM
161views Database» more  VLDB 2002»
13 years 7 months ago
Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection
Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over many s...
Panagiotis G. Ipeirotis, Luis Gravano
WWW
2008
ACM
14 years 8 months ago
Generating diverse and representative image search results for landmarks
Can we leverage the community-contributed collections of rich media on the web to automatically generate representative and diverse views of the world's landmarks? We use a c...
Lyndon S. Kennedy, Mor Naaman