Sciweavers

1860 search results - page 302 / 372
» Automatic Generation of Search Engines
Sort
View
WWW
2006
ACM
14 years 9 months ago
Bootstrapping semantics on the web: meaning elicitation from schemas
In most web sites, web-based applications (such as web portals, emarketplaces, search engines), and in the file systems of personal computers, a wide variety of schemas (such as t...
Paolo Bouquet, Luciano Serafini, Stefano Zanobini,...
WWW
2006
ACM
14 years 9 months ago
Detecting spam web pages through content analysis
In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...
Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...
EDBT
2009
ACM
123views Database» more  EDBT 2009»
14 years 3 months ago
High-performance information extraction with AliBaba
A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...
WEBI
2009
Springer
14 years 3 months ago
Mining a Multilingual Geographical Gazetteer from the Web
Geographical gazetteers are necessary in a wide variety of applications. In the past, the construction of such gazetteers has been a tedious, manual process and only recently have...
Adrian Popescu, Gregory Grefenstette, Houda Bouamo...
JCDL
2009
ACM
168views Education» more  JCDL 2009»
14 years 3 months ago
A framework for describing web repositories
In prior work we have demonstrated that search engine caches and archiving projects like the Internet Archive’s Wayback Machine can be used to “lazily preserve” websites and...
Frank McCown, Michael L. Nelson