Sciweavers

3425 search results - page 295 / 685
» Personalisation of Web Search
Sort
View
WWW
2007
ACM
16 years 4 months ago
A large-scale study of robots.txt
Search engines largely rely on Web robots to collect information from the Web. Due to the unregulated open-access nature of the Web, robot activities are extremely diverse. Such c...
Yang Sun, Ziming Zhuang, C. Lee Giles
CIKM
2009
Springer
15 years 10 months ago
MatchSim: a novel neighbor-based similarity measure with maximum neighborhood matching
The problem of measuring similarity between web pages arises in many important Web applications, such as search engines and Web directories. In this paper, we propose a novel neig...
Zhenjiang Lin, Michael R. Lyu, Irwin King
144
Voted
OTM
2005
Springer
15 years 9 months ago
Ontology-Based Spatial Query Expansion in Information Retrieval
Ontologies play a key role in Semantic Web research. A common use of ontologies in Semantic Web is to enrich the current Web resources with some well-defined meaning to enhance th...
Gaihua Fu, Christopher B. Jones, Alia I. Abdelmoty
SIGIR
2006
ACM
15 years 10 months ago
Finding near-duplicate web pages: a large-scale evaluation of algorithms
Broder et al.’s [3] shingling algorithm and Charikar’s [4] random projection based approach are considered “state-of-theart” algorithms for finding near-duplicate web pag...
Monika Rauch Henzinger
136
Voted
ACL
2010
15 years 2 months ago
Speech-Driven Access to the Deep Web on Mobile Devices
The Deep Web is the collection of information repositories that are not indexed by search engines. These repositories are typically accessible through web forms and contain dynami...
Taniya Mishra, Srinivas Bangalore