Sciweavers

109 search results - page 4 / 22
Topic Distributions over Links on Web
CIKM
2009
Springer
14 years 2 months ago
Vetting the links of the web
Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...
Na Dai, Brian D. Davison
WWW
2002
ACM
14 years 8 months ago
The structure of broad topics on the web
The Web graph is a giant social network whose properties have been measured and modeled extensively in recent years. Most such studies concentrate on the graph structure alone, an...
Soumen Chakrabarti, Mukul Joshi, Kunal Punera, Dav...
LREC
2008
13 years 9 months ago
Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retr
We have performed a set of experiments made to investigate the utility of morphological analysis to improve retrieval of documents written in languages with relatively large morph...
Jussi Karlgren, Hercules Dalianis, Bart Jongejan
VLDB
2002
ACM
13 years 7 months ago
Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection
Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over many s...
Panagiotis G. Ipeirotis, Luis Gravano
CIKM
2005
Springer
14 years 1 month ago
Focused crawling for both topical relevance and quality of medical information
Subject-specific search facilities on health sites are usually built using manual inclusion and exclusion rules. These can be expensive to maintain and often provide incomplete c...
Thanh Tin Tang, David Hawking, Nick Craswell, Kath...