Sciweavers

205 search results - page 37 / 41
» An Alternative Ranking Problem for Search Engines
Sort
View
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
14 years 7 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar
CIKM
2003
Springer
14 years 21 days ago
Categorizing web queries according to geographical locality
Web pages (and resources, in general) can be characterized according to their geographical locality. For example, a web page with general information about wildflowers could be c...
Luis Gravano, Vasileios Hatzivassiloglou, Richard ...
WEBI
2009
Springer
14 years 2 months ago
Towards Bridging the Web and the Semantic Web
The World Wide Web (WWW) has provided us with a plethora of information. However, given its unstructured format, this information is useful mainly to humans and cannot be effectiv...
Swarnim Kulkarni, Doina Caragea
CIKM
2005
Springer
14 years 1 months ago
ViPER: augmenting automatic information extraction with visual perceptions
In this paper we address the problem of unsupervised Web data extraction. We show that unsupervised Web data extraction becomes feasible when supposing pages that are made up of r...
Kai Simon, Georg Lausen
LREC
2008
155views Education» more  LREC 2008»
13 years 9 months ago
L-ISA: Learning Domain Specific Isa-Relations from the Web
Automated extraction of ontological knowledge from text corpora is a relevant task in Natural Language Processing. In this paper, we focus on the problem of finding hypernyms for ...
Alessandra Potrich, Emanuele Pianta