Sciweavers

495 search results for "Discovering the Representative of a Search Engine" (page 69 of 99)
WWW
2003
ACM
14 years 9 months ago
Detecting Near-replicas on the Web by Content and Hyperlink Analysis
The presence of replicas or near-replicas of documents is very common on the Web. Documents may be replicated completely or partially for different reasons (versions, mirrors, etc...
Ernesto Di Iorio, Michelangelo Diligenti, Marco Go...
ER
2010
Springer
13 years 7 months ago
W-Ray: A Strategy to Publish Deep Web Geographic Data
This paper introduces an approach to address the problem of accessing conventional and geographic data from the Deep Web. The approach relies on describing the relevant d...
Helena Piccinini, Melissa Lemos, Marco A. Casanova...
DEXAW
2008
IEEE
13 years 10 months ago
Segmentation of Legislative Documents Using a Domain-Specific Lexicon
The amount of legal information is continuously growing. New legislative documents appear every day on the Web. Legal documents are produced on a daily basis in briefing format, cont...
Ismael Hasan, Javier Parapar, Roi Blanco
CORR
2010
Springer
13 years 9 months ago
Link Graph Analysis for Adult Images Classification
To protect an image search engine's users from undesirable results, a classifier for adult images should be built. The information about links from websites to images...
Evgeny Kharitonov, Anton Slesarev, Ilya Muchnik, F...
EMNLP
2010
13 years 6 months ago
Mining Name Translations from Entity Graph Mapping
This paper studies the problem of mining entity translation, specifically, mining English and Chinese name pairs. Existing efforts can be categorized into (a) a transliteration-bas...
Gae-won You, Seung-won Hwang, Young-In Song, Long ...