Sciweavers

472 search results - page 69 / 95
» Crawling the Hidden Web
Sort
View
ERCIMDL
2005
Springer
124views Education» more  ERCIMDL 2005»
14 years 2 months ago
A Comparison of On-Line Computer Science Citation Databases
This paper examines the difference and similarities between the two on-line computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manual...
Vaclav Petricek, Ingemar J. Cox, Hui Han, Isaac G....
FC
2010
Springer
160views Cryptology» more  FC 2010»
14 years 26 days ago
Measuring the Perpetrators and Funders of Typosquatting
We describe a method for identifying “typosquatting”, the intentional registration of misspellings of popular website addresses. We estimate that at least 938 000 typosquatting...
Tyler Moore, Benjamin Edelman
FEGC
2006
92views Biometrics» more  FEGC 2006»
13 years 10 months ago
Maintaining an Online Bibliographical Database: The Problem of Data Quality
CiteSeer and Google-Scholar are huge digital libraries which provide access to (computer-)science publications. Both collections are operated like specialized search engines, they ...
Michael Ley, Patrick Reuther
PPL
2008
140views more  PPL 2008»
13 years 9 months ago
An Importance-Aware Architecture for Large-Scale Grid Information Services
This paper is concerned with the scalability of large-scale grid monitoring and information services, which are mainly used for the discovery of resources of interest. Large-scale...
Serafeim Zanikolas, Rizos Sakellariou
INTERSPEECH
2010
13 years 3 months ago
Text normalization based on statistical machine translation and internet user support
In this paper, we describe and compare systems for text normalization based on statistical machine translation (SMT) methods which are constructed with the support of internet use...
Tim Schlippe, Chenfei Zhu, Jan Gebhardt, Tanja Sch...