Sciweavers

EWMF
2005
Springer
14 years 5 months ago
Information Retrieval in Trust-Enhanced Document Networks
To fight the problem of information overload in huge information sources like large document repositories (e.g., CiteSeer) or internet websites, you need a selection crit...
Klaus Stein, Claudia Hess
EWMF
2005
Springer
14 years 5 months ago
Semi-automatic Construction of Topic Ontologies
In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and K-Means clustering) and present how we integrated them in...
Blaz Fortuna, Dunja Mladenic, Marko Grobelnik
EWMF
2005
Springer
14 years 5 months ago
Introducing Semantics in Web Personalization: The Role of Ontologies
Web personalization is the process of customizing a web site to the needs of each specific user or set of users. Personalization of a web site may be performed by the provision of ...
Magdalini Eirinaki, Dimitrios Mavroeidis, George T...
EWMF
2005
Springer
14 years 5 months ago
Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis
We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decompositio...
Holger Bast, Georges Dupret, Debapriyo Majumdar, B...
AIRWEB
2005
Springer
14 years 5 months ago
Cloaking and Redirection: A Preliminary Study
Cloaking and redirection are two possible search engine spamming techniques. In order to understand cloaking and redirection on the Web, we downloaded two sets of Web pages while ...
Baoning Wu, Brian D. Davison
AIRWEB
2005
Springer
14 years 5 months ago
Blocking Blog Spam with Language Model Disagreement
We present an approach for detecting link spam common in blog comments by comparing the language models used in the blog post, the comment, and pages linked by the comments. In co...
Gilad Mishne, David Carmel, Ronny Lempel
AIRWEB
2005
Springer
14 years 5 months ago
Web Spam, Propaganda and Trust
Web spamming, the practice of introducing artificial text and links into web pages to affect the results of searches, has been recognized as a major problem for search engines. ...
Panagiotis Takis Metaxas, Joseph DeStefano
AIRWEB
2005
Springer
14 years 5 months ago
Web Spam Taxonomy
Web spamming refers to actions intended to mislead search engines into ranking some pages higher than they deserve. Recently, the amount of web spam has increased dramatically, le...
Zoltán Gyöngyi, Hector Garcia-Molina
AIRWEB
2005
Springer
14 years 5 months ago
An Analysis of Factors Used in Search Engine Ranking
This paper investigates the influence of different page features on the ranking of search engine results. We use Google (via its API) as our testbed and analyze the result rankin...
Albert Bifet, Carlos Castillo, Paul-Alexandru Chir...
AIRWEB
2005
Springer
14 years 5 months ago
SpamRank -- Fully Automatic Link Spam Detection
András A. Benczúr, Károly Csa...