Today web search engines provide the easiest way to reach information on the web. In this scenario, more than 95% of Indian language content on the web is not searchable due to mu...
The presence of replicas or near-replicas of documents is very common on the Web. Documents may be replicated completely or partially for different reasons (versions, mirrors, etc...
Ernesto Di Iorio, Michelangelo Diligenti, Marco Go...
Abstract. This paper extends previous studies that investigated the accessibility of different web sites of specific content, to an analysis of the whole web of a specific country ...
In this paper we try to consider a Web page as information with social aspects. Each Web page is the result of invisible social interaction. This interaction between different gro...
Abstract. Much of the information on the web is indeed dynamic content provided through linkups with databases. However, due to heterogeneity of databases, it is difficult to provi...
Jeong-Oog Lee, Myeong-Cheol Ko, Jinsoo Kim, Chang-...