Sciweavers

WWW
2008
ACM
15 years 6 days ago
Genealogical trees on the web: a search engine user perspective
This paper presents an extensive study about the evolution of textual content on the Web, which shows how some new pages are created from scratch while others are created using al...
Ricardo A. Baeza-Yates, Álvaro R. Pereira J...
WWW
2008
ACM
15 years 6 days ago
Collaborative filtering on skewed datasets
Many real life datasets have skewed distributions of events when the probability of observing few events far exceeds the others. In this paper, we observed that in skewed datasets...
Somnath Banerjee, Krishnan Ramanathan
WWW
2008
ACM
15 years 6 days ago
Exploiting semantic web technologies to model web form interactions
Form mapping is the key problem that needs to be solved in order to get access to the hidden web. Currently available solutions for fully automatic mapping are not ready for comme...
Bernhard Krüpl, Robert Baumgartner, Wolfgang ...
WWW
2008
ACM
15 years 6 days ago
Information "uptrieval": exploring models for content assimilation and aggregation for developing regions
Information Retrieval on the WWW is important because it is hard to find what one is looking for. There is a plethora of information available, and searching relevant information ...
Sheetal K. Agarwal, Arun Kumar, Sougata Mukherjea,...
WWW
2008
ACM
15 years 6 days ago
Pagerank for product image search
In this paper, we cast the image-ranking problem into the task of identifying "authority" nodes on an inferred visual similarity graph and propose an algorithm to analyz...
Yushi Jing, Shumeet Baluja
WWW
2008
ACM
15 years 6 days ago
Size matters: word count as a measure of quality on wikipedia
Wikipedia, "the free encyclopedia", now contains over two million English articles, and is widely regarded as a highquality, authoritative encyclopedia. Some Wikipedia a...
Joshua E. Blumenstock
WWW
2008
ACM
15 years 6 days ago
A teapot graph and its hierarchical structure of the chinese web
The shape of the Web in terms of its graphical structure has been a widely interested topic. Two graphs, Bow Tie and Daisy, have stood out from previous research. In this work, we...
Jonathan J. H. Zhu, Tao Meng, Zhengmao Xie, Geng L...
WWW
2008
ACM
15 years 6 days ago
Information retrieval and knowledge discovery on the semantic web of traditional chinese medicine
We conduct the first systematical adoption of the Semantic Web solution in the integration, management, and utilization of TCM information and knowledge resources. As the results,...
Zhaohui Wu, Tong Yu, Huajun Chen, Xiaohong Jiang, ...
WWW
2008
ACM
15 years 6 days ago
Histrace: building a search engine of historical events
In this paper, we describe an experimental search engine on our
Lian'en Huang, Jonathan J. H. Zhu, Xiaoming Li
WWW
2008
ACM
15 years 6 days ago
Enabling secure digital marketplace
The fast development of the Web provides new ways for effective distribution of network-based digital goods. A digital marketplace provides a platform to enable Web users to effec...
Hongxia Jin, Vladimir Zbarsky