Sciweavers

WWW
2005
ACM
15 years 8 days ago
A clustering method for news articles retrieval system
Organizing the results of a search facilitates the user in overviewing the information returned. We regard the clustering task as the tasks of making labels for a list of items an...
Hiroyuki Toda, Ryoji Kataoka
WWW
2005
ACM
15 years 8 days ago
Analysis of topic dynamics in web search
We report on a study of topic dynamics for pages visited by a sample of people using MSN Search. We examine the predictive accuracies of probabilistic models of topic transitions ...
Xuehua Shen, Susan T. Dumais, Eric Horvitz
WWW
2005
ACM
15 years 8 days ago
Automatic generation of link collections and their visualization
In this paper, we describe a method of generating link collections in a user-specified category by comprehensively collecting existing link collections and analyzing their hyperli...
Osamu Segawa, Jun Kawai, Kazuyuki Sakauchi
WWW
2005
ACM
15 years 8 days ago
Exploiting the deep web with DynaBot: matching, probing, and ranking
We present the design of Dynabot, a guided Deep Web discovery system. Dynabot's modular architecture supports focused crawling of the Deep Web with an emphasis on matching, p...
Daniel Rocco, James Caverlee, Ling Liu, Terence Cr...
WWW
2005
ACM
15 years 8 days ago
ALVIN: a system for visualizing large networks
Davood Rafiei, Stephen Curial
WWW
2005
ACM
15 years 8 days ago
Adaptive page ranking with neural networks
Recent developments in the area of neural networks provided new models which are capable of processing general types of graph structures. Neural networks are well-known for their ...
Franco Scarselli, Sweah Liang Yong, Markus Hagenbu...
WWW
2005
ACM
15 years 8 days ago
Constructing extensible XQuery mappings
Constructing and maintaining semantic mappings are necessary but troublesome in data sharing systems. While most current work focuses on seeking automated techniques to solve this...
Gang Qian, Yisheng Dong
WWW
2005
ACM
15 years 8 days ago
Automatically learning document taxonomies for hierarchical classification
While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...
Kunal Punera, Suju Rajan, Joydeep Ghosh
WWW
2005
ACM
15 years 8 days ago
Information flow using edge stress factor
This paper shows how a corpus of instant messages can be employed to detect de facto communities of practice automatically. A novel algorithm based on the concept of Edge Stress F...
Franco Salvetti, Savitha Srinivasan
WWW
2005
ACM
15 years 8 days ago
Delivering new web content reusing remote and heterogeneous sites. A DOM-based approach
This contribution addresses the development of new web sites reusing already existing contents from external sources. Unlike common links to other resources, which retrieves the w...
Luis Álvarez Sabucedo, Luis E. Anido-Rif&oa...