Sciweavers

708 search results - page 16 / 142
» Identifying Content Blocks from Web Documents
IPPS
2005
IEEE
14 years 2 months ago
Securely Replicated Web Documents
In order to achieve better scalability and reduce latency in handling user requests, many Web applications make extensive use of data replication through caches and Content Delive...
Bogdan C. Popescu, Maarten van Steen, Bruno Crispo...
HICSS
2009
IEEE
14 years 3 months ago
An N-Gram Based Approach to Automatically Identifying Web Page Genre
The research reported in this paper is the first phase of a larger project on the automatic classification of web pages by their genres, using n-gram representations of the web pag...
Jane E. Mason, Michael A. Shepherd, Jack Duffy
EUC
2004
Springer
14 years 2 months ago
On the Effects of Bandwidth Reduction Techniques in Distributed Applications
Communication optimization plays an important role in building networked distributed applications. In this paper, we systematically evaluate four bandwidth reduction algorithms, n...
Hanping Lufei, Weisong Shi, Lucia Zamorano
APWEB
2003
Springer
14 years 1 months ago
Extracting Content Structure for Web Pages Based on Visual Representation
A new web content structure based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and auto...
Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma
DSS
2008
13 years 8 months ago
A machine learning approach to web page filtering using content and structure analysis
As the Web continues to grow, it has become increasingly difficult to search for relevant information using traditional search engines. Topic-specific search engines provide an al...
Michael Chau, Hsinchun Chen