Search Sciweavers | Sciweavers

708 search results - page 23 / 142

» Identifying Content Blocks from Web Documents

118

click to vote

WWW
2009
ACM

135views Internet Technology» more WWW 2009»

User-centric content freshness metrics for search engines

16 years 3 months ago

Download www2009.org

In order to return relevant search results, a search engine must keep its local repository synchronized to the Web, but it is usually impossible to attain perfect freshness. Hence...

Ali Dasdan, Xinh Huynh

claim paper

Read More »

click to vote

SIGMOD
2000
ACM

85views Database» more SIGMOD 2000»

Finding Replicated Web Collections

15 years 7 months ago

Download ilpubs.stanford.edu

Many web documents (such as JAVA FAQs) are being replicated on the Internet. Often entire document collections (such as hyperlinked Linux manuals) are being replicated many times....

Junghoo Cho, Narayanan Shivakumar, Hector Garcia-M...

claim paper

Read More »

click to vote

ICDCS
2000
IEEE

74views Distributed And Parallel Com...» more ICDCS 2000»

On Supporting Weakly-Connected Browsing in a Mobile Web Environment

15 years 7 months ago

Download imsc-dmim.usc.edu

A mobile environment is weakly-connected, characterized by low communication bandwidth and poor connectivity. Conventional paradigm for sur ng mobile web documents is ine ective s...

Antonio Si, Hong Va Leong, Dennis McLeod, Stanley ...

claim paper

Read More »

114

click to vote

WWW
2009
ACM

142views Internet Technology» more WWW 2009»

Estimating web site readability using content extraction

16 years 3 months ago

Download www2009.eprints.org

Nowadays, information is primarily searched on the WWW. From a user perspective, the readability is an important criterion for measuring the accessibility and thereby the quality ...

Thomas Gottron, Ludger Martin

claim paper

Read More »

137

click to vote

LISA
2003

84views Operating System» more LISA 2003»

DryDock: A Document Firewall

15 years 4 months ago

Download tools.arlut.utexas.edu

Auditing a web site’s content is an arduous task. For any given page on a web server, system administrators are often ill-equipped to determine who created the document, why it�...

Deepak Giridharagopal

claim paper

Read More »

« Prev « First page 23 / 142 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers