Search Sciweavers | Sciweavers

708 search results - page 6 / 142

» Identifying Content Blocks from Web Documents

211

click to vote

WWW
2005
ACM

154views Internet Technology» more WWW 2005»

Thresher: automating the unwrapping of semantic content from the World Wide Web

16 years 7 months ago

Download www2005.org

We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...

Andrew Hogue, David R. Karger

claim paper

Read More »

199

click to vote

DOCENG
2010
ACM

220views Document Analysis» more DOCENG 2010»

From templates to schemas: bridging the gap between free editing and safe data processing

15 years 5 months ago

Download hal.inria.fr

In this paper we present tools that provide an easy way to edit XML content directly on the web, with the usual beneﬁt of valid XML content. These tools make it possible to crea...

Vincent Quint, Cécile Roisin, Stépha...

claim paper

Read More »

180

click to vote

SIGIR
2008
ACM

136views Information Technology» more SIGIR 2008»

Comments-oriented document summarization: understanding documents with readers' feedback

15 years 6 months ago

Download www.cais.ntu.edu.sg

Comments left by readers on Web documents contain valuable information that can be utilized in different information retrieval tasks including document search, visualization, and ...

Meishan Hu, Aixin Sun, Ee-Peng Lim

claim paper

Read More »

142

Voted

WWW
2007
ACM

114views Internet Technology» more WWW 2007»

Homepage live: automatic block tracing for web personalization

16 years 7 months ago

Download www2007.org

The emergence of personalized homepage services, e.g. personalized Google Homepage and Microsoft Windows Live, has enabled Web users to select Web contents of interest and to aggr...

Jie Han, Dingyi Han, Chenxi Lin, Hua-Jun Zeng, Zhe...

claim paper

Read More »

178

Voted

WWW
2008
ACM

127views Internet Technology» more WWW 2008»

Genealogical trees on the web: a search engine user perspective

16 years 7 months ago

Download www2008.org

This paper presents an extensive study about the evolution of textual content on the Web, which shows how some new pages are created from scratch while others are created using al...

Ricardo A. Baeza-Yates, Álvaro R. Pereira J...

claim paper

Read More »

« Prev « First page 6 / 142 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers