Sciweavers

708 search results - page 8 / 142
» Identifying Content Blocks from Web Documents
Sort
View
ICML
2000
IEEE
14 years 8 months ago
Learning to Probabilistically Identify Authoritative Documents
We describe a model of document citation that learns to identify hubs and authorities in a set of linked documents, such as pages retrieved from the world wide web, or papers retr...
David Cohn, Huan Chang
CIKM
2006
Springer
13 years 11 months ago
A comparative study on classifying the functions of web page blocks
In this paper, we study the problem of learning block classification models to estimate block functions. We distinguish general models, which are learned across multiple sites, an...
Xiangye Xiao, Qiong Luo, Xing Xie, Wei-Ying Ma
AIRWEB
2009
Springer
14 years 2 months ago
Looking into the past to better classify web spam
Web spamming techniques aim to achieve undeserved rankings in search results. Research has been widely conducted on identifying such spam and neutralizing its influence. However,...
Na Dai, Brian D. Davison, Xiaoguang Qi
IUI
2010
ACM
14 years 2 months ago
From documents to tasks: deriving user tasks from document usage patterns
A typical knowledge worker is involved in multiple tasks and switches frequently between them every work day. These frequent switches become expensive because each task switch req...
Oliver Brdiczka
WWW
2004
ACM
14 years 8 months ago
Tv2web: generating and browsing web with multiple lod from video streams and their metadata
We propose a method of automatically constructing Web content from video streams with metadata that we call TV2Web. The Web content includes thumbnails of video units and caption ...
Kazutoshi Sumiya, Mahendren Munisamy, Katsumi Tana...