Sciweavers

708 search results - page 26 / 142
» Identifying Content Blocks from Web Documents
Sort
View
SIGIR
2008
ACM
13 years 8 months ago
Classifiers without borders: incorporating fielded text from neighboring web pages
Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...
Xiaoguang Qi, Brian D. Davison
HICSS
2005
IEEE
150views Biometrics» more  HICSS 2005»
14 years 2 months ago
Collaborative Authoring on the Web: A Genre Analysis of Online Encyclopedias
This paper presents the results of a genre analysis of two web-based collaborative authoring environments, Wikipedia and Everything2, both of which are intended as repositories of...
William G. Emigh, Susan C. Herring
ICDE
2000
IEEE
99views Database» more  ICDE 2000»
14 years 10 months ago
XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources
This paper describes the methodology and the software development of XWRAP, an XML-enabled wrapper construction system for semi-automatic generation of wrapper programs. By XML-ena...
Ling Liu, Calton Pu, Wei Han
PST
2008
13 years 10 months ago
An Effective Defense against Intrusive Web Advertising
Intrusive Web advertising such as pop-ups and animated layer ads, which distract the user from reading or navigating through the main content of Web pages, is being perceived as a...
Viktor Krammer
WWW
2004
ACM
14 years 9 months ago
Using urls and table layout for web classification tasks
We propose new features and algorithms for automating Web-page classification tasks such as content recommendation and ad blocking. We show that the automated classification of We...
L. K. Shih, David R. Karger