Sciweavers

» Web page classification with heterogeneous data fusion
WWW
2006
ACM
14 years 8 months ago
Detecting spam web pages through content analysis
In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...
Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...
WEBDB
2000
Springer
13 years 11 months ago
Modeling Data Entry and Operations in WebML
Web Modeling Language (WebML, http://webml.org) is a notation for visually specifying complex Web sites at the conceptual level. All the concepts of WebML are specified both graph...
Aldo Bongio, Stefano Ceri, Piero Fraternali, Andre...
ERCIMDL
2005
Springer
14 years 1 months ago
Importance of HTML Structural Elements and Metadata in Automated Subject Classification
The aim of the study was to determine how significance indicators assigned to different Web page elements (internal metadata, title, headings, and main text) influence automated cl...
Koraljka Golub, Anders Ardö
WWW
2005
ACM
14 years 8 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
ECML
2005
Springer
14 years 1 months ago
Learning from Positive and Unlabeled Examples with Different Data Distributions
We study the problem of learning from positive and unlabeled examples. Although several techniques exist for dealing with this problem, they all assume that positive exam...
Xiaoli Li, Bing Liu