Sciweavers

2553 search results - page 84 / 511
» How-To Web Pages
Sort
View
WWW
2008
ACM
16 years 3 months ago
Representing a web page as sets of named entities of multiple types: a model and some preliminary applications
As opposed to representing a document as a "bag of words" in most information retrieval applications, we propose a model of representing a web page as sets of named enti...
Nan Di, Conglei Yao, Mengcheng Duan, Jonathan J. H...
WWW
2007
ACM
16 years 3 months ago
Web page classification with heterogeneous data fusion
Web pages are more than text and they contain much contextual and structural information, e.g., the title, the meta data, the anchor text, etc., each of which can be seen as a dat...
Zenglin Xu, Irwin King, Michael R. Lyu
HICSS
2008
IEEE
105views Biometrics» more  HICSS 2008»
15 years 9 months ago
Using Visual Features for Fine-Grained Genre Classification of Web Pages
The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...
Ryan Levering, Michal Cutler, Lei Yu
WWW
2007
ACM
16 years 3 months ago
Adaptive record extraction from web pages
We describe an adaptive method for extracting records from web pages. Our algorithm combines a weighted tree matching metric with clustering for obtaining data extraction patterns...
Justin Park, Denilson Barbosa
ACSAC
2006
IEEE
15 years 8 months ago
Anomaly Based Web Phishing Page Detection
Many anti-phishing schemes have recently been proposed in literature. Despite all those efforts, the threat of phishing attacks is not mitigated. One of the main reasons is that p...
Ying Pan, Xuhua Ding