Search Sciweavers | Sciweavers

106

WWW
2008
ACM

95views Internet Technology» more WWW 2008»

Representing a web page as sets of named entities of multiple types: a model and some preliminary applications

16 years 3 months ago

Download www2008.org

As opposed to representing a document as a "bag of words" in most information retrieval applications, we propose a model of representing a web page as sets of named enti...

Nan Di, Conglei Yao, Mengcheng Duan, Jonathan J. H...

claim paper

Read More »

93

click to vote

WWW
2007
ACM

138views Internet Technology» more WWW 2007»

Web page classification with heterogeneous data fusion

16 years 3 months ago

Download www.cse.cuhk.edu.hk

Web pages are more than text and they contain much contextual and structural information, e.g., the title, the meta data, the anchor text, etc., each of which can be seen as a dat...

Zenglin Xu, Irwin King, Michael R. Lyu

claim paper

Read More »

117

click to vote

HICSS
2008
IEEE

105views Biometrics» more HICSS 2008»

Using Visual Features for Fine-Grained Genre Classification of Web Pages

15 years 9 months ago

Download csdl2.computer.org

The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...

Ryan Levering, Michal Cutler, Lei Yu

claim paper

Read More »

101

click to vote

WWW
2007
ACM

150views Internet Technology» more WWW 2007»

Adaptive record extraction from web pages

16 years 3 months ago

Download www2007.org

We describe an adaptive method for extracting records from web pages. Our algorithm combines a weighted tree matching metric with clustering for obtaining data extraction patterns...

Justin Park, Denilson Barbosa

claim paper

Read More »

92

click to vote

ACSAC
2006
IEEE

142views Security Privacy» more ACSAC 2006»

Anomaly Based Web Phishing Page Detection

15 years 8 months ago

Download www.mysmu.edu

Many anti-phishing schemes have recently been proposed in literature. Despite all those efforts, the threat of phishing attacks is not mitigated. One of the main reasons is that p...

Ying Pan, Xuhua Ding

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers