Search Sciweavers | Sciweavers

117

CLEF
2010
Springer

164views Information Technology» more CLEF 2010»

Person Attribute Extraction from the Textual Parts of Web Pages

15 years 2 months ago

Download www.clef2010.org

We present the RGAI systems which participated in the third Web People Search Task challenge. The chief characteristics of our approach are that we focus on the raw textual parts o...

István Nagy, Richárd Farkas

claim paper

Read More »

116

Voted

KDD
2002
ACM

293views Data Mining» more KDD 2002»

Automatic Categorization of Web Pages and User Clustering with Mixtures of Hidden Markov Models

16 years 2 months ago

Download www.snn.ru.nl

We propose mixtures of hidden Markov models for modelling clickstreams of web surfers. Hence, the page categorization is learned from the data without the need for a (possibly cumb...

Alexander Ypma, Tom Heskes

claim paper

Read More »

102

click to vote

CBMS
2001
IEEE

109views Medical Imaging» more CBMS 2001»

Web Page Downloading and Classification

15 years 6 months ago

Download lhncbc.nlm.nih.gov

This paper describes the processes of downloading and classifying Web-based articles in online medical journals as a preliminary step to extracting bibliographic data to populate ...

Loc Q. Tran, Chan W. Moon, Daniel X. Le, George R....

claim paper

Read More »

99

click to vote

ACL
2006

85views Computational Linguistics» more ACL 2006»

Implementing a Characterization of Genre for Automatic Genre Identification of Web Pages

15 years 3 months ago

Download www.nltg.brighton.ac.uk

In this paper, we propose an implementable characterization of genre suitable for automatic genre identification of web pages. This characterization is implemented as an inferenti...

Marina Santini, Richard Power, Roger Evans

claim paper

Read More »

118

click to vote

IPM
2007

149views more IPM 2007»

Web page title extraction and its application

15 years 2 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...

Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers