Search Sciweavers | Sciweavers

87 search results - page 5 / 18

» Document zone content classification and its performance eva...

195

Voted

AUSAI
2001
Springer

102views Artificial Intelligence» more AUSAI 2001»

Fast Text Classification Using Sequential Sampling Processes

15 years 11 months ago

Download www.cs.iastate.edu

A central problem in information retrieval is the automated classification of text documents. While many existing methods achieve good levels of performance, they generally require...

Michael D. Lee

claim paper

Read More »

226

Voted

SIGIR
2008
ACM

133views Information Technology» more SIGIR 2008»

Classifiers without borders: incorporating fielded text from neighboring web pages

15 years 7 months ago

Download www.cse.lehigh.edu

Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...

Xiaoguang Qi, Brian D. Davison

claim paper

Read More »

189

click to vote

IIWAS
2008

160views Internet Technology» more IIWAS 2008»

Combining content extraction heuristics: the CombinE system

15 years 8 months ago

Download www.informatik.uni-mainz.de

The main text content of an HTML document on the WWW is typically surrounded by additional contents, such as navigation menus, advertisements, link lists or design elements. Conte...

Thomas Gottron

claim paper

Read More »

210

click to vote

WWW
2008
ACM

176views Internet Technology» more WWW 2008»

Learning to rank relational objects and its application to web search

16 years 8 months ago

Download www2008.org

Learning to rank is a new statistical learning technology on creating a ranking model for sorting objects. The technology has been successfully applied to web search, and is becom...

Tao Qin, Tie-Yan Liu, Xu-Dong Zhang, De-Sheng Wang...

claim paper

Read More »

210

Voted

WWW
2006
ACM

116views Internet Technology» more WWW 2006»

A content and structure website mining model

16 years 8 months ago

Download www2006.org

We present a novel model for validating and improving the content and structure organization of a website. This model studies the website as a graph and evaluates its interconnect...

Barbara Poblete, Ricardo A. Baeza-Yates

claim paper

Read More »

« Prev « First page 5 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers