Search Sciweavers | Sciweavers

1149 search results - page 8 / 230

» Classification of Web Documents Using a Graph Model

202

click to vote

WWW
2002
ACM

130views Internet Technology» more WWW 2002»

Using web structure for classifying and describing web pages

16 years 7 months ago

Download dpennock.com

The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...

Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...

claim paper

Read More »

170

click to vote

MLDM
2007
Springer

119views Machine Learning» more MLDM 2007»

PE-PUC: A Graph Based PU-Learning Approach for Text Classification

16 years 23 days ago

Download dm.thss.tsinghua.edu.cn

This paper presents a novel solution for the problem of building text classifier using positive documents (P) and unlabeled documents (U). Here, the unlabeled documents are mixed w...

Shuang Yu, Chunping Li

claim paper

Read More »

190

click to vote

KES
2006
Springer

205views Information Technology» more KES 2006»

Integrated Document Browsing and Data Acquisition for Building Large Ontologies

15 years 6 months ago

Download rewerse.net

Named entities (e.g., "Kofi Annan", "Coca-Cola", "Second World War") are ubiquitous in web pages and other types of document and often provide a simpl...

Felix Weigel, Klaus U. Schulz, Levin Brunner, Edua...

claim paper

Read More »

169

click to vote

LAWEB
2006
IEEE

85views Internet Technology» more LAWEB 2006»

Where and How Duplicates Occur in the Web

16 years 19 days ago

Download homepages.dcc.ufmg.br

In this paper we study duplicates on the Web, using collections containing documents of all sites under the .cl domain that represent accurate and representative subsets of the We...

Álvaro R. Pereira Jr., Ricardo A. Baeza-Yat...

claim paper

Read More »

157

click to vote

WWW
2001
ACM

131views Internet Technology» more WWW 2001»

On integrating catalogs

16 years 7 months ago

Download rakesh.agrawal-family.com

We address the problem of integrating documents from different sources into a master catalog. This problem is pervasive in web marketplaces and portals. Current technology for aut...

Rakesh Agrawal, Ramakrishnan Srikant

claim paper

Read More »

« Prev « First page 8 / 230 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers