Sciweavers

» Classifying Document Titles Based on Information Inference
HT
2005
ACM
14 years 2 months ago
As we may perceive: inferring logical documents from hypertext
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov
WWW
2002
ACM
14 years 9 months ago
Using web structure for classifying and describing web pages
The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...
Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...
AAAI
1998
13 years 10 months ago
Learning to Classify Text from Labeled and Unlabeled Documents
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
IPM
2006
13 years 8 months ago
Exploiting structural information for semi-structured document categorization
This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
Andrej Bratko, Bogdan Filipic
JIIS
2006
13 years 8 months ago
Using KCCA for Japanese-English cross-language information retrieval and document classification
Kernel Canonical Correlation Analysis (KCCA) is a method of correlating linear relationships between two variables in a kernel-defined feature space. A machine learning algorithm b...
Yaoyong Li, John Shawe-Taylor