Sciweavers

373 search results - page 10 / 75
» Correcting the Document Layout: A Machine Learning Approach
Sort
View
AI
2009
Springer
14 years 3 months ago
An Iterative Hybrid Filter-Wrapper Approach to Feature Selection for Document Clustering
The manipulation of large-scale document data sets often involves the processing of a wealth of features that correspond with the available terms in the document space. The employm...
Mohammad-Amin Jashki, Majid Makki, Ebrahim Bagheri...
CIKM
2004
Springer
14 years 2 months ago
Hierarchical document categorization with support vector machines
Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a crucial step in knowledge and content management. Standard machine learning techniques ...
Lijuan Cai, Thomas Hofmann
CICLING
2005
Springer
14 years 2 months ago
A Machine Learning Approach to Information Extraction
Information extraction is concerned with applying natural language processing to automatically extract the essential details from text documents. A great disadvantage of current ap...
Alberto Téllez-Valero, Manuel Montes-y-G&oa...
ICML
1998
IEEE
14 years 9 months ago
Employing EM and Pool-Based Active Learning for Text Classification
This paper shows how a text classifier's need for labeled training documents can be reduced by taking advantage of a large pool of unlabeled documents. We modify the Query-by...
Andrew McCallum, Kamal Nigam
MLDM
2007
Springer
14 years 3 months ago
PE-PUC: A Graph Based PU-Learning Approach for Text Classification
This paper presents a novel solution for the problem of building text classifier using positive documents (P) and unlabeled documents (U). Here, the unlabeled documents are mixed w...
Shuang Yu, Chunping Li