Sciweavers

249 search results - page 17 / 50
» Classification of Documents Based on the Structure of Their ...
Sort
View
ICML
2005
IEEE
14 years 8 months ago
Learning hierarchical multi-category text classification models
We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is...
Craig Saunders, John Shawe-Taylor, Juho Rousu, S&a...
IJON
2006
146views more  IJON 2006»
13 years 7 months ago
Feature selection and classification using flexible neural tree
The purpose of this research is to develop effective machine learning or data mining techniques based on flexible neural tree FNT. Based on the pre-defined instruction/operator se...
Yuehui Chen, Ajith Abraham, Bo Yang
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 2 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
ICDAR
2003
IEEE
14 years 23 days ago
Recognition of On-line Handwritten Mathematical Formulas in the E-Chalk System
In this article, we present a system for the recognition of on-line handwritten mathematical formulas which is used in the electronic chalkboard (E-chalk), a multimedia system for...
Ernesto Tapia, Raúl Rojas
IPM
2007
149views more  IPM 2007»
13 years 7 months ago
Web page title extraction and its application
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...
Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...