Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

193

Voted

ICMLA
2008

131views Machine Learning» more ICMLA 2008»

Text Classification Using Tree Kernels and Linguistic Information

15 years 8 months ago

Text Classification Using Tree Kernels and Linguistic Information

Download www.di.uevora.pt

Standard Machine Learning approaches to text classification use the bag-of-words representation of documents to deceive the classification target function. Typical linguistic structures such as morphology, syntax and semantic are completely ignored in the learning process. This paper examines the role of these structures on the classifier construction applying the study to the Portuguese language. Classifiers are built using the SVM algorithm on a newspaper's articles dataset. The results show that syntactic structure is not useful for text classification (as initially expected), but a novel structured representation that uses document's semantic information has the same discriminative power over classes as the traditional bag-of-words one.

Teresa Gonçalves, Paulo Quaresma

Real-time Traffic

Classification Target Function | ICMLA 2008 | Machine Learning | Text Classification | Typical Linguistic Structures |

claim paper

Related Content

» Composite Kernels For Relation Extraction

» Question classification using support vector machines

» Kernel methods syntax and semantics for relational text categorization

» Complex Linguistic Features for Text Classification A Comprehensive Study

» Automatic analysis of semantic similarity in comparable text through syntactic tree matchi...

» Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees

» A Semantic Kernel to Exploit Linguistic Knowledge

» Text classification with kernels on the multinomial manifold

» Kernels on Linguistic Structures for Answer Extraction

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	ICMLA
Authors	Teresa Gonçalves, Paulo Quaresma

Comments (0)