Effective shrinkage of large multi-class linear svm models for text categorization

14 years 9 months ago

Download figment.cse.usf.edu

When linear support vector machines (SVMs) are applied to multi-class text categorization in industry, the size of the linear SVM model is very large, usually greater than several gigabytes. As a result, the model cannot directly ﬁt into the computer memory and the classiﬁcation process is slow. In this paper, a novel method based on vector norm is proposed to shrink the model size signiﬁcantly without sacriﬁcing the classiﬁcation accuracy. Also, we propose a cache-efﬁcient implementation of multi-class linear SVMs in the classiﬁcation phase. Our experimental results have shown that on Yahoo-Korea dataset the proposed method can shrink the model size from 5.2 gigabytes to 260 megabytes and the efﬁcient implementation of linear SVM has obtained a speedup factor of 44.

Jian-xiong Dong, Ching Y. Suen, Adam Krzyzak

Real-time Traffic

Computer Vision | ICPR 2008 | Linear Svm | Linear Svm Model | Model Size |

claim paper

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICPR
Authors	Jian-xiong Dong, Ching Y. Suen, Adam Krzyzak

Comments (0)

Sciweavers

Effective shrinkage of large multi-class linear svm models for text categorization

Computer Vision | ICPR 2008 | Linear Svm | Linear Svm Model | Model Size |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers