Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

144

WEBI
2005
Springer

89views Internet Technology» more WEBI 2005»

An EM Based Training Algorithm for Cross-Language Text Categorization

16 years 2 days ago

An EM Based Training Algorithm for Cross-Language Text Categorization

Download www.cs.uic.edu

Due to the globalization on the Web, many companies and institutions need to efﬁciently organize and search repositories containing multilingual documents. The management of these heterogeneous text collections increases the costs signiﬁcantly because experts of different languages are required to organize these collections. CrossLanguage Text Categorization can provide techniques to extend existing automatic classiﬁcation systems in one language to new languages without requiring additional intervention of human experts. In this paper we propose a learning algorithm based on the EM scheme which can be used to train text classiﬁers in a multilingual environment. In particular, in the proposed approach, we assume that a predeﬁned category set and a collection of labeled train

Leonardo Rigutini, Marco Maggini, Bing Liu

Real-time Traffic

CrossLanguage Text Categorization | Heterogeneous Text Collections | Internet Technology | Multilingual Documents | WEBI 2005 |

claim paper

Related Content

» Cross Language Text Classification by Model Translation and SemiSupervised Learning

» CrossLanguage Frame Semantics Transfer in Bilingual Corpora

» A SemiSupervised Document Clustering Algorithm Based on EM

» Combining Labeled and Unlabeled Data for MultiClass Text Categorization

» Combining ILP with Semisupervised Learning for Web Page Categorization

» Text Classification from Labeled and Unlabeled Documents using EM

» Semisupervised Text Classification Using Partitioned EM

» Multilingual document clusters discovery

» Author Identification Using a Tensor Space Representation

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	WEBI
Authors	Leonardo Rigutini, Marco Maggini, Bing Liu

Comments (0)