Using Nearest Neighbor Information to Improve Cross-Language Text Classification



Cross-language text classification (CLTC) aims to take advantage of existing training data in one language to construct a classifier for another language. In addition to the expected translation issues, CLTC is complicated by the cultural distance between the two languages, which can cause documents belonging to the same category to concern very different topics. This paper proposes a re-classification method whose purpose is to reduce the errors caused by this phenomenon by considering information from the target-language documents themselves. Experimental results on a news corpus, covering three language pairs and four categories, demonstrate the appropriateness of the proposed method, which improved the initial classification accuracy by up to 11%.
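The abstract does not spell out the re-classification procedure, so the following is only a minimal sketch of one plausible reading: each target-language document keeps or changes its initial label according to a majority vote over itself and its nearest neighbors among the other target-language documents. The function names (`cosine`, `reclassify`), the bag-of-words cosine similarity, the parameters `k` and `iterations`, and the tie-breaking rule are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(v * b.get(t, 0) for t, v in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def reclassify(docs, initial_labels, k=2, iterations=1):
    """Relabel each document by majority vote over itself and its k nearest
    neighbors in the same (target-language) collection.

    Hypothetical helper: the voting scheme and defaults are assumptions.
    """
    vectors = [Counter(d.lower().split()) for d in docs]
    labels = list(initial_labels)
    for _ in range(iterations):
        new_labels = []
        for i, vec in enumerate(vectors):
            # Rank every other document by similarity to document i.
            sims = sorted(
                ((cosine(vec, other), j)
                 for j, other in enumerate(vectors) if j != i),
                reverse=True,
            )
            neighbors = [j for _, j in sims[:k]]
            # The document's own current label counts as one vote.
            votes = Counter([labels[i]] + [labels[j] for j in neighbors])
            winner, best = votes.most_common(1)[0]
            # Assumed tie-break: keep the current label if it ties for first.
            new_labels.append(labels[i] if votes[labels[i]] == best else winner)
        labels = new_labels
    return labels
```

In this sketch, a document whose initial label disagrees with both of its nearest neighbors is flipped to their label, while a document that merely has one disagreeing neighbor keeps its own. A real system would likely use weighted similarities and a tuned `k`; those choices are left open here.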

Adelina Escobar-Acevedo, Manuel Montes-y-Gómez, Luis Villaseñor Pineda


Cross-language Text Classification | Initial Classification Accuracy | MICAI 2009 | Target Language Documents


» Improved kNN Algorithm for Text Classification

» A Fast KNN Algorithm Based on Simulated Annealing

» Multilingual document clusters discovery

» K-Local Hyperplane and Convex Distance Nearest Neighbor Algorithms

» A Regression-based K nearest neighbor algorithm for gene function prediction from heterogen...

» Classifiers without borders: incorporating fielded text from neighboring web pages

» Improving Retrieval Effectiveness by Reranking Documents Based on Controlled Vocabulary

» A comparative study on two large-scale hierarchical text classification tasks solutions

Post Info

Added	26 Jul 2010
Updated	26 Jul 2010
Type	Conference
Year	2009
Where	MICAI
Authors	Adelina Escobar-Acevedo, Manuel Montes-y-Gómez, Luis Villaseñor Pineda


Sciweavers
