Ranking Multilingual Documents Using Minimal Language Dependent Resources

14 years 10 months ago

Download web2py.iiit.ac.in

This paper proposes an approach of extracting simple and eﬀective features that enhances multilingual document ranking (MLDR). There is limited prior research on capturing the concept of multilingual document similarity in determining the ranking of documents. However, the literature available has worked heavily with language speciﬁc tools, making them hard to reimplement for other languages. Our approach extracts various multilingual and monolingual similarity features using a basic language resource (bilingual dictionary). No language-speciﬁc tools are used, hence making this approach extensible for other languages. We used the datasets provided by Forum for Information Retrieval Evaluation (FIRE) 1 for their 2010 Adhoc Cross-Lingual document retrieval task on Indian languages. Experiments have been performed with different ranking algorithms and their results are compared. The results obtained showcase the eﬀectiveness of the features considered in enhancing multilingual doc...

G. S. K. Santosh, N. Kiran Kumar, Vasudeva Varma

Real-time Traffic

Bilingual Dictionary | CICLING 2011 | Indian Languages | Information Retrieval Evaluation | Natural Language Processing |

claim paper

» Automatic Prior Art Searching and Patent Encoding at CLEFIP 10

» Keep It Simple Sheffield A KISS Approach to the Arabic Track

Post Info
More Details (n/a)

Added	25 Aug 2011
Updated	25 Aug 2011
Type	Journal
Year	2011
Where	CICLING
Authors	G. S. K. Santosh, N. Kiran Kumar, Vasudeva Varma

Comments (0)

Sciweavers

Ranking Multilingual Documents Using Minimal Language Dependent Resources

Bilingual Dictionary | CICLING 2011 | Indian Languages | Information Retrieval Evaluation | Natural Language Processing |

Explore & Download

Productivity Tools

Sciweavers