Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

188

SIGIR
2002
ACM

124views Information Technology» more SIGIR 2002»

Empirical studies in strategies for Arabic retrieval

15 years 6 months ago

Empirical studies in strategies for Arabic retrieval

Download www.isi.edu

This work evaluates a few search strategies for Arabic monolingual and cross-lingual retrieval, using the TREC Arabic corpus as the test-bed. The release by NIST in 2001 of an Arabic corpus of nearly 400k documents with both monolingual and cross-lingual queries and relevance judgments has been a new enabler for empirical studies. Experimental results show that spelling normalization and stemming can significantly improve Arabic monolingual retrieval. Character tri-grams from stems improved retrieval modestly on the test corpus, but the improvement is not statistically significant. To further improve retrieval, we propose a novel thesaurus-based technique. Different from existing approaches to thesaurus-based retrieval, ours formulates word synonyms as probabilistic term translations that can be automatically derived from a parallel corpus. Retrieval results show that the thesaurus can significantly improve Arabic monolingual retrieval. For cross-lingual retrieval (CLIR), we found tha...

Jinxi Xu, Alexander Fraser, Ralph M. Weischedel

Real-time Traffic

Arabic Monolingual Retrieval | Cross-lingual Retrieval | Information Technology | SIGIR 2002 | Thesaurus-based Retrieval |

claim paper

Related Content

» An empirical study of tokenization strategies for biomedical information retrieval

» CrossLanguage and CrossMedia Image Retrieval An Empirical Study at ImageCLEF2007

» Easy on that trigger dad a study of long term family photo retrieval

» CUHK at ImageCLEF 2005 CrossLanguage and CrossMedia Image Retrieval

» Multimodal conceptdependent active learning for image retrieval

» Statistical query expansion for sentence retrieval and its effects on weak and strong quer...

» An analysis on document length retrieval trends in language modeling smoothing

» Engineering a Fast Online Persistent Suffix Tree Construction

» A Reexamination of Query Expansion Using Lexical Resources

Post Info
More Details (n/a)

Added	23 Dec 2010
Updated	23 Dec 2010
Type	Journal
Year	2002
Where	SIGIR
Authors	Jinxi Xu, Alexander Fraser, Ralph M. Weischedel

Comments (0)