Our participation in the ImageCLEF Wikipedia retrieval task aims to study the efficiency of using two contextual factors in image retrieval: metadata which contains specific information about images, and textual content which contains general information about images. For this aim, the Lucene library is used for indexing and searching. We propose also to combine both factors using two different methods: one based on simple linear function and one based on scores comparison. In addition, a comparison between monolingual and multilingual image retrieval using queries in a single language (English) and queries in different language is done. Results show that the use of textual content is more useful then the use of metadata and the combination of both factors further improves results. In addition, the use of all provided languages exceeds over the use of only English langage.