Information retrieval needs to match relevant texts with a given query. Selecting appropriate parts is useful when documents are long, and only portions are interesting to the user...
The bag of words representation (BoW), which is widely used in information retrieval (IR), represents documents and queries as word lists that do not express anything about context...
This paper proposes a Japanese/English crosslanguage information retrieval (CLIR) system targeting technical documents. Our system first translates a given query containing techni...
Abstract. Nowadays, multimedia documents composed of text and images are increasingly used, thanks to the Internet and the increasing capacity of data storage. It is more and more ...
The indexation of documents is a critical step of the information retrieval process and is often a manual task which highly depends on the indexer’s knowledge. We propose to imp...