Sciweavers

SAC
2011
ACM

Biomedical concept extraction based on combining the content-based and word order similarities

13 years 3 months ago
Biomedical concept extraction based on combining the content-based and word order similarities
It is well known that the main objective of conceptual retrieval models is to go beyond simple term matching by relaxing term independence assumption through concept recognition. In this paper, we present an approach of semantic indexing and retrieval of biomedical documents through the process of identifying domain concepts extracted from the Medical Subject Headings (MeSH) thesaurus. Our indexing approach relies on a purely statistical vector space model, which represents medical documents and MeSH concepts as term vectors. By leveraging a combination of the bag-of-word concept representation and word positions in the textual features, we demonstrate that our mapping method is able to extract valuable concepts from documents. The output of this semantic mapping serves as the input to our relevance document scoring in response to a query. Experiments on the OHSUMED collection show that our semantic indexing method significantly outperforms stateof-art baselines that employ word or t...
Duy Dinh, Lynda Tamine
Added 17 Sep 2011
Updated 17 Sep 2011
Type Journal
Year 2011
Where SAC
Authors Duy Dinh, Lynda Tamine
Comments (0)