Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Expansion

14 years 9 months ago

Download fidelia1.free.fr

Abstract. In this paper, we target document ranking in a highly technical ﬁeld with the aim to approximate a ranking that is obtained through an existing ontology (knowledge structure). We test and combine symbolic and vector space models (VSM). Our symbolic approach relies on shallow NLP and on internal linguistic relations between Multi-Word Terms (MWTs). Documents are ranked based on diﬀerent semantic relations they share with the query terms, either directly or indirectly after clustering the MWTs using the identiﬁed lexico-semantic relations. The VSM approach consisted in ranking documents with diﬀerent functions ranging from the classical tf.idf to more elaborate similarity functions. Results shows that the ranking obtained by the symbolic approach performs better on most queries than the vector space model. However, the ranking obtained by combining both approaches outperforms by a wide margin the results obtained by methods from each approach.

Eric SanJuan, Fidelia Ibekwe-Sanjuan, Juan Manuel

Real-time Traffic

Information System | NLDB 2007 | Ranking | Symbolic Approach | Vector Space Model |

claim paper

Post Info
More Details (n/a)

Added	08 Jun 2010
Updated	08 Jun 2010
Type	Conference
Year	2007
Where	NLDB
Authors	Eric SanJuan, Fidelia Ibekwe-Sanjuan, Juan Manuel Torres Moreno, Patricia Velázquez-Morales

Comments (0)

Sciweavers

Combining Vector Space Model and Multi Word Term Extraction for Semantic Query Expansion

Information System | NLDB 2007 | Ranking | Symbolic Approach | Vector Space Model |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers