Acquiring Word Similarities with Higher Order Association Mining

16 years 23 days ago

Download www.comp.rgu.ac.uk

We present a novel approach to mine word similarity in Textual Case Based Reasoning. We exploit indirect associations of words, in addition to direct ones for estimating their similarity. If word A co-occurs with word B, we say A and B share a first order association between them. If A co-occurs with B in some documents, and B with C in some others, then A and C are said to share a second order co-occurrence via B. Higher orders of co-occurrence may similarly be defined. In this paper we present algorithms for mining higher order co-occurrences. A weighted linear model is used to combine the contribution of these higher orders into a word similarity model. Our experimental results demonstrate significant improvements compared to similarity models based on first order co-occurrences alone. Our approach also outperforms state-of-the-art techniques like SVM and LSI in classification tasks of varying complexity.

Sutanu Chakraborti, Nirmalie Wiratunga, Robert Lot

Real-time Traffic

ICCBR 2007 | Order Co-occurrences | Similarity Models | Word Similarity |

claim paper

» Building semantic kernels for text classification using wikipedia

» Structureoriented contour representation and matching for engineering shapes

» On External Indices for Mixtures Validating Mixtures of Genes

» A modelindependent approach to infer hierarchical codon substitution dynamics

Post Info
More Details (n/a)

Added	08 Jun 2010
Updated	08 Jun 2010
Type	Conference
Year	2007
Where	ICCBR
Authors	Sutanu Chakraborti, Nirmalie Wiratunga, Robert Lothian, Stuart N. K. Watt

Comments (0)

Sciweavers

Acquiring Word Similarities with Higher Order Association Mining

ICCBR 2007 | Order Co-occurrences | Similarity Models | Word Similarity |

Explore & Download

Productivity Tools

Sciweavers