A Bayesian Method for Robust Estimation of Distributional Similarities

15 years 4 months ago

Download www.aclweb.org

Existing word similarity measures are not robust to data sparseness since they rely only on the point estimation of words' context profiles obtained from a limited amount of data. This paper proposes a Bayesian method for robust distributional word similarities. The method uses a distribution of context profiles obtained by Bayesian estimation and takes the expectation of a base similarity measure under that distribution. When the context profiles are multinomial distributions, the priors are Dirichlet, and the base measure is the Bhattacharyya coefficient, we can derive an analytical form that allows efficient calculation. For the task of word similarity estimation using a large amount of Web data in Japanese, we show that the proposed measure gives better accuracies than other well-known similarity measures.

Jun'ichi Kazama, Stijn De Saeger, Kow Kuroda, Masa

Real-time Traffic

ACL 2010 | Computational Linguistics | Context Profiles | Similarity Measures | Word Similarity |

claim paper

» Robust estimation in Capital Asset Pricing Model

» Robust imputation method for missing values in microarray data

» Distance Learning for Similarity Estimation

» Estimating HeavyTail Exponents Through Max SelfSimilarity

» Variational methods for spectral unmixing of hyperspectral images

» A New Study on Distance Metrics as Similarity Measurement

» On Monte Carlo methods for Bayesian multivariate regression models with heavytailed errors

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Jun'ichi Kazama, Stijn De Saeger, Kow Kuroda, Masaki Murata, Kentaro Torisawa

Comments (0)

Sciweavers

A Bayesian Method for Robust Estimation of Distributional Similarities

ACL 2010 | Computational Linguistics | Context Profiles | Similarity Measures | Word Similarity |

Explore & Download

Productivity Tools

Sciweavers