The document as an ergodic markov chain

14 years 6 months ago

Download www.comp.rgu.ac.uk

In recent years, statistical language models are being proposed as alternative to the vector space model. Viewing documents as language samples introduces the issue of deﬁning a joint probability distribution over the terms. The present paper models a document as the result of a Markov process. It argues that this process is ergodic, which is theoretically plausible, and easy to verify in practice. The theoretical result is that the joint distribution can be easily obtained. This can also be applied for search resolutions other than the document level. We veriﬁed this in an experiment on query expansion demonstrating both the validity and the practicability of the method. This holds a promise for general language models. Categories and Subject Descriptors H.3.3 [Information Search and Retrieval] General Terms Theory, Algorithms Keywords Language models, ergodic process, semantic space.

Eduard Hoenkamp, Dawei Song

Real-time Traffic

Joint Probability Distribution | Language Models | SIGIR 2004 | Statistical Language Models |

claim paper

Post Info
More Details (n/a)

Added	30 Jun 2010
Updated	30 Jun 2010
Type	Conference
Year	2004
Where	SIGIR
Authors	Eduard Hoenkamp, Dawei Song

Comments (0)

Sciweavers

The document as an ergodic markov chain

Joint Probability Distribution | Language Models | SIGIR 2004 | Statistical Language Models |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers