A Large Scale Ranker-Based System for Search Query Spelling Correction

13 years 10 months ago

Download research.microsoft.com

This paper makes three significant extensions to a noisy channel speller designed for standard written text to target the challenging domain of search queries. First, the noisy channel model is subsumed by a more general ranker, which allows a variety of features to be easily incorporated. Second, a distributed infrastructure is proposed for training and applying Web scale n-gram language models. Third, a new phrase-based error model is presented. This model places a probability distribution over transformations between multi-word phrases, and is estimated using large amounts of query-correction pairs derived from search logs. Experiments show that each of these extensions leads to significant improvements over the state-of-the-art baseline methods.

Jianfeng Gao, Xiaolong Li, Daniel Micol, Chris Qui

Real-time Traffic

COLING 2010 | Computational Linguistics | Noisy Channel | Noisy Channel Model | Noisy Channel Speller |

claim paper

Post Info
More Details (n/a)

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Jianfeng Gao, Xiaolong Li, Daniel Micol, Chris Quirk, Xu Sun

Comments (0)

Sciweavers

A Large Scale Ranker-Based System for Search Query Spelling Correction

COLING 2010 | Computational Linguistics | Noisy Channel | Noisy Channel Model | Noisy Channel Speller |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers