Learning Noun Phrase Query Segmentation

14 years 1 months ago

Download www.aclweb.org

Query segmentation is the process of taking a user’s search-engine query and dividing the tokens into individual phrases or semantic units. Identiﬁcation of these query segments can potentially improve both document-retrieval precision, by ﬁrst returning pages which contain the exact query segments, and document-retrieval recall, by allowing query expansion or substitution via the segmented units. We train and evaluate a machine-learned query segmentation system that achieves 86% segmentationdecision accuracy on a gold standard set of segmented noun phrase queries, well above recently published approaches. Key enablers of this high performance are features derived from previous natural language processing work in noun compound bracketing. For example, token association features beyond simple N-gram counts provide powerful indicators of segmentation.

Shane Bergsma, Qin Iris Wang

Real-time Traffic

EMNLP 2007 | Natural Language Processing | Query Segmentation | Query Segments | User’s Search-engine Query |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	EMNLP
Authors	Shane Bergsma, Qin Iris Wang

Comments (0)

Sciweavers

Learning Noun Phrase Query Segmentation

EMNLP 2007 | Natural Language Processing | Query Segmentation | Query Segments | User’s Search-engine Query |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers