Sciweavers

SPIRE
1999
Springer

Effects of Term Segmentation on Chinese/English Cross-Language Information Retrieval

The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European languages. CLIR problems that involve East Asian languages such as Chinese introduce additional challenges, because written Chinese text lacks explicit boundaries between terms. This paper examines three Chinese segmentation techniques in combination with two variants of dictionary-based Chinese-to-English query translation. The results indicate that failure to segment terms correctly, particularly technical terms and names, can have a cascading effect that reduces retrieval effectiveness. Task-tuned segmentation algorithms and alternative term weighting strategies are suggested as productive directions for future work.
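The cascading effect described in the abstract can be illustrated with a small sketch. The abstract does not name the three segmentation techniques or the two translation variants, so the code below is not the authors' method: it assumes forward longest-match segmentation and single-character segmentation as stand-ins, and the dictionary `TOY_DICT` and all function names are hypothetical, chosen only for illustration.

```python
# Toy bilingual dictionary (hypothetical entries, for illustration only).
TOY_DICT = {
    "信息": ["information"],
    "检索": ["retrieval"],
    "信": ["letter", "trust"],
    "息": ["breath", "interest"],
    "检": ["inspect"],
    "索": ["rope", "search"],
}


def segment_longest_match(text, lexicon, max_len=4):
    """Forward maximum matching: at each position, take the longest
    lexicon entry; fall back to a single character if nothing matches."""
    terms, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                terms.append(candidate)
                i += size
                break
    return terms


def segment_characters(text):
    """No real segmentation: treat every character as its own term."""
    return list(text)


def translate(terms, dictionary):
    """Dictionary-based translation: replace each term with all of its
    listed translations; keep terms that have no entry unchanged."""
    query = []
    for term in terms:
        query.extend(dictionary.get(term, [term]))
    return query


chinese_query = "信息检索"
well_segmented = translate(segment_longest_match(chinese_query, TOY_DICT), TOY_DICT)
unsegmented = translate(segment_characters(chinese_query), TOY_DICT)

print(well_segmented)
print(unsegmented)
```

With correct term boundaries, the translated query contains only the intended English terms. With character-level boundaries, every character is looked up on its own, so the English query fills with unrelated senses. That is one way a segmentation error can propagate into translation and then into retrieval.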
Douglas W. Oard, Jianqiang Wang
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1999
Where SPIRE
Authors Douglas W. Oard, Jianqiang Wang