
ICMCS
1999
IEEE

Integrating Web Resources and Lexicons into a Natural Language Query System

The START system responds to natural language queries with answers in text, pictures, and other media. START's sentence-level natural language parsing relies on a number of mechanisms to help it process the huge, diverse resources available on the World Wide Web. Blitz, a hybrid heuristic- and corpus-based natural language preprocessor, enables START to integrate a large and ever-changing lexicon of proper names, by using heuristic rules and precompiled tables of symbols to preprocess various highly regular and fixed expressions into lexical tokens. LaMeTH, a content-based system for extracting information from HTML documents, assists START by providing a uniform method of accessing information on the Web in real time. These mechanisms have considerably improved START's ability to analyze real-world sentences and answer queries through expansion of its lexicon and integration of Web resources.
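The preprocessing idea described in the abstract (heuristic rules plus precompiled symbol tables that collapse fixed expressions into single lexical tokens) can be illustrated with a minimal Python sketch. This is not Blitz's actual implementation: the table contents, the date rule, the token naming scheme, and the function name `preprocess` are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical precompiled table of multi-word proper names (illustrative only).
NAME_TABLE = {
    ("new", "york", "city"): "New_York_City",
    ("world", "wide", "web"): "World_Wide_Web",
    ("boris", "katz"): "Boris_Katz",
}
MAX_NAME_LEN = max(len(key) for key in NAME_TABLE)

# Hypothetical heuristic rule for one highly regular expression type:
# dates written as "3 Aug 2010".
DATE_RE = re.compile(
    r"\b(\d{1,2}) (Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec) (\d{4})\b"
)


def preprocess(sentence):
    """Collapse dates and known multi-word names into single tokens."""
    # Heuristic pass: rewrite each date as one token, e.g. DATE_2010_Aug_3.
    sentence = DATE_RE.sub(
        lambda m: "DATE_%s_%s_%s" % (m.group(3), m.group(2), m.group(1)),
        sentence,
    )
    # Split into word tokens and single punctuation marks.
    words = re.findall(r"\w+|[^\w\s]", sentence)

    tokens = []
    i = 0
    while i < len(words):
        # Table pass: try the longest candidate name first.
        for n in range(min(MAX_NAME_LEN, len(words) - i), 1, -1):
            key = tuple(w.lower() for w in words[i:i + n])
            if key in NAME_TABLE:
                tokens.append(NAME_TABLE[key])
                i += n
                break
        else:
            # No multi-word name starts here; keep the word unchanged.
            tokens.append(words[i])
            i += 1
    return tokens
```

A downstream parser would then see each recognized name or date as one lexical item rather than several separate words, which is the effect the abstract attributes to the preprocessing step.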
Boris Katz, Deniz Yuret, Jimmy J. Lin, Sue Felshin
Type Conference
Year 1999
Where ICMCS
Authors Boris Katz, Deniz Yuret, Jimmy J. Lin, Sue Felshin, Rebecca Schulman, Adnan Ilik, Ali Ibrahim, Philip Osafo-Kwaako