Extending an on-line information site with accurate domain-dependent extracts from the World Wide Web