Determining Text Databases to Search in the Internet

15 years 11 months ago

Download www.vldb.org

Text data in the Internet can be partitioned into many databases naturally. Efficient retrieval of desired data can be achieved if we can accurately predict the usefulness of each database, because with such information, we only need to retrieve potentially useful documents from useful databases. In this paper, we propose two new methods for estimating the usefulness of text databases. For a given query, the usefulness of a text database in this paper is defined to be the number of documents in the database that are sufficiently similar to the query. Such a usefulness measure enables naive-users to make informed decision about which databases to search. We also consider the collection fusion problem. Because local databases may employ similarity functions that are different from that used by the global database, the threshold used by a local database to determine whether a document is potentially useful may be different from that used by the global database. We provide techniques that...

Weiyi Meng, King-Lup Liu, Clement T. Yu, Xiaodong

Real-time Traffic

Database | Local Database | Text Database | VLDB 1998 |

claim paper

» An Evaluation and Comparison of Current PeertoPeer FullText Keyword Search Techniques

» A webbased kernel function for measuring the similarity of short text snippets

» Classificationaware hiddenweb text database selection

» Tree patterns with Full Text Search

» A case for query by image and text content searching computer help using screenshots and k...

» StoryUpgrade Finding Stories in Internet Weblogs

» KID an algorithm for fast and efficient text mining used to automatically generate a data...

Post Info
More Details (n/a)

Added	06 Aug 2010
Updated	06 Aug 2010
Type	Conference
Year	1998
Where	VLDB
Authors	Weiyi Meng, King-Lup Liu, Clement T. Yu, Xiaodong Wang, Yuhsi Chang, Naphtali Rishe

Comments (0)

Sciweavers

Determining Text Databases to Search in the Internet

Database | Local Database | Text Database | VLDB 1998 |

Explore & Download

Productivity Tools

Sciweavers