Sciweavers

2386 search results - page 422 / 478
» Combining Information Retrieval with Information Extraction ...
SIGMOD
2007
ACM
Database
A random walk approach to sampling hidden databases
A large part of the data on the World Wide Web is hidden behind form-like interfaces. These interfaces interact with a hidden backend database to provide answers to user queries. ...
Arjun Dasgupta, Gautam Das, Heikki Mannila
SIGMOD
2001
ACM
Database
XML Document Versioning
Managing multiple versions of XML documents represents an important problem, because of many applications ranging from traditional ones, such as software configuration control, to...
Shu-Yao Chien, Vassilis J. Tsotras, Carlo Zaniolo
SDM
2009
SIAM
Data Mining
Straightforward Feature Selection for Scalable Latent Semantic Indexing
Latent Semantic Indexing (LSI) has been validated to be effective on many small scale text collections. However, little evidence has shown its effectiveness on unsampled large sca...
Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen
WEBI
2009
Springer
Learning Deep Web Crawling with Diverse Features
The key to Deep Web crawling is to submit promising keywords to query form and retrieve Deep Web content efficiently. To select keywords, existing methods make a decision based ...
Lu Jiang, Zhaohui Wu, Qinghua Zheng, Jun Liu
ICDE
2007
IEEE
Database
Hierarchical Temporal Association Mining for Video Event Detection in Video Databases
With the proliferation of multimedia data and ever-growing requests for multimedia applications, new challenges have emerged for efficiently and effectively managing and accessing large...
Min Chen, Shu-Ching Chen, Mei-Ling Shyu