Sciweavers

513 search results - page 70 / 103
» Cache Conscious Algorithms for Relational Query Processing
Sort
View
ICDE
2010
IEEE
204views Database» more  ICDE 2010»
14 years 4 months ago
ProbClean: A probabilistic duplicate detection system
— One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
TOIS
2008
145views more  TOIS 2008»
13 years 9 months ago
Classification-aware hidden-web text database selection
Many valuable text databases on the web have non-crawlable contents that are "hidden" behind search interfaces. Metasearchers are helpful tools for searching over multip...
Panagiotis G. Ipeirotis, Luis Gravano
SIGIR
2009
ACM
14 years 4 months ago
Compressing term positions in web indexes
Large search engines process thousands of queries per second on billions of pages, making query processing a major factor in their operating costs. This has led to a lot of resear...
Hao Yan, Shuai Ding, Torsten Suel
WWW
2007
ACM
14 years 10 months ago
MedSearch: a specialized search engine for medical information
People are thirsty for medical information. Existing Web search engines often cannot handle medical search well because they do not consider its special requirements. Often a medi...
Gang Luo, Chunqiang Tang, Hao Yang, Xing Wei
ICDE
2010
IEEE
428views Database» more  ICDE 2010»
14 years 9 months ago
Supporting Top-K Keyword Search in XML Databases
Keyword search is considered to be an effective information discovery method for both structured and semistructured data. In XML keyword search, query semantics is based on the con...
Liang Jeff Chen, Yannis Papakonstantinou