Sciweavers

32 search results - page 4 / 7
» Improving Data Cleaning Quality Using a Data Lineage Facilit...
Sort
View
WEBDB
2001
Springer
137views Database» more  WEBDB 2001»
13 years 12 months ago
Using Database Technology to Improve Performance of Web Proxy Servers
In this paper, we propose to use database technology to improve performance of web proxy servers. We view the cache at a proxy server as a web warehouse with data organized in a h...
Kai Cheng, Yahiko Kambayashi, Mukesh K. Mohania
ACIIDS
2010
IEEE
170views Database» more  ACIIDS 2010»
13 years 5 months ago
On the Effectiveness of Gene Selection for Microarray Classification Methods
Microarray data usually contains a high level of noisy gene data, the noisy gene data include incorrect, noise and irrelevant genes. Before Microarray data classification takes pla...
Zhongwei Zhang, Jiuyong Li, Hong Hu, Hong Zhou
LREC
2010
178views Education» more  LREC 2010»
13 years 9 months ago
Data Issues in English-to-Hindi Machine Translation
Statistical machine translation to morphologically richer languages is a challenging task and more so if the source and target languages differ in word order. Current state-of-the...
Ondrej Bojar, Pavel Stranák, Daniel Zeman
ER
2003
Springer
550views Database» more  ER 2003»
14 years 20 days ago
A UML Based Approach for Modeling ETL Processes in Data Warehouses
Data warehouses (DWs) are complex computer systems whose main goal is to facilitate the decision making process of knowledge workers. ETL (Extraction-Transformation-Loading) proces...
Juan Trujillo, Sergio Luján-Mora
ICDE
2009
IEEE
121views Database» more  ICDE 2009»
14 years 9 months ago
Large-Scale Deduplication with Constraints Using Dedupalog
We present a declarative framework for collective deduplication of entity references in the presence of constraints. Constraints occur naturally in many data cleaning domains and c...
Arvind Arasu, Christopher Ré, Dan Suciu