Sciweavers

1260 search results - page 187 / 252
» Data Quality in Genome Databases
Sort
View
DBISP2P
2008
Springer
124views Database» more  DBISP2P 2008»
13 years 9 months ago
Exploiting Distribution Skew for Scalable P2P Text Clustering
K-Means clustering is widely used in information retrieval and data mining. Distributed K-Means variants have already been proposed, but none of the past algorithms scales to large...
Odysseas Papapetrou, Wolf Siberski, Fabian Leitrit...
RTDB
1996
84views more  RTDB 1996»
13 years 9 months ago
Performance-Polymorphic Execution of Real-Time Queries
We are developing an object-oriented real-time database system that includes a relationally complete query language. Unlike conventional query optimizers, our optimizer estimates ...
Thomas Padron-McCarthy, Tore Risch
ICDE
2010
IEEE
214views Database» more  ICDE 2010»
13 years 7 months ago
The Entity Name System: Enabling the web of entities
— We are currently witnessing an increasing interest in the use of the web as an information and knowledge source. Much of the information sought after in the web is in this case...
Heiko Stoermer, Themis Palpanas, George Giannakopo...
DKE
2008
109views more  DKE 2008»
13 years 7 months ago
Deterministic algorithms for sampling count data
Processing and extracting meaningful knowledge from count data is an important problem in data mining. The volume of data is increasing dramatically as the data is generated by da...
Hüseyin Akcan, Alex Astashyn, Hervé Br...
KDD
2001
ACM
163views Data Mining» more  KDD 2001»
14 years 8 months ago
Learning to recognize brain specific proteins based on low-level features from on-line prediction servers
During the last decade, the area of bioinformatics has produced an overwhelming amount of data, with the recently published draft of the human genome being the most prominent exam...
Henrik Boström, Joakim Cöster, Lars Aske...