Sciweavers

» On Compressibility of Protein Sequences
JCB
2002
Using Substitution Matrices to Estimate Probability Distributions for Biological Sequences
Accurately estimating probabilities from observations is important for probabilistic-based approaches to problems in computational biology. In this paper we present a biologically...
Eleazar Eskin, William Stafford Noble, Yoram Singe...
ICDM
2009
IEEE
SLIDER: Mining Correlated Motifs in Protein-Protein Interaction Networks
Correlated motif mining (CMM) is the problem of finding overrepresented pairs of patterns, called motif pairs, in interacting protein sequences. Algorithmic solutions for CMM thereb...
Peter Boyen, Frank Neven, Dries Van Dyck, Aalt-Jan...
WISE
2002
Springer
Cluster-Based Delta Compression of a Collection of Files
Delta compression techniques are commonly used to succinctly represent an updated version of a file with respect to an earlier one. In this paper, we study the use of delta compr...
Zan Ouyang, Nasir D. Memon, Torsten Suel, Dimitre ...
BNCOD
2003
External Sorting with On-the-Fly Compression
Evaluating a query can involve manipulation of large volumes of temporary data. When the volume of data becomes too great, activities such as joins and sorting must use disk, and ...
John Yiannis, Justin Zobel
SSDBM
2003
IEEE
PiQA: An Algebra for Querying Protein Data Sets
Life science researchers frequently need to query large protein data sets in a variety of different ways. Protein data sets have a rich structure that includes its primary structu...
Sandeep Tata, Jignesh M. Patel